~/about

Hey, I’m Bobby Cheng.

I work on reinforcement learning for LLMs, building the environments, training pipelines, and evaluation frameworks that let language models learn through interaction. My work spans self-play optimization, multi-agent reasoning benchmarks, and distributed RL infrastructure.


[Email] [X] [GitHub] [Google Scholar]

~/work