About Me

I am learning to scale reinforcement learning for large language models.

CV | Google Scholar | GitHub | Kaggle | Zhihu

Projects

Selected Publications