About Me
I am a technical staff at Alibaba Group.
CV | Google Scholar | GitHub | Kaggle | Zhihu
Projects
- RL2: Ray Less Reinforcement Learning
Lead developer, >1.1K GitHub stars
Publications
GEM: A Gym for Agentic LLMs
Co-author, ICLR’26 and Outstanding in SEA Workshop of NeurIPS’25Massive Editing for Large Language Models via Meta Learning
First author, >150 citations, top 10% cited in ICLR’24Learning Rewards to Optimize Global Performance Metrics in Deep RL
Co-author, AAMAS’23CVaR-Regret Bounds for Multi-armed Bandits
First author, ACML’22
