About Me
I am an algorithm engineer in Alibaba Group. I obtained M.Res. in Informatics at the University of Edinburgh under the supervision of prof. Shay Cohen. Prior to that, I interned at the Hong Kong University of Science and Technology under the supervision of Dr. Jie Fu. Before that, I obtained B.Sc. in Applied Mathematics at the University of Nottingham Ningbo China.
Projects
RL2: Ray Less Reinforcement Learning [Code] [Blog]
Liked by Andrej Karpathy and reposted by Ying Sheng
GEM: A Gym for Agentic LLMs [Paper] [Code]
Oral in NeurIPS’25 SEA Workshop and Spotlight in NeurIPS’25 MTI-LLM Workshop