About Me
I am an algorithm engineer affiliated with Alibaba Group. I obtained M.Res. in Informatics at the University of Edinburgh under the supervision of prof. Shay Cohen. Prior to that, I interned at the Hong Kong University of Science and Technology under the supervision of Dr. Jie Fu. My interest lies in training agentic large language models (LLMs) by reinforcement learning (RL).
Google Scholar | GitHub | Hugging Face | Kaggle | X | Zhihu
Projects
- RL2: Ray Less Reinforcement Learning
Liked by Andrej Karpathy and reposted by Ying Sheng
Selected Publications
- Massive Editing for Large Language Models via Meta Learning
Chenmien Tan, Ge Zhang, and Jie Fu
ICLR’24
Competitions
- Learning Equality – Curriculum Recommendations
Ranking: 17/1057 = 1.6% - Google AI4Code – Understand Code in Python Notebooks
Ranking: 25/1135 = 2.2% - H&M Personalized Fashion Recommendations
Ranking: 45/2952 = 1.5%
Academic Services
Acknowledgement
I am lucky to work with many enthusiastic, intelligent, and hardworking peers, such as Yijun Yang@Edinburgh, Weipeng Zhang@Huawei, Jiahong Xie@SJTU, Xun Zhao@UCAS, Shengda Fan@RUC, Ge Zhang@ByteDance, Hanxu Hu@UZH, and Simon Yu@NEU. I learnt a lot from them. Foremost, I thank Chenxi Chen for her companionship. She is my greatest fortunate.