Changbin Zhang

张长彬

Ph.D. Candidate

The University of Hong Kong

Emailzhangchbin@gmail.com

Google ScholarGoogle Scholar

Image


I'm currently a final-year Ph.D. candidate working with Dr. Yujie Zhong and Prof. Kai Han at The University of Hong Kong (HKU) . Before that, I spent wonderful years at Nankai University (NKU), supervised by Prof. Ming-Ming Cheng and received my M.Eng degree in Computer Science.

I am currently working on research related to RL / Agentic RL, on-policy distillation (OPD), Multi-Agent Systems, and self-evolving agents. I have published 5 first-author and 2 co-first-author papers in CCF-A venues, including 1 ESI Highly Cited Paper (Top 1%) and 1 CVPR Highlight paper (Top 2.8%). My work has received over 2,300 citations on Google Scholar. I also won a 🥈 Silver Medal in the ACM-ICPC Asia Regional Contest. I am currently on the job market, seeking roles in post-training and agents.

我目前从事 RL / Agentic RL、on-policy distillation (OPD)、Multi-Agent Systems、self-evolving agents 等相关研究。我以第一作者(含共同一作)身份在 CCF-A 类会议上发表论文 7 篇(5 篇一作、2 篇共同一作),其中包含 1 篇 ESI 高被引论文(Top 1%)和 1 篇 CVPR Highlight 论文(Top 2.8%)。我在 Google Scholar 上累计获得超过 2300 次引用。我也曾获得 ACM-ICPC 亚洲区域赛 🥈 银牌。我目前正在求职,寻找 post-training 与 agent 相关的职位。

Publications

Visual Perception Foundation Model MoE Foundation Model Reinforcement Learning for MLLMs Think with visual primitives All

Experience

Honors & Awards