Hong Chenchen
洪晨辰
LLM Infra @ RedNote · Hangzhou, China
LLM Infra engineer focused on inference acceleration, reinforcement learning, and diffusion LLMs. Core committer of SGLang-Omni and contributor to SGLang.
About
I work at the intersection of machine learning systems and large-scale model serving. My focus areas include inference for multimodal and language models, and reinforcement learning. Currently at RedNote, working on systems that make large-scale AI workloads run faster.
Interests
Publications
Projects
A production MLIR compiler targeting FT-Matrix with cost-model-driven optimization, PyTorch frontend, and a comprehensive benchmark framework achieving ~57x kernel speedup
An intelligent LaTeX citation assistant that automates reference discovery and BibTeX generation using LLM + NLP fusion