Hong Chenchen
洪晨辰
ByteDance · Shanghai, China
Focused on machine learning systems, LLM inference optimization, and compiler infrastructure. Building tools that bridge the gap between high-level ML frameworks and efficient hardware execution.
About
I work at the intersection of machine learning systems and compiler infrastructure. My focus areas include LLM inference optimization, MLIR-based compilation pipelines, and building efficient bridges between high-level ML frameworks and domain-specific hardware. Currently at ByteDance, working on systems that make large-scale AI workloads run faster.
Interests
Publications
Projects
A production MLIR compiler targeting FT-Matrix with cost-model-driven optimization, PyTorch frontend, and a comprehensive benchmark framework achieving ~57x kernel speedup
An intelligent LaTeX citation assistant that automates reference discovery and BibTeX generation using LLM + NLP fusion