I am a research scientist at Adobe Research, San Jose, working on multimodal foundation models. My work focuses on making these models improve after pre-training, judge visual and language outputs reliably, and scale through better architectures and learning dynamics.
Before joining Adobe, I received my CS PhD from the University of Rochester, advised by Chenliang Xu, and my B.E. from the University of Electronic Science and Technology of China.
Internship advice: Interested PhD students can email me a CV and a brief research plan.
Most recent publications on Google Scholar.
‡ indicates equal contribution.
Full Resume in PDF.