I am a research scientist at the Gen AI team of Meta. My day-to-day work aims at building open LLMs with leading performance. Before joining Meta, I received my PhD in NLP from University of California, Santa Barbara and my bachelor degree from University of Science and Technology of China.
My recent research interests lie primarily in long-context LLMs targeting tasks that necessitate dense information flow and highly skilled expertise. This line of work typically involves meticulous data recipes, efficient parallel training framework, hardware-aware model architecture designs that go beyond theoretical speedups and dedicated alignment methods. Please do not hessitate to reach out if you are also interested in these topics!