Minglai Yang

I open black boxes to build trustworthy language models. 印章

prof_big.jpg

📧 minglai.yang@scale.com

📍 San Francisco, CA, USA

I am a Research Scientist at Scale AI logoScale AI. Earlier in 2026, I was a Senior Member of Technical Staff at Abaka AI logoAbaka AI. I received my B.S. in University of Arizona logoComputer Science from the University of Arizona (GPA: 4.0/4.0) in Fall 2025, completing my degree in just over 2 years.

My research focuses on building LLMs that are trustworthy: robust (EMNLP 25), explainable (TMLR) and useful (EMNLP 25). Ultimately, I’m interested in these two overarching questions:

  • 🔍 Deconstruction of LLMs: How can we open the black box to reveal the internal mechanisms?
  • 🛠️ Reconstruction toward Trustworthy LLMs: How do we translate mechanistic insight into models that are robust, explainable, and useful in practice?

During my undergraduate years, I was fortunate to conduct research co-advised by Profs. Mihai Surdeanu, Liangming Pan, Kobus Barnard and Steven Bethard, in CLULAB logoCLULAB, IVILAB logoIVILAB and ML4AI logoML4AI LAB. I also collaborated with Profs. Adarsh Pyarelal, William Yang Wang and Chicheng Zhang. As Founder & President of AI Club at UA logoAI Club at UA, I ran workshops, hosted invited speakers, and led industry collaborations—raising $14K+ to support student AI research and education.

In summer 2025, I was a research intern at Tsinghua University logoKnowledge Engineering Group (KEG), Tsinghua University, supervised by Prof. Juanzi Li, working on LLM reasoning mechanisms. Before that, I worked as a Machine Learning Engineer intern at CoreTechs logoCoreTechs.


news

Jul 03, 2026 AlignSAE was accepted to TMLR 🎉 — the action editor recommended “Accept as is”. Grateful to all my co-authors!
Jul 01, 2026 I officially joined Scale AI as a Research Scientist! 🎉
Jan 05, 2026 New chapter: I joined Abaka AI as a Senior Member of Technical Staff! 🚀
Oct 19, 2025 We took 2nd place at the Reddit Wildcat Hackathon 2025!
Oct 17, 2025 Honored to earn UA’s Top 10 Undergraduate Research Travel Grant 🎓—headed to my EMNLP oral; see you in Suzhou. ✈️
Aug 20, 2025 Both of my submissions were accepted to EMNLP 2025 Main (Oral) 🎉 (Acceptance Rate: 22.16%). Grateful to all my co-authors, with special thanks to Profs. Liangming Pan, Mihai Surdeanu and William Wang.
Jun 05, 2025 I will be a research intern at THUKEG, Department of CS in Tsinghua University this summer advised by Prof. Juanzi Li, focusing on reasoning mechanism.
May 09, 2025 Galileo Circle Scholar, University of Arizona — Top 0.8% academic award.
Feb 18, 2025 As President of the AI Club at the University of Arizona, I led the club to raise over $12,000.
Dec 03, 2024 Excited to receive an RAship! I’ll lead a project advised by Liangming Pan and collaborate with William Wang at UCSB NLP Group.
Nov 01, 2024 I’m featured in our department newsletter! Huge thanks to Rishu Singh for the shoutout.

selected publications

  1. AlignSAE: Concept-Aligned Sparse Autoencoders
    Minglai Yang*Xinyu GuoZhengliang Shi, Jinhe Bi, Steven BethardMihai Surdeanu*, and Liangming Pan*
    Transactions on Machine Learning Research (TMLR), 2026
  2. How Is LLM Reasoning Distracted by Irrelevant Context? An Analysis Using a Controlled Benchmark
    Minglai Yang*, Ethan Huang , Liang Zhang, Mihai SurdeanuWilliam Wang, and Liangming Pan*
    Oral Presentation
    EMNLP Main Conference , 2025
  3. CopySpec: Accelerating LLMs with Speculative Copy-and-Paste Without Compromising Quality
    Razvan-Gabriel Dumitru, Minglai YangVikas Yadav, and Mihai Surdeanu
    Oral Presentation
    EMNLP Main Conference , 2025