[Thesis Defense] Yung-Sung Chuang: "Towards Factual and Trustworthy Large Language Models"

Speaker

Yung-Sung Chuang
MIT CSAIL

Host

James Glass
MIT CSAIL

Thesis Advisor: James Glass

Thesis Committee: Yoon Kim, Jacob Andreas

Calendar Invitation: http://people.csail.mit.edu/yungsung/defense.ics

Speaker's Website: https://yung-sung.github.io

Abstract: 

Large Language Models (LLMs) have transformed how we interact with information, yet hallucinations (plausible but factually incorrect outputs) remain a critical barrier to their deployment in high-stakes applications. This thesis presents a comprehensive approach to understanding and mitigating hallucinations across three fundamental dimensions of knowledge in AI systems: parametric, contextual, and attribution knowledge.

We identify that hallucinations arise from different failure modes requiring distinct solutions. First, models may fail to leverage parametric knowledge already encoded in their weights. We introduce DoLa (Decoding by Contrasting Layers), which amplifies factual knowledge by dynamically contrasting predictions across transformer layers, improving factuality without training or external knowledge. Second, in retrieval-augmented generation settings, models often fail to properly use provided context. We develop Lookback Lens, which analyzes attention patterns to detect and reduce hallucinations. Third, even when models generate correct content, users need verifiable evidence. We present SelfCite, a self-supervised alignment method that enables LLMs to provide accurate sentence-level citations through a reward design of context ablation. Together, these methods form a roadmap towards better AI systems, working towards systems that are not only capable but also reliable, transparent, and trustworthy.