Understanding AI Systems at Scale: Applied Interpretability, Agent Robustness, and the Science of Model Behaviors

Speaker

Transluce

Host

MIT CSAIL, AI@MIT Reading Group

RSVP here: https://forms.gle/XpirPb17Q9HgoshP6

Join two Transluce researchers as they discuss their latest work and research vision. Transluce is a company building the public tech stack for understanding AI systems. Topics will include applied interpretability, scalable oversight, reinforcement learning, and discovering rare behaviors in language models.

Date and Time

September 18, 2025, 6:00 PM to 7:00 PM (America/New_York)

Location

TBD

Organizer & Contact

Zhening Li

zli11010@mit.edu

8579289515

Part of

AI@MIT Reading Group

Related Events

November 19

Integrating Formal and Informal Reasoning

November 05

Improving and Proving: Reinforcement Learning for Proof Shortening and Hierarchical Theorem Proving in Lean