Models That Prove Their Own Correctness

Speaker

UC Berkeley

Host

Rahul Ilango

MIT

Abstract:
This talk introduces Self-Proving models, a new class of models that formally prove the correctness of their outputs via an Interactive Proof system. After reviewing some related literature, I will formally define Self-Proving models and their per-input (worst-case) guarantees. I will then present algorithms for learning these models and explain how the complexity of the proof system affects the complexity of the learning algorithms. Finally, I will show experiments where Self-Proving models are trained to compute the Greatest Common Divisor of two integers, and to prove the correctness of their results to a simple verifier.

No prior knowledge of autoregressive models or Interactive Proofs will be assumed of the listener. This is a joint work with Noga Amit, Shafi Goldwasser, and Guy Rothblum.

Add to Calendar 2024-11-20 16:00:00 2024-11-20 17:00:00 America/New_York Models That Prove Their Own Correctness Abstract: This talk introduces Self-Proving models, a new class of models that formally prove the correctness of their outputs via an Interactive Proof system. After reviewing some related literature, I will formally define Self-Proving models and their per-input (worst-case) guarantees. I will then present algorithms for learning these models and explain how the complexity of the proof system affects the complexity of the learning algorithms. Finally, I will show experiments where Self-Proving models are trained to compute the Greatest Common Divisor of two integers, and to prove the correctness of their results to a simple verifier.No prior knowledge of autoregressive models or Interactive Proofs will be assumed of the listener. This is a joint work with Noga Amit, Shafi Goldwasser, and Guy Rothblum. G575

Organizer & Contact

Rahul Ilango

rilango@csail.mit.edu

Part of

Algorithms and Complexity (A&C) 2024 - 2025

Models That Prove Their Own Correctness

Speaker

Host

November 20 2024

Location

Organizer & Contact

Part of

May 14

Catalytic Computing: A Primer

May 01

Understanding the Trade-Offs Between Hallucinations and Mode Collapse in Language Generation

Models That Prove Their Own Correctness

Speaker

Host

November 20 2024

Location

Organizer & Contact

Part of

Related Events

May 14

Catalytic Computing: A Primer

May 01

Understanding the Trade-Offs Between Hallucinations and Mode Collapse in Language Generation