ML Tea: Planning and Problem-Solving with General, Scalable Neuro-Symbolic Models
Speaker: Yongchao Chen
Title: Planning and Problem-Solving with General, Scalable Neuro-Symbolic Models
Abstract: Foundation models excel at large-scale learning but underuse their potential for search, which is critical for digital agents and physical robots that require symbolic reasoning and optimization. Both pure text-based reasoning in LLMs and waypoint-based task-and-motion planning (TAMP) in vision-language-action models (VLAs) remain limited in reliability and efficiency. To address this, I explore augmenting LLMs with symbolic computation through three approaches: integrating external planners and tools, steering models to generate code as planners, and unifying text, code, and tool-use modes. Built with supervised fine-tuning, multi-stage curriculum learning with GRPO, and tool-augmented test-time scaling, our models CodeSteer and R1-Code-Interpreter, downloaded over 300k times on Hugging Face, demonstrate strong performance. Our TUMIX method further boosts Gemini-2.5-Pro from 21.6 to 34.1 on the Humanity's Last Exam (HLE) benchmark.
Speaker Bio: Yongchao is a fifth-year Ph.D. student in Electrical Engineering at Harvard SEAS and MIT LIDS. His research focuses on Neuro-Symbolic Foundation Models for Reasoning and Planning; he is advised by Prof. Chuchu Fan and Prof. Nicholas Roy at MIT and co-advised by Prof. Na Li at Harvard. Yongchao has conducted research at Google Research and DeepMind, Microsoft Research, and the MIT-IBM Watson AI Lab. His work has been featured in the MIT News Spotlight.