From Agentic LLMs to an Agentic Data System for Autonomous Data Science
Renmin University of China
Add to Calendar
2026-02-04 13:00:00
2026-02-04 14:00:00
America/New_York
From Agentic LLMs to an Agentic Data System for Autonomous Data Science
Abstract: Autonomous data science, a longstanding goal of the data community, aims to automate the entire data science pipeline for extracting insights from structured data. Existing approaches to applying large language models (LLMs) to autonomous data science are either domain-specific or rely on predefined pipelines, which limits their autonomy and adaptivity in end-to-end data science tasks. Recent advances in agentic LLMs create new opportunities to support autonomous planning, execution, and iteration over multi-step tasks. However, moving from agentic LLMs to true end-to-end autonomy for data science introduces new challenges and calls for a shift toward an agentic data system. In this talk, I will first present a vision of agentic data systems and highlight key research challenges, including autonomous pipeline orchestration, environment-aware iterative reasoning, and data-intensive execution environments. I will then discuss our recent work that begins to address these challenges and outline future directions.Bio: Ju Fan is a Professor at Renmin University of China. He received his Ph.D. from Tsinghua University and was previously a research fellow at the National University of Singapore. His research interests lie in intelligent data systems (AI4DB), with a current focus on building agentic data science systems that enable end-to-end autonomous data science. He has published over 70 papers in top conferences and journals, including SIGMOD, VLDB, ICDE, and TKDE. He served as Publication Chair for VLDB 2023 and 2024, and has been a Program Committee member for leading database conferences, including SIGMOD, VLDB, ICDE, and KDD. He also led the development of DeepAnalyze, an early end-to-end agentic LLM system for autonomous data science. His work has received the ACM SIGMOD 2024 Research Highlight Award and the ICDE 2025 Best Paper Runner-Up Award, and he is also a recipient of the ACM China Rising Star Award.----Please reach out to markakis@mit.edu for the Zoom password.
TBD