Thesis Defense | MIT CSAIL

Back to Events

Seminar Series

Thesis Defense

July 25

Thesis Defense: Programmable Architectural Support for Diverse Sparse Workloads - Ryan Lee

Part Of

Thesis Defense

1:00P

- 2:00P

Location

36-428

Add to Calendar 2025-07-25 13:00:00 2025-07-25 14:00:00 America/New_York Thesis Defense: Programmable Architectural Support for Diverse Sparse Workloads - Ryan Lee Defense Title: Programmable Architectural Support for Diverse Sparse WorkloadsSparsity is abundant in many workload domains, but presents challenges that results in under- utilization of the available resources in existing hardware. Sparse workloads exhibit irregular control-flow and long-latency memory accesses, starving the core of useful work, and perform fine-grained accesses leading to inefficient use of the available memory bandwidth.Prior work has proposed several software and hardware mechanisms to accelerate sparse workloads, but there has been a lack of a general technique that is applicable to the diverse set of applications in this domain. In particular, existing solutions have had limited support for workloads that concurrently read and update the underlying sparse data structure, such as dynamic graph applications and databases. Prior proposals have instead limited various dimensions of the applications they target in this space, such as restricting the formats they support (e.g., only hash tables) or constraining the types of concurrent operations (e.g., read- only), thereby limiting their applicability. In addition, prior work has insufficiently addressed the inefficient data transfer between compute and memory, instead opting to put expensive compute elements near memory or only supporting restricted forms of fine-grained accesses.This thesis shows that it is possible to design a general and programmable architecture that supports a wide range of sparse workloads. To this end, this thesis presents two hardware accelerators. First, Terminus adds a small hardware unit near each core that accelerate a wide range of data structures types and concurrent reads and updates to these structures, achieving a gmean of 7.4× speedup over a CPU baseline. Second, Gist enhances each DRAM chip with a flexible hardware unit that autonomously performs fine-grained scatter/gather operations for sparse workloads. This allows Gist to more efficiently use the memory bus by returning a compact stream of data, and achieves a gmean of 1.6× speedup over state-of-the-art support for sparse workloads.https://mit.zoom.us/j/8203717891Advisor: Professor Daniel Sanchez TBD

July 18

Thesis Defense: Optimizing Data Layouts for Evolving Cloud Table Storage

Siva Sudhir

MIT CSAIL

Part Of

Thesis Defense

2:00P

- 3:00P

Location

Add to Calendar 2025-07-18 14:00:00 2025-07-18 15:00:00 America/New_York Thesis Defense: Optimizing Data Layouts for Evolving Cloud Table Storage Modern data analytics platforms increasingly adopt disaggregated architectures, storing data in cost-effective cloud object stores. While this approach enables a clean separation of concerns, allowing each layer to be independently managed and scaled, it introduces significant performance bottlenecks due to expensive data movement. Effective data layouts, which organize data to minimize unnecessary data reads, are thus critical to achieving high query performance. However, existing techniques typically rely on manually specified layouts, collect limited metadata, or lack mechanisms to dynamically adapt to changing data and workloads.This thesis investigates adaptive, metadata-rich, expressive data layouts for cloud table storage. First, we introduce Pando, a correlation-aware layout technique that leverages rich metadata on query predicates to significantly improve data skipping. Next, we propose CopyRight, a partial replication strategy that selectively replicates subsets of data and optimizes each replica differently, efficiently serving heterogeneous query patterns. Finally, we describe Self-Organizing Data Containers (SDCs), a practical table storage layer for the cloud that incrementally reorganizes complex data layouts based on changes in data and workload distributions.-- Please email siva@csail.mit.edu or markakis@mit.edu for the Zoom password.  TBD

July 15

THESIS DEFENSE: Seeing Beyond Limits with Physics-Informed Priors

Part Of

Thesis Defense

2:00P

- 3:00P

Location

Add to Calendar 2025-07-15 14:00:00 2025-07-15 15:00:00 America/New_York THESIS DEFENSE: Seeing Beyond Limits with Physics-Informed Priors THESIS DEFENSE: Seeing Beyond Limits with Physics-Informed PriorsSpeaker: Yang LiuSpeaker Affiliation: MIT EECS & CSAILHost: Frédo DurandHost Affiliation: MIT EECS & CSAILDate: Tuesday, July 15, 2025Time: 2:00 PM to 3:00 PMLocation: 32-D463 (Star) or Zoom Link: https://mit.zoom.us/j/98534109114Abstract:Conventional imaging systems face inherent dimensionality and visibility limits, primarily because image sensors are typically two-dimensional, and light tends to diffuse on rough surfaces or scatter within complex media. In this talk, I will reframe imaging systems through the lens of optical encoding and neural decoding, presenting my key contributions aimed at transcending the traditional limits of dimensionality and visibility. The idea is modelling the forward physical process and iteratively optimizing it with deep denoisers as visual priors, where eventually the priors are physics-informed. First, I introduce Privacy Dual Imaging, which reveals the privacy risk that ambient light sensors embedded in most smart devices could capture images of the scene in front of the screen. This idea of seeing the invisible from subtle intensity fluctuations is inspired by George Orwell’s novel 1984, wherein Big Brother is watching you through a two-way telescreen, and it closely relates to incoherent lensless imaging and non-line-of-sight imaging. Second, I present Snapshot Compressive Imaging, which encodes multiple temporal, spectral, or angular frames into a single measurement captured by a standard two-dimensional sensor. By learning high-dimensional visual priors from image or video data, we can efficiently reconstruct the original higher-dimensional data cube at scale. Lastly, I show that large AI models, particularly diffusion models, can serve as generic visual priors for both cases and beyond. I aim to push the boundaries of imaging and sensing within relevant domains of AI for science and healthcare (with an example).Committee Members: Frédo Durand (advisor, MIT), William T. Freeman (MIT & Google), Kaiming He (MIT & Google)Relevant URL: https://mit.zoom.us/j/98534109114For more information please contact: Roger White <whiter@mit.edu>  TBD

June 02

[Thesis Defense] Personalizing Robot Assistance under Uncertainty about the Human

Shen Li

MIT

Part Of

Thesis Defense

9:00A

- 11:00A

Location

TBD

32-155

Add to Calendar 2025-06-02 9:00:00 2025-06-02 11:00:00 America/New_York [Thesis Defense] Personalizing Robot Assistance under Uncertainty about the Human Date: June 2Time: 9:00-11:00 AM ETLocation: 32-155Zoom: https://mit.zoom.us/j/9731989629Title: Personalizing Robot Assistance under Uncertainty about the HumanAbstractRobots have the potential to improve the quality of life by assisting with daily tasks, such as helping older adults and people with disabilities get dressed. But meaningful assistance requires personalization: each person has unique preferences, behaviors, and needs.A central challenge is that robots often operate under uncertainty about the human they are helping. This uncertainty may involve the person's preferences, hidden physical states, or reactions to assistance. If not properly addressed, such uncertainty can lead to ineffective, undesired, or even unsafe outcomes.This thesis asks: How should a robot behave when it is uncertain about the human? I present a unified framework for uncertainty-aware personalization in human-robot interaction, spanning three core components of robot intelligence: preference learning, state estimation, and motion planning.1. Preference learning: I introduce the first method that uses response time, a subtle but informative cognitive signal, as implicit feedback. By combining human choices with response times, robots can infer not only what a person prefers but also how strongly they prefer it. This reduces uncertainty and accelerates preference learning.2. State estimation: To support safe physical assistance when parts of the human body (e.g., the elbow) are occluded, I introduce a state estimator that models uncertainty in learned human dynamics and robot sensing. It constructs a geometric set (e.g., a 3D box) that reliably contains the true hidden human state, enabling safer and more precise robot behavior.3. Motion planning: When a robot is uncertain about future human motion, it may behave overly conservatively to avoid causing harm, resulting in ineffective assistance. To address this, I propose a relaxed safety formulation that allows the robot to either avoid collisions or make low-impact contact. This approach enables the robot to act more effectively while still maintaining safety under uncertainty.Together, these contributions lay a foundation for assistive robots that personalize their behavior while adapting to the uncertain and dynamic nature of human needs.Thesis Supervisor: Julie A. ShahCommittee Members: Julie A. Shah, Dylan Hadfield-Menell, Na (Lina) Li, Aude BillardThesis Readers: Vaibhav Unhelkar, Tariq IqbalContact: shenli@mit.edu TBD

May 29

[Thesis Defense] Generalizable Robot Manipulation through Unified Perception, Policy Learning, and Planning

Xiaolin Fang

MIT CSAIL

Part Of

Thesis Defense

10:00A

- 12:00P

Location

45-792

Add to Calendar 2025-05-29 10:00:00 2025-05-29 12:00:00 America/New_York [Thesis Defense] Generalizable Robot Manipulation through Unified Perception, Policy Learning, and Planning Abstract:Advancing robotic manipulation to achieve generalization across diverse goals, environments, and embodiments is a critical challenge in robotics research. While the availability of data and large-scale training has brought exciting progress in robotics manipulation, current methods often struggle with generalizing to unseen, unstructured environments and solving long-horizon tasks. In this thesis, I will present my contributions that bridge structured decision-making frameworks with learned perceptual and policy components to enable multi-step manipulation in partially observable environments. Specifically, I will talk about my work in 1) constructing a modular framework that estimates affordances using learned perceptual models with task and motion planning (TAMP) for object rearrangement in unstructured scenes, 2) learning generative diffusion models of robot skills, which can be composed to solve unseen combination of environmental constraints through infeference-time optimization, 3) leveraging large vision-language models (VLMs) in building task-oriented visual abstractions, allowing skills to generalize across different environments with only 5 to 10 demonstrations. Together, these approaches contribute to the generality and scalability of embodied agents towards solving real-world manipulation in unstructured environments.Thesis Committee: Leslie Kaelbling, Tomás Lozano-Pérez, Russ Tedrake TBD

May 12

Thesis Defense: Scaling Cooperative Intelligence via Inverse Planning and Probabilistic Programming

Tan Zhi-Xuan

MIT

Part Of

Thesis Defense

11:00A

- 12:30P

Location

46-3189

Building 46, Room 3189 (43 Vassar St)

Add to Calendar 2025-05-12 11:00:00 2025-05-12 12:30:00 America/New_York Thesis Defense: Scaling Cooperative Intelligence via Inverse Planning and Probabilistic Programming Thesis Defense: Scaling Cooperative Intelligence via Inverse Planning and Probabilistic ProgrammingPresenter: Tan Zhi-XuanPlease email xuan@mit.edu for Zoom linkHow can we build cooperative machines that model and understand human minds — machines that assist us with our goals, coordinate on shared plans, infer the intentions behind our words, and even learn our norms and values? In this talk, I will introduce a scalable Bayesian approach to building such systems via inverse planning and probabilistic programming. By combining online model-based planners and sequential Monte Carlo inference into a single architecture, Sequential Inverse Plan Search (SIPS), we can infer human goals from actions in faster-than-real-time, while scaling to environments with hundreds of possible goals and long planning horizons that have proved intractable for earlier methods. SIPS can additionally make use of large language models (LLMs) as likelihood functions within probabilistic programs, allowing us to build AI assistants and copilots that reliably infer human goals from ambiguous instructions, then provide assistance under uncertainty with much higher success rates than LLMs can on their own. By applying this Bayesian approach in many-agent environments, we are also able to design agents that rapidly learn cooperative social norms from others' behavior, achieving mutually beneficial outcomes with orders of magnitude less data than model-free deep RL. I will conclude by charting out how this research program could deliver a new generation of cooperative AI systems grounded in rational AI engineering, while illuminating the computational foundations of human cooperation and addressing fundamental challenges in building human-aligned AI.Thesis Committee: Vikash Mansinghka, Joshua Tenenbaum, Dylan Hadfield-Menell, Leslie Kaelbling  TBD

May 08

Mark Hamilton's Thesis Defense - Unsupervised Discovery of Structure in Complex Systems

Mark Hamilton

MIT and Microsoft

Part Of

Thesis Defense

3:00P

- 4:00P

Location

Stata Center - Hewlett Room (Gates Side)

Add to Calendar 2025-05-08 15:00:00 2025-05-08 16:00:00 America/New_York Mark Hamilton's Thesis Defense - Unsupervised Discovery of Structure in Complex Systems How does the human mind make sense of raw information without being taught how to see or hear? This thesis will explore how to build algorithms that can uncover interpretable structure from large collections of data like images and video without needing human annotations or labels. First, we will see how to build algorithms that can perform tasks like classifying every pixel of the world, localizing sound, and decoding natural language, just by watching unlabeled videos without any knowledge of text. Second, we will see how these ideas lead us to a new unifying theory of representation learning. In particular, I will show how 20+ common machine learning methods, such as dimensionality reduction, clustering, contrastive learning, and spectral methods emerge from a single unified equation. Finally, we will discuss how this unifying theory applies to our ongoing efforts to decode animal communication using large-scale, unsupervised, and interpretable learners. We will conclude with some preliminary analysis of the complex vocalizations of Atlantic Spotted Dolphins. TBD

Automatic Integration and Differentiation of Probabilistic Programs

Alexander Lew

MIT CSAIL

Part Of

Thesis Defense

2:15P

- 3:30P

Location

46-3310

Building 46 (43 Vassar St.), Room 3310

Add to Calendar 2025-05-08 14:15:00 2025-05-08 15:30:00 America/New_York Automatic Integration and Differentiation of Probabilistic Programs Automatic Integration and Differentiation of Probabilistic ProgramsPresenter: Alexander LewThesis Supervisors: Vikash K. Mansinghka and Joshua B. TenenbaumDate: May 8, 2025Time: 2:15pm ETLocation: Building 46, room 3310Please contact alexlew@mit.edu for a Zoom link.Abstract:By automating the error-prone math behind deep learning, systems such as TensorFlow and PyTorch have supercharged machine learning research, empowering hundreds of thousands of practitioners to rapidly explore the design space of neural network architectures and training algorithms. In this talk, I will show how new programming language techniques, especially generalizations of automatic differentiation, make it possible to generalize and extend such systems to support probabilistic models. Our automation is implemented as a suite of composable program transformations for integrating, differentiating, and deriving densities of probabilistic programs. These transformations are rigorously proven sound using new semantic techniques for reasoning about expressive probabilistic programs, and static types are employed to ensure important preconditions for soundness, eliminating large classes of implementation bugs. Providing a further boost, our tools can help users correctly implement fast, low-variance, unbiased estimators of integrals, gradients, and probability densities that are too expensive to compute exactly, enabling orders-of-magnitude speedups in downstream optimization and inference algorithms.To illustrate the value of these techniques, I’ll show how they have helped us experiment with new architectures that could address key challenges with today’s dominant AI models. In particular, I’ll showcase systems we’ve built for (1) auditable reasoning and learning in relational domains, enabling the detection of thousands of errors across millions of Medicare records, and (2) probabilistic inference over large language models, enabling small open models to outperform GPT-4 on several code generation and constrained generation benchmarks. TBD

May 07

Thesis Defense, Noah Golowich - Title: Theoretical Foundations for Learning in Games and Decision-Making

Part Of

Thesis Defense

2:15P

- 3:15P

Location

Add to Calendar 2025-05-07 14:15:00 2025-05-07 15:15:00 America/New_York Thesis Defense, Noah Golowich - Title: Theoretical Foundations for Learning in Games and Decision-Making Abstract: As learning algorithms become increasingly capable of acting autonomously, it is important to better understand the behavior that results from their interactions (1) amongst themselves and (2) with their environments. This talk will present work addressing each of these aspects:(1) A pervasive challenge in multi-agent learning settings, which spans both theory and practice and dates back decades, has been the failure of convergence for iterative algorithms such as gradient descent. Accordingly, a longstanding central question with broad relevance is: how quickly can we compute solution concepts, i.e., equilibria, in multi-agent settings? I will discuss results which address this question at several scales, starting with simpler normal-form games and building up to larger games such as extensive-form games.(2) To understand how agents can optimally act in dynamic environments, the framework of reinforcement learning (RL) is used. A notorious challenge in RL is partial observability of the environment, which is typically modeled using Partially Observable Markov Decision Processes (POMDPs). Many existing provable guarantees for POMDPs relied on computationally intractable oracles. I will present the first guarantees for end-to-end learning of a near-optimal policy under a simple condition on the environment known as observability.  TBD

May 02

[Thesis Defense - Ce Jin] Exploiting Additive Structure in Algorithm Design and Fine-Grained Complexity

Part Of

Thesis Defense

3:30P

- 5:30P

Location

Add to Calendar 2025-05-02 15:30:00 2025-05-02 17:30:00 America/New_York [Thesis Defense - Ce Jin] Exploiting Additive Structure in Algorithm Design and Fine-Grained Complexity Abstract:In this thesis, we investigate the fine-grained complexity of various algorithmic problems with an additive flavor, including 3SUM, Subset Sum, and their close relatives. We explore their connections to various areas, such as graph algorithms, discrete optimization, combinatorial pattern matching, and computational geometry. Our new results include improved algorithms and conditional lower bounds for a wide range of problems, answering multiple open questions from the literature:-Conditional lower bounds for graph problems:  We prove new lower bounds for 4-Cycle Listing and Approximate Distance Oracles conditioned on the 3SUM Hypothesis. As a key intermediate step, we show a fine-grained reduction from 3SUM to the special case of 3SUM where all pairwise sums of input numbers are distinct.-Combinatorial pattern matching:  We design improved algorithms for Text-to-Pattern Hamming Distances, Pattern Matching with Wildcards, and Geometric Pattern Matching, by drawing connections from 3SUM and sparse convolution.-Knapsack-type problems:  We obtain a pseudo-polynomial time algorithm for 0-1 Knapsack with (conditionally) near-optimal dependence on the maximum item weight, an improved approximation scheme for the counting problem #Knapsack, and improved exponential time algorithms for the total search problem Pigeonhole Equal Subset Sum.In order to obtain these results, we employ and develop techniques based on convolution algorithms and their extensions, as well as classic tools from additive combinatorics. Thesis Committee: Ryan Williams (advisor), Virginia Vassilevska Williams (advisor), and Mohsen Ghaffari TBD

(Thesis Defense) Building Intelligence that can Interact with the Physical World

Part Of

Thesis Defense

9:30A

- 10:30A

Location

Add to Calendar 2025-05-02 9:30:00 2025-05-02 10:30:00 America/New_York (Thesis Defense) Building Intelligence that can Interact with the Physical World Speaker: Johnson Tsun-Hsuan WangAffiliation: MIT EECS (CSAIL)Title: [Thesis Defense] Building Intelligence that can Interact with the Physical WorldDate: Friday, May 2nd 2025Time: 9:30 am EDTLocation: 32-G449 (Patil/Kiva)Zoom: https://mit.zoom.us/j/95448197150?pwd=W0uEtKXgUjoXawXp2cGWrIcsFGtGlO.1Abstract: Recent advances in Artificial Intelligence (AI) have demonstrated remarkable success in parsing, reasoning, and generating digital content across modalities such as natural language, speech, images, videos, and 3D data. However, these breakthroughs have yet to extend meaningfully beyond the digital realm into the physical world. Developing AI for physical interaction poses challenges such as limited grounding, scarce physical data, and high reliability demands in safety-critical settings.This talk outlines a holistic approach to physical AI—through the lenses of data, brain, and body. We begin with data, the foundation of learning, and introduce data-driven and knowledge-driven robot simulation that generates data to improve policy learning and to systematically evaluate and probe existing models. Next, we turn to the brain, focusing on how to bridge the internet-scale knowledge of digital AI with the physical world to improve generalization and interpretability. Finally, we examine the body—the morphological component of intelligence—demonstrating how pre-trained generative models, when integrated with physics-based simulation, can automate the design of robot bodies. Together, this talk explores how digital AI can be extended into the physical world through a comprehensive investigation of data, brain, and body – laying the groundwork for building physical AI.Committee:Prof. Daniela Rus, MIT CSAIL (Advisor)Prof. Sertac Karaman, MIT LIDSProf. Wojciech Matusik, MIT CSAIL TBD

February 25

[Thesis Defense] Steering Robots with Inference-Time Interactions

Felix Yanwei Wang

EECS/CSAIL

Part Of

Thesis Defense

12:00P

- 1:30P

Location

45-792

(the big glass room in the middle on the 7th floor)

Add to Calendar 2025-02-25 12:00:00 2025-02-25 13:30:00 America/New_York [Thesis Defense] Steering Robots with Inference-Time Interactions Date: Tuesday, February 25, 2025Time: 12:00 PM - 1:30 PMLocation: 45-792Zoom: https://mit.zoom.us/j/95052951960Abstract:Imitation learning has driven the development of generalist policies capable of autonomously solving multiple tasks. However, when a pretrained policy makes errors during deployment, there are limited mechanisms for users to steer its behavior. While collecting additional data for fine-tuning can address such issues, doing so for each downstream use case is inefficient at scale. My research proposes an alternative perspective: framing policy errors as task misspecifications rather than skill deficiencies. By enabling users to specify tasks unambiguously via interactions at inference-time, the appropriate skill for a given context can be retrieved without fine-tuning. Specifically, I propose (1) inference-time steering, which leverages human interactions for single-step task specification, and (2) task and motion imitation, which uses symbolic plans for multi-step task specification. These frameworks correct misaligned policy predictions without requiring additional training, maximizing the utility of pretrained models while achieving inference-time user objectives.Thesis Supervisor: Julie ShahCommittee Members: Leslie Kaelbling, Jacob Andreas, Dorsa SadighContact: felixw@mit.edu TBD

February 05

Thesis Defense: Designing Hardware Accelerators for Solving Sparse Linear Systems - Axel Feldmann

Part Of

Thesis Defense

3:00P

- 4:30P

Location

Add to Calendar 2025-02-05 15:00:00 2025-02-05 16:30:00 America/New_York Thesis Defense: Designing Hardware Accelerators for Solving Sparse Linear Systems - Axel Feldmann Solving systems of linear equations with sparse coefficient matrices is a key primitive that sits at the heart of many important numeric algorithms. Because of this primitive's importance, algorithm designers have spent many decades optimizing linear solvers for high performance hardware. However, despite their efforts, existing hardware has let them down. State-of-the-art linear solvers often utilize <1% of available compute throughput on existing architectures such as CPUs and GPUs.There are many different algorithms used to solve sparse linear systems. These algorithms are diverse and often have very different computational bottlenecks. These include low arithmetic intensity, fine-grained parallellism, common control dependences, and sparsity-induced load imbalance.This thesis studies the problem of designing hardware accelerators for sparse linear solvers. We propose three novel architectures that explore different parts of the design space. First, we introduce Spatula, an architecture designed to accelerate direct solvers. Then, we propose Azul, a hardware accelerator targeted at iterative solvers. Taken together, Spatula and Azul demonstrate significant speedups on both of the main classes of sparse linear solver algorithms. Finally, to show that our techniques are useful for end-to-end applications, we present Ōmeteōtl, an accelerator targeted at applications that use iterative solvers in their inner loop. Ōmeteōtl also shows that the techniques in this thesis generalize to sparse matrix computations beyond linear solvers.https://mit.zoom.us/j/98122373906 (no password) TBD

January 22

Thesis Defense: Taming Data Movement Overheads in Latency-Critical Cloud Services

Nikita Lazarev

CSAIL

Part Of

Thesis Defense

3:00P

- 4:30P

32-D463 (STAR)

Add to Calendar 2018-08-16 10:00:00 2018-08-16 11:00:00 America/New_York Thesis Defense: Below P vs NP: Fine-Grained Hardness for Big Data Problems Abstract: The theory of NP-hardness has been remarkably successful in identifying problems that are unlikely to be solvable in polynomial time. However, many other important problems do have polynomial-time algorithms, but large exponents in their runtime bounds can make them inefficient in practice. For example, quadratic-time algorithms, although practical on moderately sized inputs, can become inefficient on big data problems that involve gigabytes or more of data. Although for many data analysis problems no sub-quadratic time algorithms are known, any evidence of quadratic-time hardness has remained elusive. In this thesis we present hardness for several text analysis and machine learning tasks:* Lower bounds for edit distance, regular expression matching and other pattern matching and string processing problems.* Lower bounds for empirical risk minimization such as kernel support vectors machines and other kernel machine learning problems.All of these problems have polynomial time algorithms, but despite extensive amount of research, no near-linear time algorithms have been found. We show that, under a natural complexity-theoretic conjecture, such algorithms do not exist. We also show how these lower bounds have inspired the development of efficient algorithms for some variants of these problems. 32-D463 (STAR)