2025-03-19 16:00-17:00 (America/New_York)
Constructive Criticisms of Complexity: Unifying Proofs and Algorithms
For decades, fundamental questions in complexity theory have remained wide open. A classic counting argument by Shannon shows that most Boolean functions on n bits require circuits of size 2^n/n, yet we lack even superlinear circuit lower bounds for explicit functions. This raises a natural question: can we make these counting arguments constructive?

In this talk, we explore constructivity through the lens of mathematical logic. Weak fragments of Peano Arithmetic, known as theories of Bounded Arithmetic, characterize "efficient" reasoning and exhibit a constructive property: proofs of existence correspond to efficient search algorithms. In particular, Buss's seminal work introduced the theories S^i_2, which capture reasoning at the i-th level of the polynomial hierarchy. We focus on S^1_2, a theory powerful enough to formalize much of modern complexity theory, from the Cook-Levin theorem to the PCP theorem.

Our main results establish that: (1) proving known, non-constructive lower bounds within S^1_2 would yield breakthrough lower bounds; (2) under a reasonable conjecture in logic, certain circuit lower bounds are unprovable in S^1_2.

These findings build on and unify previous work on constructive complexity, which traditionally employs the algorithmic notion of efficient refuters rather than bounded arithmetic. Additionally, our work provides the first conditional separation between S^1_2 and APC, a theory corresponding to ZPP reasoning.

This talk is based on joint work with Marco Carmosino.
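For intuition, a back-of-the-envelope version of the counting argument (standard material, not a result of the talk): a circuit of size $s$ can be written down in $O(s \log s)$ bits, so
\[
\#\{\text{functions computed by size-}s\text{ circuits}\} \le 2^{O(s \log s)},
\qquad
\#\{f : \{0,1\}^n \to \{0,1\}\} = 2^{2^n}.
\]
For $s = 2^n/(cn)$ with $c$ a large enough constant, $O(s \log s) \le O(2^n/c) < 2^n$, so size-$s$ circuits compute a vanishing fraction of all functions; hence almost every function requires circuits of size $\Omega(2^n/n)$. The argument counts but never exhibits any such function, which is exactly the non-constructivity the talk addresses.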
TBD
March 19
March 12
Extractors for Samplable Distributions with Low Min-Entropy
Northeastern University
2025-03-12 16:00-17:00 (America/New_York)
Trevisan and Vadhan (FOCS 2000) introduced the notion of (seedless) extractors for samplable distributions. They showed that under a strong complexity-theoretic hardness assumption, there are extractors for samplable distributions with large min-entropy k = (1 − γ) · n, for some small constant γ > 0. Recent work by Ball, Goldin, Dachman-Soled and Mutreja (FOCS 2023) weakened the hardness assumption. However, since the original paper by Trevisan and Vadhan, there has been no improvement in the min-entropy threshold k.

In this paper we give a construction of extractors for samplable distributions with low min-entropy k = n^{1−γ} for some constant γ, and in particular we achieve k < n/2 (which is a barrier for the construction of Trevisan and Vadhan). Our extractors are constructed under a hardness assumption that is weaker than the one used by Trevisan and Vadhan, and stronger than that used by Ball, Goldin, Dachman-Soled and Mutreja: specifically, that there exists a constant β > 0 and a problem in E = DTIME(2^{O(n)}) that cannot be computed by size 2^{βn} circuits that have an oracle to Σ_5.

Our approach builds on the technique of Trevisan and Vadhan, while introducing new objects and ideas. We introduce and construct two objects: an errorless (seedless) condenser for samplable distributions, and functions that are hard to compute on every samplable distribution with sufficient min-entropy. We use techniques by Shaltiel and Silbak (STOC 2024), as well as additional tools and ideas, to construct the two new objects under the hardness assumption. We then show how to modify the construction of Trevisan and Vadhan, using these new objects, so that the barrier of k = n/2 can be bypassed and we achieve an extractor for samplable distributions with low min-entropy.

This is joint work with Marshall Ball and Ronen Shaltiel.
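For reference, the standard notions in play (definitions only, with the paper's parameters suppressed): the min-entropy of a distribution $X$ is
\[
H_\infty(X) = \min_{x} \log_2 \frac{1}{\Pr[X = x]},
\]
and a function $\mathrm{Ext} : \{0,1\}^n \to \{0,1\}^m$ is a seedless extractor with error $\epsilon$ for a class $\mathcal{C}$ of distributions if, for every $X \in \mathcal{C}$ with $H_\infty(X) \ge k$, the output $\mathrm{Ext}(X)$ is $\epsilon$-close in statistical distance to uniform on $\{0,1\}^m$. Here $\mathcal{C}$ is the class of distributions samplable by small circuits, and $k$ is the min-entropy threshold discussed above.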
TBD
March 05
Correlation Clustering and (De)Sparsification: Graph Sketches Can Match Classical Algorithms
Harvard
2025-03-05 16:00-17:00 (America/New_York)
Correlation clustering is a widely-used approach for clustering large data sets based only on pairwise similarity information. In recent years, there has been a steady stream of better and better classical algorithms for approximating this problem. Meanwhile, another line of research has focused on porting the classical advances to various sublinear algorithm models, including semi-streaming, Massively Parallel Computation (MPC), and distributed computing. Yet, these latter works typically rely on ad-hoc approaches that do not necessarily keep up with advances in improved approximation ratios achieved by classical algorithms. This raises the following natural question: can the gains made by classical algorithms for correlation clustering be ported over to sublinear algorithms in a black-box manner? We answer this question in the affirmative by introducing the paradigm of graph de-sparsification, which may be of independent interest.

Joint work with Sepehr Assadi and Sanjeev Khanna.
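As background on the classical side, here is a minimal sketch of the Pivot (KwikCluster) algorithm of Ailon, Charikar, and Newman, an expected 3-approximation on complete +/- instances. This is illustrative only; it is not the de-sparsification technique of the talk.

```python
import random

def pivot_correlation_clustering(nodes, similar):
    """Classical Pivot / KwikCluster: repeatedly pick a random pivot and
    cluster it with all remaining nodes marked similar to it.
    `similar(u, v)` returns True for a '+' (similar) pair, False for '-'.
    Gives an expected 3-approximation on complete instances."""
    remaining = list(nodes)
    random.shuffle(remaining)          # random pivot order
    clustered, clusters = set(), []
    for pivot in remaining:
        if pivot in clustered:
            continue
        cluster = [pivot] + [v for v in remaining
                             if v not in clustered and v != pivot
                             and similar(pivot, v)]
        clustered.update(cluster)
        clusters.append(cluster)
    return clusters

# toy usage: two groups of mutually similar items
sim = lambda u, v: (u < 3) == (v < 3)
print(pivot_correlation_clustering(range(6), sim))
```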
TBD
February 20
2025-02-20 16:00-17:00 (America/New_York)
Locally Sampleable Uniform Symmetric Distributions
We characterize the power of constant-depth Boolean circuits in generating uniform symmetric distributions. Let f: {0,1}^m -> {0,1}^n be a Boolean function where each output bit of f depends only on O(1) input bits. Assume the output distribution of f on uniform input bits is close to a uniform distribution D with a symmetric support. We show that D is essentially one of the following six possibilities: (1) the point distribution on 0^n, (2) the point distribution on 1^n, (3) uniform over {0^n, 1^n}, (4) uniform over strings with even Hamming weight, (5) uniform over strings with odd Hamming weight, and (6) uniform over all strings. This confirms a conjecture of Filmus, Leigh, Riazanov, and Sokolov (RANDOM 2023).

Joint work with Daniel Kane and Anthony Ostuni.
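To make case (4) concrete, here is a standard 2-local sampler (a toy illustration, not taken from the paper) whose output is exactly uniform over even-Hamming-weight strings:

```python
import random
from collections import Counter

def local_sampler(m):
    """Each output bit y_i = x_i XOR x_{i+1 mod m} depends on 2 input bits.
    The map is 2-to-1 (x and its complement collide), and its image is
    exactly the strings of even Hamming weight, hit uniformly -- case (4)."""
    x = [random.randrange(2) for _ in range(m)]
    return tuple(x[i] ^ x[(i + 1) % m] for i in range(m))

counts = Counter(local_sampler(4) for _ in range(40000))
assert all(sum(y) % 2 == 0 for y in counts)   # only even-weight outputs
print(len(counts), "distinct outputs (expect 2^(4-1) = 8)")
```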
TBD
February 19
Parallelizing Sequential Iterative Algorithms
UC Riverside
2025-02-19 16:00-17:00 (America/New_York)
This talk will delve into our decade-long research on parallelizing sequential iterative algorithms, such as the greedy algorithms and dynamic programming covered in undergraduate algorithms courses. The core concept is to identify the inherent computational structures and develop general frameworks for their parallel execution. I will overview the key concepts and techniques proposed throughout this research, including dependence graphs, asynchronous execution, phase parallelism, and the cordon algorithm. Illustrative examples will include random permutation, maximal independent set (MIS), and dynamic programming applications. The talk will cover results from several papers, including a JACM '20 paper, Outstanding Papers at SPAA '20 and SPAA '24, and a Best Paper at ESA '23.
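As a toy illustration of the dependence-graph idea for MIS (a sketch in the spirit of round-based parallel greedy, not the talk's frameworks): under a random priority order, every undecided vertex whose priority beats all undecided neighbors can be resolved "in parallel", and the number of such rounds is O(log n) with high probability.

```python
import random

def rounds_mis(adj):
    """Simulate parallel execution of sequential greedy MIS under random
    priorities: each round, every undecided vertex that is a local priority
    minimum among undecided neighbors joins the MIS simultaneously."""
    prio = {v: random.random() for v in adj}
    undecided, mis, rounds = set(adj), set(), 0
    while undecided:
        rounds += 1
        winners = {v for v in undecided
                   if all(prio[v] < prio[u] for u in adj[v] if u in undecided)}
        mis |= winners
        undecided -= winners                                # winners decided
        undecided -= {u for v in winners for u in adj[v]}   # neighbors excluded
    return mis, rounds

# toy usage: a 6-cycle
adj = {i: [(i - 1) % 6, (i + 1) % 6] for i in range(6)}
print(rounds_mis(adj))
```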
TBD
February 12
A&C Seminar - Theory and applications of complete Min-CSPs: A case study in Correlation Clustering
University of Michigan
2025-02-12 16:00-17:00 (America/New_York)
Abstract: We will discuss recent algorithmic results on fundamental problems in data science and clustering, including Correlation Clustering, Low-Rank Approximation, and Metric Distance Violation. A unifying theme will be their connections to Minimum Constraint Satisfaction Problems (CSPs) in complete instances. Starting from the rich theory of dense Max CSPs with several algorithmic tools (e.g., convex hierarchies, random sampling, the regularity lemma), we show how this theory can be augmented to handle minimization objectives in applied domains. These efforts also inspired a systematic study of Min-CSPs in complete instances.

As a technical example, we will highlight our results for Correlation Clustering, one of the most well-studied graph clustering problems. Bypassing the previous barrier of a 2-approximation based on the standard LP, we obtain a 1.44-approximation, first using the Sherali-Adams hierarchy, later also matched by a random sampling technique.
TBD
December 11
2024-12-11 16:00-17:00 (America/New_York)
New Breakthrough in Matrix Multiplication
Abstract: Fast matrix multiplication is one of the most fundamental problems in computer science. We present new algorithms that improve the time complexity of matrix multiplication to n^2.371339, surpassing the previous bound of n^2.372860. Our result is the largest improvement to the matrix multiplication exponent since 2010. In this talk, we will introduce the modern framework for matrix multiplication algorithms and highlight the key ideas in our algorithms.
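As context for that framework (illustration only; this is Strassen's classical algorithm with exponent log2 7 ≈ 2.807, not the new result), all improvements ultimately come from multiplying blocks with fewer scalar multiplications:

```python
import numpy as np

def strassen(A, B):
    """Strassen's algorithm: 7 recursive multiplications instead of 8,
    giving exponent log2(7) ~ 2.807. Assumes square n x n inputs with n a
    power of two."""
    n = A.shape[0]
    if n <= 64:                       # cutoff: fall back to standard product
        return A @ B
    k = n // 2
    A11, A12, A21, A22 = A[:k, :k], A[:k, k:], A[k:, :k], A[k:, k:]
    B11, B12, B21, B22 = B[:k, :k], B[:k, k:], B[k:, :k], B[k:, k:]
    M1 = strassen(A11 + A22, B11 + B22)
    M2 = strassen(A21 + A22, B11)
    M3 = strassen(A11, B12 - B22)
    M4 = strassen(A22, B21 - B11)
    M5 = strassen(A11 + A12, B22)
    M6 = strassen(A21 - A11, B11 + B12)
    M7 = strassen(A12 - A22, B21 + B22)
    top = np.hstack([M1 + M4 - M5 + M7, M3 + M5])
    bot = np.hstack([M2 + M4, M1 - M2 + M3 + M6])
    return np.vstack([top, bot])

A, B = np.random.rand(128, 128), np.random.rand(128, 128)
assert np.allclose(strassen(A, B), A @ B)
```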
G575
December 04
2024-12-04 16:00-17:00 (America/New_York)
From Distinguishers To Predictors and Beyond
Abstract: A central tool for constructing pseudorandom generators has been the “reconstruction paradigm”: proofs that if a generator fails to fool a circuit C, we can compute a supposedly-hard function f more efficiently with the help of C. Going from C to a small circuit for f crucially uses Yao's transformation of distinguishers to next-bit predictors. In fact, this transformation is the “bottleneck” in many results in pseudorandomness.

A recent line of work has investigated the complexity of this transformation: how hard is it to turn distinguishers into predictors? Can we do it more efficiently? And what can we get out of it? I'll describe recent work that partially answers these questions, and obtains new win-win results in space complexity.

Based on joint works with Dean Doron, Jiatu Li, Roei Tell, and Ryan Williams.
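For reference, the transformation in question (Yao's classical hybrid argument, not the new results): suppose $D$ distinguishes $G(U_s) = y_1 \cdots y_n$ from uniform with advantage $\epsilon$, and define the hybrids
\[
H_i = y_1 \cdots y_i\, u_{i+1} \cdots u_n \qquad (H_0 = U_n,\ H_n = G(U_s)).
\]
Since the total gap $\epsilon$ is a telescoping sum of $n$ consecutive gaps, some index $i$ satisfies $\Pr[D(H_{i+1})=1] - \Pr[D(H_i)=1] \ge \epsilon/n$. The predictor, given $y_1 \cdots y_i$, samples uniform $u_{i+1}, \dots, u_n$, runs $D$, and outputs $u_{i+1}$ if $D$ accepts and $1 - u_{i+1}$ otherwise; a standard calculation shows it guesses $y_{i+1}$ with probability at least $\tfrac{1}{2} + \tfrac{\epsilon}{n}$. Note that the predictor must run $D$ and sample the $u_j$'s, which is the efficiency bottleneck the talk revisits.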
G575
November 20
Models That Prove Their Own Correctness
UC Berkeley
2024-11-20 16:00-17:00 (America/New_York)
Abstract: This talk introduces Self-Proving models, a new class of models that formally prove the correctness of their outputs via an Interactive Proof system. After reviewing some related literature, I will formally define Self-Proving models and their per-input (worst-case) guarantees. I will then present algorithms for learning these models and explain how the complexity of the proof system affects the complexity of the learning algorithms. Finally, I will show experiments where Self-Proving models are trained to compute the Greatest Common Divisor of two integers, and to prove the correctness of their results to a simple verifier.

No prior knowledge of autoregressive models or Interactive Proofs will be assumed of the listener. This is joint work with Noga Amit, Shafi Goldwasser, and Guy Rothblum.
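As a toy illustration of the flavor of proving correctness to a simple verifier (a sketch of my own, not the paper's learned-model setup): for GCD, a prover can attach Bézout coefficients as a certificate that a simple verifier checks with two divisions and one identity.

```python
def prove_gcd(a, b):
    """Extended Euclid: return (g, u, v) with u*a + v*b == g."""
    if b == 0:
        return a, 1, 0
    g, u, v = prove_gcd(b, a % b)
    return g, v, u - (a // b) * v

def verify_gcd(a, b, g, u, v):
    """Accept iff g divides both inputs and u*a + v*b == g.
    Divisibility makes g a common divisor; the Bezout identity forces every
    common divisor of a and b to divide g, so g must be the gcd."""
    return g > 0 and a % g == 0 and b % g == 0 and u * a + v * b == g

g, u, v = prove_gcd(240, 46)
assert verify_gcd(240, 46, g, u, v) and g == 2
```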
G575
November 13
2024-11-13 16:00-17:00 (America/New_York)
Gaussian Polytope Approximation
Abstract: We study the approximability of high-dimensional convex sets by intersections of halfspaces, where the approximation quality is measured with respect to the standard Gaussian distribution and the complexity of an approximation is the number of halfspaces used. We establish a range of upper and lower bounds both for general convex sets and for specific natural convex sets that are of particular interest. We rely on techniques from many different areas, including classical results from convex geometry, Cramér-type bounds from probability theory, and—perhaps surprisingly—a range of topics from computational complexity theory, including computational learning theory, unconditional pseudorandomness, and the study of influences and noise sensitivity in the analysis of Boolean functions. Based on joint work (https://arxiv.org/abs/2311.08575) with Anindya De and Rocco Servedio.
D507
November 06
2024-11-06 16:00-17:00 (America/New_York)
High-Temperature Gibbs States are Unentangled and Efficiently Preparable
Abstract: We show that thermal states of local Hamiltonians are separable above a constant temperature. Specifically, for a local Hamiltonian $H$ on a graph with degree $d$, its Gibbs state at inverse temperature $\beta$, denoted by $\rho = e^{-\beta H}/\mathrm{tr}(e^{-\beta H})$, is a classical distribution over product states for all $\beta < 1/(cd)$, where $c$ is a constant. This proof of sudden death of thermal entanglement resolves the fundamental question of whether many-body systems can exhibit entanglement at high temperature.

Moreover, we show that we can efficiently sample from the distribution over product states. In particular, for any $\beta < 1/(cd^2)$, we can prepare a state $\varepsilon$-close to $\rho$ in trace distance with a depth-one quantum circuit and $\mathrm{poly}(n, 1/\varepsilon)$ classical overhead.
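A small numerical illustration of the phenomenon (a toy check of my own, not the paper's techniques): for two qubits the Peres-Horodecki criterion exactly characterizes separability, so one can watch the Gibbs state of a tiny Hamiltonian become separable as the temperature rises ($\beta \to 0$).

```python
import numpy as np

# Pauli operators and a tiny 2-qubit transverse-field Ising Hamiltonian
X = np.array([[0.0, 1.0], [1.0, 0.0]]); Z = np.diag([1.0, -1.0]); I = np.eye(2)
H = np.kron(Z, Z) + np.kron(X, I) + np.kron(I, X)

def gibbs(beta):
    """rho = e^{-beta H} / tr(e^{-beta H}), via eigendecomposition."""
    w, V = np.linalg.eigh(H)
    rho = (V * np.exp(-beta * w)) @ V.T
    return rho / np.trace(rho)

def separable_2qubit(rho):
    """Peres-Horodecki: for 2 qubits, rho is separable iff its partial
    transpose (on the second qubit) has no negative eigenvalue."""
    pt = rho.reshape(2, 2, 2, 2).transpose(0, 3, 2, 1).reshape(4, 4)
    return np.linalg.eigvalsh(pt).min() >= -1e-12

for beta in [0.1, 0.5, 1.0, 2.0]:
    print(f"beta={beta}: separable={separable_2qubit(gibbs(beta))}")
```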
October 24
2024-10-24 16:00-17:00 (America/New_York)
Hypothesis selection with computational constraints
Abstract: With the ever-growing volume of data, understanding the computational aspects of statistical inference is increasingly critical. A key question arises: Can we develop algorithms that are both fast and memory-efficient to tackle these challenges? In this talk, we focus on the computational aspects of Hypothesis Selection, a fundamental problem in learning theory and statistics. The task is to select a distribution from a finite set of candidate distributions that best matches the underlying distribution of the given dataset. This talk will delve into the hypothesis selection problem under constraints of memory and time. We will explore how to achieve a nearly optimal tradeoff between memory usage and sample complexity, as well as methods to attain optimal accuracy using algorithms with near-optimal time complexity. This talk is based on joint work with Mark Bun and Adam Smith.
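For background, the classical (non-memory-constrained) approach to hypothesis selection is a Scheffé-style tournament. Below is a minimal sketch for discrete distributions; note it uses a simplified sequential elimination, whereas the classical analysis runs all pairwise comparisons, and it is not the memory- or time-efficient algorithm of the talk.

```python
import random

def scheffe_winner(p, q, samples):
    """Compare hypotheses p, q (dicts: outcome -> probability) on the
    Scheffe set A = {x : p(x) > q(x)}: the hypothesis whose mass on A is
    closer to the empirical mass of A wins."""
    A = {x for x in set(p) | set(q) if p.get(x, 0) > q.get(x, 0)}
    emp = sum(s in A for s in samples) / len(samples)
    pA = sum(p.get(x, 0) for x in A)
    qA = sum(q.get(x, 0) for x in A)
    return p if abs(pA - emp) <= abs(qA - emp) else q

def select(hypotheses, samples):
    """Simplified single-elimination variant of the Scheffe tournament."""
    best = hypotheses[0]
    for h in hypotheses[1:]:
        best = scheffe_winner(best, h, samples)
    return best

# toy usage: three biased-coin hypotheses, data drawn with bias 0.7
data = [int(random.random() < 0.7) for _ in range(1000)]
hyps = [{1: b, 0: 1 - b} for b in (0.3, 0.5, 0.7)]
print(select(hyps, data))
```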
G575
October 21
2024-10-21 16:00-17:00 (America/New_York)
Almost-Linear Time Algorithms for Partially Dynamic Graphs
Abstract: A partially dynamic graph is a graph that undergoes edge insertions or deletions, but not both. In this talk, I present a unifying framework that resolves numerous well-studied problems on such partially dynamic graphs almost-optimally. These include cycle detection, strongly connected components, s-t distances, transshipment, bipartite matching, maximum flow, and minimum-cost flow.

We achieve this unification by solving the partially dynamic threshold minimum-cost flow problem. In this problem, one is given a fixed demand vector and a threshold $F$, and tasked with reporting the first time a flow of cost $F$ exists or ceases to exist, for insertions and deletions respectively. We give separate algorithms for solving this problem in the incremental and the decremental case. Both follow the L1-IPM framework introduced as part of the first almost-linear time minimum-cost flow algorithm [Chen-Kyng-Liu-Peng-Probst Gutenberg-Sachdeva, FOCS'22]. For handling edge insertions, we develop new powerful data structures [Kyng-Meierhans-Probst Gutenberg, STOC'24] to solve the central min-ratio cycle problem against a general adversary [Chen-Kyng-Liu-Meierhans-Probst Gutenberg, STOC'24]. For handling edge deletions, we take the dual perspective. This leads us to a min-ratio cut problem, which we solve by constructing a dynamic tree that approximately preserves all cuts [van den Brand-Chen-Kyng-Liu-Meierhans-Probst Gutenberg-Sachdeva, FOCS'24].
G575
October 16
2024-10-16 16:00-17:00 (America/New_York)
Locally Stationary Distributions: Inference and Optimization Beyond Rapid Mixing
G575
October 09
Proofs of Space with Maximal Hardness
Boston University
2024-10-09 16:00-17:00 (America/New_York)
In a proof of space, a prover performs a complex computation with a large output. A verifier periodically checks that the prover still holds the output. The security goal for a proof of space construction is to ensure that a prover who erases even a portion of the output has to redo a large portion of the complex computation in order to satisfy the verifier.

In existing constructions of proofs of space, the computation that a cheating prover is forced to redo is a small fraction (vanishing or small constant) of the original complex computation. The only exception is a construction of Pietrzak (ITCS 2019) that requires extremely depth-robust graphs, which result in impractically high complexity of the initialization process.

We present the first proof of space of reasonable complexity that ensures that the prover has to redo almost the entire computation (a fraction arbitrarily close to 1) when trying to save even an arbitrarily small constant fraction of the space. Our construction is a generalization of an existing construction called SDR (Fisch, Eurocrypt 2019) deployed on the Filecoin blockchain. Our improvements, while general, also demonstrate that the already deployed construction has considerably better security than previously shown.

Technically, our construction can be viewed as amplifying predecessor-robust graphs. These are directed acyclic graphs in which every set of $\pi \cdot n$ nodes contains a subset of $\alpha_\pi \cdot n$ nodes whose induced subgraph has just one sink. We take a predecessor-robust graph with constant parameters $(\pi, \alpha_\pi)$, and build a bigger predecessor-robust graph with a near-optimal set of parameters and additional guarantees on sink placement, while increasing the degree only by a small additive constant.
G575
October 08
Tackling climate change as a research software engineer
University of Cambridge Computer Laboratory
2024-10-08 12:00-12:45 (America/New_York)
## Title: Tackling climate change as a research software engineer

## Abstract: In this talk I would like to give a high-level overview of the work we carry out in ICCS: how we work with our scientific collaborators to solve research problems, and the in-house software we have developed to facilitate the addition of ML into legacy weather and climate models. Specifically, we have several groups working with Fortran codes which are now looking to incorporate PyTorch ML models into their existing programs. I also have a keen interest in tooling, and I would like to introduce an MPI-aware debugger called mdb, which I am currently developing.

## Bio: Tom Meltzer is a senior Research Software Engineer working at the Institute of Computing for Climate Science (ICCS), based at the University of Cambridge. He specializes in high-performance computing, working with scientists to optimize and improve their software to drive better research outcomes. He is currently working on a next-generation sea-ice model (NextSimDG), writing a parallel backend using MPI. Before transitioning to research software engineering, he was an atomic and molecular physicist. He obtained his PhD from University College London.
Seminar Room D507
October 02
2024-10-02 16:00-17:00 (America/New_York)
Capacity Threshold for the Ising perceptron
Abstract: We show that the capacity of the Ising perceptron is, with high probability, upper bounded by the constant $\alpha \approx 0.833$ conjectured by Krauth and Mézard, under the condition that an explicit two-variable function $S(\lambda_1,\lambda_2)$ is maximized at $(1,0)$. The earlier work of Ding and Sun proves the matching lower bound subject to a similar numerical condition, and together these results give a conditional proof of the conjecture of Krauth and Mézard.
G575
September 18
2024-09-18 16:00-17:00 (America/New_York)
Sparse Graph Counting and Kelley-Meka Bounds for Binary Systems
Abstract: The graph counting lemma of Chung, Graham, and Wilson (Combinatorica 1988) is a fundamental result in combinatorics, which states that if a large graph $G$ is pseudorandom, then the number of copies of any small graph $H$ in $G$ is close to what is expected from a random graph of the same density. However, this result is only nontrivial when $G$ is a dense graph. In this work, we obtain a counting lemma that works in the sparse setting, and it is well-suited for the density-increment arguments in additive combinatorics.

In a recent remarkable breakthrough, Kelley and Meka (FOCS 2023) obtained a strong upper bound on the density of sets of integers without nontrivial three-term arithmetic progressions. We combine our counting lemma with other ideas to establish Kelley-Meka type bounds for all linear patterns defined by translation-invariant systems of binary linear forms, i.e., where each form depends on exactly two variables. In particular, we obtain strong bounds for the Turán problem in Abelian Cayley sum graphs, i.e., an upper bound on the maximum edge density of an Abelian Cayley sum graph with a clique of a given size. To prove our results, we employ some of the recent technology developed by Kelley and Meka, as well as the follow-up work by Kelley, Lovett, and Meka (STOC 2024).

This talk is based on joint work with Yuval Filmus, Hamed Hatami, and Kaave Hosseini.
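For reference, the dense counting lemma being generalized, stated informally: if an $n$-vertex graph $G$ of edge density $p$ is sufficiently pseudorandom (e.g., has small discrepancy), then for every fixed graph $H$,
\[
\#\{\text{labeled copies of } H \text{ in } G\} = (1 + o(1))\, p^{e(H)}\, n^{v(H)},
\]
which matches the expected count in the random graph $G(n,p)$. When $p = o(1)$, the error terms swamp the main term, which is why a genuinely sparse counting lemma is needed for the density-increment applications above.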
32-G575
September 04
2024-09-04 16:00-17:00 (America/New_York)
Towards Bridging Causal Inference and Algorithmic Decision-Making
Abstract: The goal in causal inference is to estimate counterfactual outcomes of units (e.g. patients, customers, subpopulations) under different interventions (e.g. medical treatments, discounts, socioeconomic policies). However, the end goal in practice is often to use these counterfactual estimates to make a decision which optimizes some downstream objective (e.g., maximizing life expectancy or revenue, minimizing unemployment). To bridge counterfactual estimation and decision-making, there are additional challenges one must take into account. We study two such challenges: (i) interventions are applied adaptively using some learning algorithm, (ii) units are strategic in what data they share about themselves. Specifically, we focus on the setting of panel data, where a learner observes repeated, noisy measurements of units over time. This talk is based on the following papers: https://arxiv.org/pdf/2307.01357 (NeurIPS 2023) and https://arxiv.org/pdf/2312.16307 (preprint).
32-G575
August 19
Exascale Climate Emulators: Enhancing Earth System Model Outputs and Reducing Petabytes of Storage
Sameh Abdulah
King Abdullah University of Science and Technology
2024-08-19 14:00-15:00 (America/New_York)
Title: Exascale Climate Emulators: Enhancing Earth System Model Outputs and Reducing Petabytes of Storage

Abstract: We present an exascale climate emulator to tackle the rising computational and storage demands of high-resolution Earth System Model simulations. Using the spherical harmonic transform, we model spatio-temporal climate variations stochastically, providing tunable resolution and significantly enhancing emulation fidelity. We extend linear solver software to multi-precision arithmetic on GPUs, adapting to different correlation strengths. The PaRSEC runtime system optimizes parallel matrix operations by balancing computation, communication, and memory. Our BLAS3-rich code is optimized for diverse GPU systems, achieving 0.523 EFlop/s on 4,096 ORNL Frontier nodes (a projected 1.04 EFlop/s on 8,192 nodes), 0.243 EFlop/s on 1,024 Cineca Leonardo nodes, and 0.375 EFlop/s on 3,072 ORNL Summit nodes.

Bio: Sameh Abdulah obtained his MS and PhD degrees from Ohio State University, Columbus, USA, in 2014 and 2016, respectively. Presently, he serves as a research scientist at the Extreme Computing Research Center (ECRC), King Abdullah University of Science and Technology, Saudi Arabia. His research focuses on various areas, including high-performance computing applications, big data, bitmap indexing, handling large spatial datasets, parallel spatial statistics applications, algorithm-based fault tolerance, and machine learning and data mining algorithms. Sameh was part of the KAUST team nominated for the ACM Gordon Bell Prize in 2022 for their work on large-scale climate/weather modeling and prediction.
Seminar Room G449 (Patil/Kiva)