Events

Hide Filters Show Filters

May 02, 2024

Decomposing Predictions by Modeling Model Computation

Harshay Shah

MIT CSAIL

4:00P

- 4:30P

Location

Room 32-G449 (Patil/Kiva)

Add to Calendar 2024-05-02 16:00:00 2024-05-02 16:30:00 America/New_York Decomposing Predictions by Modeling Model Computation Abstract: How does the internal computation of a machine learning model transform inputs into predictions? In this paper, we introduce a task called component modeling that aims to address this question. The goal of component modeling is to decompose an ML model's prediction in terms of its components -- simple functions (e.g., convolution filters, attention heads) that are the "building blocks" of model computation. We focus on a special case of this task, component attribution, where the goal is to estimate the counterfactual impact of individual components on a given prediction. We then present COAR, a scalable algorithm for estimating component attributions; we demonstrate its effectiveness across models, datasets, and modalities. Finally, we show that component attributions estimated with COAR directly enable model editing across five tasks, namely: fixing model errors, ``forgetting'' specific classes, boosting subpopulation robustness, localizing backdoor attacks, and improving robustness to typographic attacks. Paper: https://arxiv.org/abs/2404.11534Blog post: https://gradientscience.org/modelcomponents/Bio: Harshay is a PhD student at MIT CSAIL, advised by Aleksander Madry. His research interests are broadly in developing tools to understand and steer model behavior. Recently, he has been working on understanding how training data and learning algorithms collectively shape neural network representations. Room 32-G449 (Patil/Kiva)

EI Seminar - Hyung Won Chung (OpenAI) - Don’t teach. Incentivize: Scale-first view of Large Language Models

Hyung Won Chung

OpenAI

4:00P

- 5:00P

Location

32-D463 (Star)

Add to Calendar 2024-05-02 16:00:00 2024-05-02 17:00:00 America/New_York EI Seminar - Hyung Won Chung (OpenAI) - Don’t teach. Incentivize: Scale-first view of Large Language Models 32-D463 (Star)

May 03, 2024

Aparna Gupte: How to Construct Quantum FHE, Generically

Aparna Gupte (MIT)

10:30A

- 12:00P

Location

32-G882 (Hewlett)

Add to Calendar 2024-05-03 10:30:00 2024-05-03 12:00:00 America/New_York Aparna Gupte: How to Construct Quantum FHE, Generically We construct a (compact) quantum fully homomorphic encryption (QFHE) scheme starting from any (classical) fully homomorphic encryption scheme (with decryption in NC^1) together with a dual-mode trapdoor claw-free function family. Compared to previous constructions (Mahadev, FOCS 2018; Brakerski, CRYPTO 2018) which made non-black-box use of similar underlying primitives, our construction provides a pathway to instantiations from different assumptions. Our construction uses the techniques of Dulek, Schaffner and Speelman (CRYPTO 2016) and shows how to make the client in their QFHE scheme classical using dual-mode trapdoor claw-free functions. As an additional contribution, we show a new instantiation of dual-mode trapdoor claw-free functions from group actions. This is based on joint work with Vinod Vaikuntanathan. 32-G882 (Hewlett)

May 03, 2024

Aparna Gupte: How to Construct Quantum FHE, Generically

Aparna Gupte (MIT)

10:30A

- 12:00P

Location

32-G882 (Hewlett)

May 06, 2024

What I learned: stories and lessons from an academic lifetime

Oren Etzioni

TrueMedia.org, Allen Institute for AI

4:00P

- 5:00P

Location

32-G449 (Kiva)

Add to Calendar 2024-05-06 16:00:00 2024-05-06 17:00:00 America/New_York What I learned: stories and lessons from an academic lifetime Abstract: In high school, I aspired to be a standup comic, but I walked into the wrong bar and ended up studying computer science. Over the last 40 years, I had some wild rides in both academia and outside of it, leading up to fighting political deepfakes at TrueMedia.org. I plan to share some amusing highlights with you as parables whose morals are broadly applicable. Bio: Prof. Oren Etzioni is the founder of TrueMedia.org, a nonprofit fighting political deepfakes. He was the Founding Chief Executive Officer at the Allen Institute for AI (AI2), having served as CEO from its inception in 2013 until late 2022. He is Professor Emeritus at the University of Washington where he helped to pioneer meta-search, online comparison shopping, machine reading, and open information extraction. He has authored several award-winning technical papers, achieving an H-index of 100 (100 technical papers each cited over 100 times). Finally, he is a technical director of the AI2 Incubator and a Venture Partner at Madrona. He has founded several companies including Farecast (acquired by Microsoft). 32-G449 (Kiva)

May 07, 2024

Quest | CBMM Seminar Series: Invariance and equivariance in brains and machines

Bruno Olshausen

UC Berkeley

4:00P

- 5:30P

Location

Singleton Auditorium (46-3002)

Add to Calendar 2024-05-07 16:00:00 2024-05-07 17:30:00 America/New_York Quest | CBMM Seminar Series: Invariance and equivariance in brains and machines Abstract: The goal of building machines that can perceive and act in the world as humans and other animals do has been a focus of AI research efforts for over half a century. Over this same period, neuroscience has sought to achieve a mechanistic understanding of the brain processes underlying perception and action. It stands to reason that these parallel efforts could inform one another. However recent advances in deep learning and transformers have, for the most part, not translated into new neuroscientific insights; and other than deriving loose inspiration from neuroscience, AI has mostly pursued its own course which now deviates strongly from the brain. Here I propose an approach to building both invariant and equivariant representations in vision that is rooted in observations of animal behavior and informed by both neurobiological mechanisms (recurrence, dendritic nonlinearities, phase coding) and mathematical principles (group theory, residue numbers). What emerges from this approach is a neural circuit for factorization that can learn about shapes and their transformations from image data, and a model of the grid-cell system based on high-dimensional encodings of residue numbers. These models provide efficient solutions to long-studied problems that are well-suited for implementation in neuromorphic hardware or as a basis for forming hypotheses about visual cortex and entorhinal cortex.Bio: Professor Bruno Olshausen is a Professor in the Helen Wills Neuroscience Institute, the School of Optometry, and has a below-the-line affiliated appointment in EECS. He holds B.S. and M.S. degrees in Electrical Engineering from Stanford University, and a Ph.D. in Computation and Neural Systems from the California Institute of Technology. He did his postdoctoral work in the Department of Psychology at Cornell University and at the Center for Biological and Computational Learning at the Massachusetts Institute of Technology. From 1996-2005 he was on the faculty in the Center for Neuroscience at UC Davis, and in 2005 he moved to UC Berkeley. He also directs the Redwood Center for Theoretical Neuroscience, a multidisciplinary research group focusing on building mathematical and computational models of brain function (see http://redwood.berkeley.edu).Olshausen's research focuses on understanding the information processing strategies employed by the visual system for tasks such as object recognition and scene analysis. Computer scientists have long sought to emulate the abilities of the visual system in digital computers, but achieving performance anywhere close to that exhibited by biological vision systems has proven elusive. Dr. Olshausen's approach is based on studying the response properties of neurons in the brain and attempting to construct mathematical models that can describe what neurons are doing in terms of a functional theory of vision. The aim of this work is not only to advance our understanding of the brain but also to devise new algorithms for image analysis and recognition based on how brains work. Singleton Auditorium (46-3002)

Siva Vaidhyanathan - Digital Hegemony and Digital Sovereignty

Siva Vaidhyanathan

University of Virginia

4:00P

- 5:00P

Location

Kiva (G449)

Add to Calendar 2024-05-07 16:00:00 2024-05-07 17:00:00 America/New_York Siva Vaidhyanathan - Digital Hegemony and Digital Sovereignty Abstract: Through the first 30 years of the development of the internet, we were promised a global “network of networks” that would offer free speech, democratic empowerment, and the spread of democracy. Leaders from Ronald Reagan to Margaret Thatcher to Barack Obama all promised that technology would unite and enlighten the world. Somehow it all went differently, and now we live in a world traversed by networks dominated by hegemons like the United States, Russia, and China. In this talk, Professor Siva Vaidhyanathan will explain the idea of “digital sovereignty,” the ways that a nation state creates and enforces its own sense of what should be allowed and watched on digital networks, resisting digital hegemony through strategies of digital sovereingty. There are many models of “digital sovereignty,” each offering a distinct set of value and opportunities, as well as methods of oppression. This talk will focus on how the Russian invasion of Ukraine exposes the dangers and necessities of digital sovereignty.Bio:Siva Vaidhyanathan is the Robertson Professor of Media Studies and director of the Center for Media and Citizenship at the University of Virginia. He is the author of Antisocial Media: How Facebook Disconnects Us and Undermines Democracy (2018), Intellectual Property: A Very Short Introduction (2017), The Googlization of Everything -- and Why We Should Worry (2011), Copyrights and Copywrongs: The Rise of Intellectual Property and How it Threatens Creativity ( 2001), and The Anarchist in the Library: How the Clash between Freedom and Control is Hacking the Real World and Crashing the System (2004). He also co-edited (with Carolyn Thomas) the collection, Rewiring the Nation: The Place of Technology in American Studies (2007). Vaidhyanathan is a columnist for The Guardian and has written for many other periodicals, including The New York Times, Wired, Bloomberg View, American Scholar, Reason, Dissent, The Chronicle of Higher Education, The New York Times Magazine, Slate.com, BookForum, Columbia Journalism Review, Washington Post, The Virginia Quarterly Review, The New York Times Book Review, and The Nation. He is a frequent contributor to public radio programs. And he has appeared on news programs on BBC, CNN, NBC, CNBC, MSNBC, ABC, and on The Daily Show with Jon Stewart on Comedy Central. In 2015 he was portrayed on stage at the Public Theater in a play called Privacy. After five years as a professional journalist, he earned a Ph.D. in American Studies from the University of Texas at Austin. Vaidhyanathan has also taught at Wesleyan University, the University of Wisconsin at Madison, Columbia University, New York University, McMaster University, and the University of Amsterdam. He is a fellow at the New York Institute for the Humanities and a Faculty Associate of the Berkman Center for Internet and Society at Harvard University. He was born and raised in Buffalo, New York and resides in Charlottesville, Virginia.This talk will also be streamed over Zoom: https://mit.zoom.us/j/95568018736. Kiva (G449)

May 10, 2024

Pseudorandom Error-Correcting Codes

Miranda Christ (Columbia University)

10:30A

- 12:00P

Location

32-G882 Hewlett Room

Add to Calendar 2024-05-10 10:30:00 2024-05-10 12:00:00 America/New_York Pseudorandom Error-Correcting Codes We construct pseudorandom error-correcting codes (or simply pseudorandom codes), which are error-correcting codes with the property that any polynomial number of codewords are pseudorandom to any computationally-bounded adversary. Efficient decoding of corrupted codewords is possible with the help of a decoding key.We build pseudorandom codes that are robust to substitution and deletion errors, where pseudorandomness rests on standard cryptographic assumptions. Specifically, pseudorandomness is based on either 2^{O(\sqrt{n})}-hardness of LPN, or polynomial hardness of LPN and the planted XOR problem at low density.As our primary application of pseudorandom codes, we present an undetectable watermarking scheme for outputs of language models that is robust to cropping and a constant rate of random substitutions and deletions. The watermark is undetectable in the sense that any number of samples of watermarked text are computationally indistinguishable from text output by the original model. This is the first undetectable watermarking scheme that can tolerate a constant rate of errors.Our second application is to steganography, where a secret message is hidden in innocent-looking content. We present a constant-rate stateless steganography scheme with robustness to a constant rate of substitutions. Ours is the first stateless steganography scheme with provable steganographic security and any robustness to errors.This is based on joint work with Sam Gunn: https://eprint.iacr.org/2024/235 32-G882 Hewlett Room

May 14, 2024

Low-Step Multi-Commodity Flow Emulators

Thatchaphol Saranurak

University of Michigan

4:15P

- 5:15P

Location

Refreshments at 4:00pm

Add to Calendar 2024-05-14 16:15:00 2024-05-14 17:15:00 America/New_York Low-Step Multi-Commodity Flow Emulators Abstract: We introduce the concept of low-step multi-commodity flow emulators for any undirected, capacitated graph. At a high level, these emulators contain approximate multi-commodity flows whose paths contain a small number of edges, shattering the infamous flow decomposition barrier for multi-commodity flow.We prove the existence of low-step multi-commodity flow emulators and develop efficient algorithms to compute them. We then apply them to solve constant-approximate $k$-commodity flow in $O((m+k)^{1+\epsilon})$ time. To bypass the $O(mk)$ flow decomposition barrier, we represent our output multi-commodity flow implicitly; prior to our work, even the existence of implicit constant-approximate multi-commodity flows of size $o(mk)$ was unknown.Our results generalize to the \emph{minimum cost} setting, where each edge has an associated cost and the multi-commodity flow must satisfy a cost budget. Our algorithms are also parallel. 32-G449

May 17, 2024

Zhengzhong Jin: Universal SNARGs for NP from Proofs of Completeness

Zhengzhong Jin (Northeastern University)

10:30A

- 12:00P

Location

32-G882 (Hewlett)

Add to Calendar 2024-05-17 10:30:00 2024-05-17 12:00:00 America/New_York Zhengzhong Jin: Universal SNARGs for NP from Proofs of Completeness We construct a succinct non-interactive argument system (SNARG) for any NP language L, and prove the non-adaptive soundness assuming the security of an FHE scheme, a batch argument (BARG) scheme, as well as the existence of any two-message argument system for L that has a polynomial-size Extended Frege proof of the completeness property. Our SNARG is *universal* in the sense that the construction does not depend on the two-message argument system.We also show how to convert any adaptively sound designated verifier SNARG into publicly verifiable SNARGs with adaptive soundness, assuming the underlying designated verifier SNARG has a polynomial-size Extended Frege proof of completeness.Our framework yields several corollaries, including:- a SNARG for NP with a transparent CRS and non-adaptive soundness, assuming LWE and the (non-explicit) existence of any witness encryption for NP that has a polynomial-size 'Extended Frege proof of correctness'. As a corollary, we obtain SNARGs for NP under the evasive LWE and subexponential LWE assumptions, with a (long) transparent CRS and non-adaptive soundness. - a SNARG for UP with a long (and even transparent!) CRS and adaptive soundness under the evasive LWE and subexponential LWE assumptions.- a SNARG for NP with a short CRS and non-adaptive soundness assuming LWE, FHE, and the (non-explicit) existence of any hash function that makes Micali's SNARG construction sound.We prove our results by extending the encrypt-hash-and-BARG paradigm of [Jin-Kalai-Lombardi-Vaikuntanathan, STOC '24]; in this work, we use Extended Frege proofs as a security reduction from one argument system to another, rather than as an outright security proof. Our universal construction suggests that the encrypt-hash-and-BARG construction can be viewed as a ``best possible SNARG''.Based on the joint work with Yael Tauman Kalai, Alex Lombardi, and Surya Mathialagan. 32-G882 (Hewlett)

May 24, 2024

Dynamic Adaptive Optimization: Recovering from Hardware Errors and Software Crashes in a Distributed Virtual Machine

Ike Nassi

University of California, Santa Cruz

2:00P

- 3:00P

Location

32-G575

Add to Calendar 2024-05-24 14:00:00 2024-05-24 15:00:00 America/New_York Dynamic Adaptive Optimization: Recovering from Hardware Errors and Software Crashes in a Distributed Virtual Machine Abstract: TidalScale was a startup aquired by HPE in December 2022. TidalSale developed a software architecture called distributed virtual machines. Today's virtual machines in widespread use today allows multiple operating systems to share a server. TidalScale inverts this paradigm. A single virtual machine running on TidalScale runs a single operating system instance across a cluster of standard servers. This virtual machine sits between an operating system and a cluster of servers. It runs on premise or in the cloud. Because they are virtual, resources like processors and memory can migrate among nodes in the cluster. The virtual machine dynamically self-optimizes resource placement in real time under contol of a set of machine learning algorithms. Servers can automatically and dynamically be added and removed depending on fluctuationg workloads, allowing for dynamic hardware scalability, but also increasing reliability and resiliency. In this talk, we specifically show how these servers automatically, without any human intervention, recover from most hardware failures, and and provide excellent restart performance should OS failures occur.Bio: Ike Nassi is a consultant and an Adjunct Professor of Computer Science at UC Santa Cruz, a Founding Trustee at the Computer History Museum and an advisory board member of TTI/Vanguard. Ike was the founder of TidalScale, sold to HPE Dec. 2022. Previously, he was an Executive Vice President and Chief Scientist at SAP. Ike started or helped to start four companies: Encore Computer Corporation building hierarchical strongly coherent shared memory symmetric multiprocessors (1984); InfoGear Technology, which developed both Internet appliances (including the first iPhone) (1996); Firetide, an early wireless mesh networking company (2000), and TidalScale (2012). He was SVP for Software at Apple Computer and a Corporate Officer. He worked at Visual Technology, and Digital Equipment Corporation. In the past, Dr. Nassi was a Visiting Scholar at Stanford University, twice a Research Scientist at MIT, and a Visiting Scholar at University of California, Berkeley. He has served on the board of the Anita Borg Institute for Women and Technology, and the IEEE Computer Society Industry Advisory Board. He holds a PhD in Computer Science from Stony Brook University.He was awarded two certificates for Distinguished Service from the Department of Defense, one for his work on the design of the programming language Ada and one for his work on DARPA ISAT. He is a Life Fellow of IEEE and a Life member of ACM. He is named on over 35 patents. 32-G575

Events

Event Type

Impact Area

Research Area

Seminar Series

May 02, 2024

Decomposing Predictions by Modeling Model Computation

Location

EI Seminar - Hyung Won Chung (OpenAI) - Don’t teach. Incentivize: Scale-first view of Large Language Models

Location

May 03, 2024

Aparna Gupte: How to Construct Quantum FHE, Generically

Location

May 03, 2024

Aparna Gupte: How to Construct Quantum FHE, Generically

Location

May 06, 2024

What I learned: stories and lessons from an academic lifetime

Location

May 07, 2024

Quest | CBMM Seminar Series: Invariance and equivariance in brains and machines

Location

Siva Vaidhyanathan - Digital Hegemony and Digital Sovereignty

Location

May 10, 2024

Pseudorandom Error-Correcting Codes

Location

May 14, 2024

Low-Step Multi-Commodity Flow Emulators

Location

May 17, 2024

Zhengzhong Jin: Universal SNARGs for NP from Proofs of Completeness

Location

May 24, 2024

Dynamic Adaptive Optimization: Recovering from Hardware Errors and Software Crashes in a Distributed Virtual Machine

Location