Making Deep Learning More Robust, Modular, and Efficient
Speaker:
Zico Kolter
Carnegie Mellon University
Host:
Aleksander Madry
Abstract:
Deep learning is often seen as the "breakthrough" AI technology of recent years, revolutionizing areas spanning computer vision, natural language processing, and game playing. However, if we seek to deploy such systems in real-world, safety-critical domains, a starker reality emerges. Modern deep learning systems are brittle (sensitive to adversarial manipulation and lacking robustness more generally), opaque (their components are difficult to interpret and debug), and expensive (often requiring vastly more data than is practical in real-world settings).
These failings are sometimes billed as an argument against deep learning as a whole. In this talk, I will argue instead for new methods that address these challenges while preserving the fundamental benefits of deep learning, namely end-to-end training of composable, differentiable architectures. First, I will discuss our approaches to designing provably robust deep networks using tools from convex relaxations and duality, and highlight recent work on scaling these methods to much larger domains, including initial work on provable robustness at ImageNet scale. Second, I will present our work on integrating more complex modules as interpretable layers within deep learning architectures, showing how optimization solvers, physical simulators, model-based controllers, and game equilibrium solvers can all be integrated as layers within a deep network, enabling more intuitive architectures that learn from vastly less data. Finally, I will highlight additional ongoing directions and open questions in both areas.
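To make the certification idea concrete: the talk's approach is built on convex relaxations and duality, but even the simplest member of that family, interval bound propagation, illustrates how one can compute sound output bounds over an entire l_inf ball of inputs and certify a prediction. The sketch below is a minimal illustration of that bound-propagation idea, not the talk's method; the toy network, input, and epsilon are assumptions chosen only for the example.

```python
import torch
import torch.nn as nn

def interval_bounds(layers, x, eps):
    """Propagate elementwise lower/upper bounds for every input in the
    l_inf ball [x - eps, x + eps] through affine and ReLU layers."""
    lo, hi = x - eps, x + eps
    for layer in layers:
        if isinstance(layer, nn.Linear):
            # For an affine map, bound via center +/- radius with |W|.
            mid, rad = (lo + hi) / 2, (hi - lo) / 2
            center = mid @ layer.weight.t() + layer.bias
            radius = rad @ layer.weight.abs().t()
            lo, hi = center - radius, center + radius
        elif isinstance(layer, nn.ReLU):
            # ReLU is monotone, so it maps bounds to bounds directly.
            lo, hi = lo.clamp(min=0), hi.clamp(min=0)
    return lo, hi

# Hypothetical toy network and input, purely for illustration.
layers = [nn.Linear(784, 64), nn.ReLU(), nn.Linear(64, 10)]
x = torch.rand(1, 784)
pred = nn.Sequential(*layers)(x).argmax(dim=1).item()

# The prediction is certified robust if the lower bound on the predicted
# class logit beats the upper bound on every other class logit.
lo, hi = interval_bounds(layers, x, eps=1e-3)
others = [j for j in range(10) if j != pred]
certified = bool(lo[0, pred] > hi[0, others].max())
print(pred, certified)
```

The convex-relaxation approach in the talk computes much tighter bounds than this interval scheme (via LP relaxations and their duals), but the certification check at the end is the same in spirit.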
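Similarly, the "solver as a layer" idea can be made concrete with a differentiable convex optimization layer: the forward pass solves an optimization problem, and the backward pass differentiates through the argmin (implicitly, via the KKT conditions). Below is a minimal sketch using the open-source cvxpylayers library; the particular QP, a regularized projection onto the probability simplex, is an illustrative assumption rather than a problem from the talk.

```python
import cvxpy as cp
import torch
from cvxpylayers.torch import CvxpyLayer

# Layer: given scores q, output the argmin of a strongly convex QP
# over the probability simplex.
n = 5
z = cp.Variable(n)
q = cp.Parameter(n)
problem = cp.Problem(
    cp.Minimize(0.5 * cp.sum_squares(z) - q @ z),
    [cp.sum(z) == 1, z >= 0],
)
layer = CvxpyLayer(problem, parameters=[q], variables=[z])

q_t = torch.randn(n, requires_grad=True)
(z_star,) = layer(q_t)            # forward pass: solve the QP
z_star.square().sum().backward()  # backward pass: grad through the argmin
print(q_t.grad)                   # gradients flow to upstream parameters
```

Because the solver behaves like any other autograd module, it can sit anywhere inside a larger network and be trained end to end, which is what lets such layers encode structure (constraints, physics, equilibria) that would otherwise have to be learned from data.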
Bio:
Zico Kolter is an Assistant Professor in the Computer Science Department at Carnegie Mellon University, and also serves as chief scientist of AI research for the Bosch Center for Artificial Intelligence. His work focuses on the intersection of machine learning and optimization, with a particular focus on developing more robust, interpretable, and rigorous methods in deep learning. He has also worked in a number of application areas, highlighted by work on sustainability and smart energy systems. He is a recipient of the DARPA Young Faculty Award and best paper awards at KDD, PESGM, and IJCAI.