HCI Seminar
Speaker
Chris Olah
Host
Arvind Satyanarayan
Assistant Professor, EECS
Interpretability and Safety
How can we understand the inner workings of neural networks? Neural networks greatly exceed anything humans can design directly for computer vision, building up their own internal hierarchy of visual concepts. So, what are they detecting? How do they implement these detectors? And how do the detectors fit together to produce the behavior of the network as a whole?
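As a rough illustration of the kind of question above ("what are they detecting?"), one common interpretability technique is activation maximization, or feature visualization: optimize an input image to strongly excite a chosen unit and inspect what emerges. The sketch below is only a minimal example under assumed choices of model (GoogLeNet), layer, and channel; it is not the speaker's code.

```python
# Minimal activation-maximization sketch (illustrative assumptions throughout).
import torch
import torchvision.models as models

model = models.googlenet(weights="DEFAULT").eval()

# Capture activations from one intermediate layer via a forward hook.
acts = {}
layer = model.inception4a  # assumed layer of interest
layer.register_forward_hook(lambda m, i, o: acts.update(out=o))

# Start from random noise and ascend the mean activation of one channel.
img = torch.randn(1, 3, 224, 224, requires_grad=True)
opt = torch.optim.Adam([img], lr=0.05)
channel = 97  # arbitrary channel index, chosen for illustration

for _ in range(256):
    opt.zero_grad()
    model(img)
    loss = -acts["out"][0, channel].mean()  # maximize activation = minimize its negative
    loss.backward()
    opt.step()

# `img` now roughly shows the pattern this channel responds to.
```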
At a more practical level, can we use these techniques to audit neural networks? To find cases where the right decision is made for the wrong reasons? To allow human feedback on the decision process, rather than just the final decision? Or to improve our ability to design models?
Bio: Chris Olah is best known for DeepDream, the Distill journal, and his blog. He spent five years at Google Brain, where he focused on neural network interpretability and safety. He's also worked on various other projects, including early TensorFlow, generative models, and NLP. Prior to Google Brain, Chris dropped out of university and did deep learning research independently as a Thiel Fellow. Chris will be joining OpenAI in October to start a new interpretability team there.