Antonio Torralba

Human-Computer Interaction

Impact Areas

Big Data

Spoken Language Systems Group

Transportation

Publications

Projects

Project

Crossing the Vision-Language Boundary

Building models that learn spoken language by seeing and hearing

Impact Areas

Big Data

Project

Understanding Food Images

We focus on learning combined modalities (cooking recipes and food images).

Antonio Torralba

Ingmar Weber

Leads

Antonio Torralba

Ingmar Weber

Project

VirtualHome: Representing Activities as Programs

We aim to create a virtual environment where agents learn to perform human tasks by executing programs. Furthermore, we aim to develop models that can generate such programs from video or text, enabling agents to understand and imitate such activities.

Sanja Fidler

Kevin Ra

Xavier Puig Fernandez

Leads

Xavier Puig Fernandez

Project

Places Database for Scene Recognition

Places is a database of 10 million images for scene recognition, covering more than 400 scene categories. Places-CNNs are trained to recognize scene context at human-level accuracy.

Agata Lapedriza

Project

Semantic Scene Understanding through ADE20K dataset

We build a pixel-wise annotated image dataset for scene parsing. We also propose scene parsing networks that detect and segment visual concepts in any input image.

Sanja Fidler

Project

Understandable Deep Networks

Our aim is to improve the interpretability of deep neural networks to make it possible to understand their decisions, debug their errors, and make systematic improvements.

Antonio Torralba

Leads

Antonio Torralba

Project

Predictive Vision

We've developed an algorithm that anticipates visual events that may happen in the future.

Project

Where Are They Looking?

Our goal is to build a system that predicts where people are looking in images. Given an image and the location of a head, our approach follows the gaze of the person and identifies the object being looked at.

Impact Areas

Education

Health Care

Project

Single-Image 3D Interpreter Network

Cognitive AI Community of Research

We aim to understand 3D object structure from a single image. We propose an end-to-end framework that sequentially estimates 2D keypoint heatmaps and 3D object structure. It is trained on both real 2D-annotated images and synthetic 3D data, and it integrates a 3D-to-2D projection layer.

Impact Areas

Manufacturing

Project

Understanding Light via Deep Neural Networks

Our goal is to understand the illumination of an environment. By disentangling the effect of illumination from other intrinsic properties (e.g. geometry, texture, color), we can better understand how humans perceive the world. This also has applications such as single-image relighting and color editing.

Wei-Chiu Ma

Antonio Torralba

Leads

Wei-Chiu Ma

Antonio Torralba

Centers

Research Center

Center for Deployable Machine Learning (CDML)

The MIT Center for Deployable Machine Learning (CDML) works towards creating AI systems that are robust, reliable and safe for real-world deployment.

Aleksander Madry

Leads

Aleksander Madry

Research Areas

Algorithms & Theory

Human-Computer Interaction

Programming Languages & Software Engineering

Robotics

Security & Cryptography

Impact Areas

Big Data

Cybersecurity

Groups

Community of Research

Visual Computing at CSAIL

The shared mission of Visual Computing is to connect images and computation, spanning topics such as image and video generation and analysis, photography, human perception, touch, applied geometry, and more.

Jonathan Ragan-Kelley

Leads

Jonathan Ragan-Kelley

Impact Areas

Transportation

Community of Research

Embodied Intelligence Community of Research

Our goal is to understand the nature of intelligent behavior in the physical world, through the study of human intelligence and the design and implementation of intelligent robots.

Jim Glass

Leads

Jim Glass

Research Areas

Robotics

Research Group

Vision Group

Our researchers create state-of-the-art systems to better recognize objects, people, scenes, and behaviors, with applications in health care, gaming, tagging systems, and more.

Leads

Ted Adelson

Fredo Durand

William Freeman

Polina Golland

Berthold Horn

Aude Oliva

Antonio Torralba

Eric Grimson

Research Areas

Impact Areas