Dirty Data, Robotics, and Artificial Intelligence

Speaker

UC Berkeley

Host

Leslie Kaelbling

Abstract:
Large training datasets have revolutionized AI research, but enabling similar breakthroughs in other fields, such as Robotics, requires a new understanding of how to acquire, clean, and structure emergent forms of large-scale, unstructured sequential data. My talk presents a systematic approach to handling such dirty data in the context of modern AI applications. I start by introducing a statistical formalization on data cleaning in this setting including research on: (1) how common data cleaning operations affect model training, (2) how data cleaning programs can be expected to generalize to unseen data, (3) and how to prioritize limited human intervention in rapidly growing datasets. Then, using surgical robotics as a motivating example, I present a series of robust Bayesian models for automatically extracting hierarchical structure from highly varied and noisy robot trajectory data facilitating imitation learning and reinforcement learning on short, consistent sub-problems. I present how the combination of clean training data and structured learning tasks enables learning highly accurate control policies in tasks ranging from surgical cutting to debridement.

Bio:
Sanjay Krishnan is a Computer Science PhD candidate in the RISELab and in the Berkeley Laboratory for Automation Science and Engineering at UC Berkeley. His research studies problems on the intersection of database theory, machine learning, and robotics. Sanjay's work has received a number of awards including the 2016 SIGMOD Best Demonstration award, 2015 IEEE GHTC Best Paper award, and Sage Scholar award. https://www.ocf.berkeley.edu/~sanjayk/

Add to Calendar 2017-12-15 14:00:00 2017-12-15 15:00:00 America/New_York Dirty Data, Robotics, and Artificial Intelligence Abstract:Large training datasets have revolutionized AI research, but enabling similar breakthroughs in other fields, such as Robotics, requires a new understanding of how to acquire, clean, and structure emergent forms of large-scale, unstructured sequential data. My talk presents a systematic approach to handling such dirty data in the context of modern AI applications. I start by introducing a statistical formalization on data cleaning in this setting including research on: (1) how common data cleaning operations affect model training, (2) how data cleaning programs can be expected to generalize to unseen data, (3) and how to prioritize limited human intervention in rapidly growing datasets. Then, using surgical robotics as a motivating example, I present a series of robust Bayesian models for automatically extracting hierarchical structure from highly varied and noisy robot trajectory data facilitating imitation learning and reinforcement learning on short, consistent sub-problems. I present how the combination of clean training data and structured learning tasks enables learning highly accurate control policies in tasks ranging from surgical cutting to debridement.Bio:Sanjay Krishnan is a Computer Science PhD candidate in the RISELab and in the Berkeley Laboratory for Automation Science and Engineering at UC Berkeley. His research studies problems on the intersection of database theory, machine learning, and robotics. Sanjay's work has received a number of awards including the 2016 SIGMOD Best Demonstration award, 2015 IEEE GHTC Best Paper award, and Sage Scholar award. https://www.ocf.berkeley.edu/~sanjayk/ Star (32-D463)

Organizer & Contact

Leslie Kaelbling

lpk@csail.mit.edu

Part of

Robotics@MIT Seminar Series 2017

Dirty Data, Robotics, and Artificial Intelligence

Speaker

Host

December 15 2017

Location

Organizer & Contact

Part of

February 14

Robust and Risk-Sensitive Planning via Contraction Theory and Convex Optimization

November 14

Multicellular Machines: A Bio-inspired approach to automated electromechanical design and fabrication

Dirty Data, Robotics, and Artificial Intelligence

Speaker

Host

December 15 2017

Location

Organizer & Contact

Part of

Related Events

February 14

Robust and Risk-Sensitive Planning via Contraction Theory and Convex Optimization

November 14

Multicellular Machines: A Bio-inspired approach to automated electromechanical design and fabrication