While deep networks have driven major leaps in raw performance across many applications, they are also notoriously brittle to targeted input perturbations, so-called adversarial examples. This brittleness poses a serious risk in safety- and security-critical applications where reliability and robustness are essential.
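To make the notion of a targeted perturbation concrete, here is a minimal sketch of the classic fast gradient sign method (FGSM) applied to a toy logistic-regression model; the model, weights, and `eps` value are illustrative assumptions, not part of the talk.

```python
import numpy as np

def fgsm_linear(x, y, w, b, eps):
    """Fast gradient sign method for a linear classifier sign(w.x + b).
    Perturbs x within an L-infinity ball of radius eps so as to
    increase the logistic loss on the true label y (y in {-1, +1})."""
    # Gradient of log(1 + exp(-y * (w.x + b))) with respect to x
    margin = y * (np.dot(w, x) + b)
    grad = -y * w / (1.0 + np.exp(margin))
    # Worst-case L-infinity step: move each coordinate eps in the
    # direction of the gradient's sign
    return x + eps * np.sign(grad)

# Toy example (hypothetical numbers): a confidently classified point
# becomes misclassified after a small bounded perturbation.
w = np.array([1.0, -2.0]); b = 0.0
x = np.array([0.5, -0.5]); y = 1.0      # w.x + b = 1.5 > 0, correct
x_adv = fgsm_linear(x, y, w, b, eps=0.6)
# w.x_adv + b = -0.3 < 0: the perturbed input is now misclassified
```

For a linear model this single signed step is the exact worst case in the L-infinity ball; for deep networks the same idea is iterated (as in PGD).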
In this talk, we discuss several approaches for mitigating the effect of adversarial examples, each offering different degrees and types of robustness. We first discuss provable defenses, which guarantee that no adversarial example exists within an L-p bounded region around an input. Next, we study alternative threat models for adversarial examples, such as the Wasserstein threat model and the union of multiple threat models. Finally, we present some unexpected findings on the robust learning problem: weak adversaries can be sufficient for adversarial training, and overfitting is a dominant phenomenon in adversarially robust training.
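One common ingredient of provable defenses is propagating an L-infinity ball through the network to bound its outputs. The sketch below shows interval bound propagation through a single linear layer; the one-layer classifier and the numbers are hypothetical, chosen so the bound is easy to check by hand.

```python
import numpy as np

def ibp_linear(x, eps, W, b):
    """Interval bound propagation through one linear layer.
    For every x' in the L-infinity ball of radius eps around x,
    returns elementwise lower/upper bounds on W @ x' + b."""
    center = W @ x + b
    # Each output coordinate can shift by at most eps times the
    # L1 norm of the corresponding weight row
    radius = np.abs(W) @ np.full_like(x, eps)
    return center - radius, center + radius

# Hypothetical binary classifier with margin w.x + b: if the lower
# bound on the margin stays positive over the whole eps-ball, no
# adversarial example exists inside it (exact for this linear case).
W = np.array([[1.0, -2.0]]); b = np.array([0.0])
x = np.array([0.5, -0.5])             # clean margin = 1.5
lo, hi = ibp_linear(x, 0.4, W, b)     # lo = 1.5 - 0.4 * 3 = 0.3 > 0
```

For deep networks the same interval arithmetic is applied layer by layer (with monotone activations handled elementwise), yielding a sound but looser certificate.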