How to Use Heuristics for Differential Privacy

Speaker

Seth Neel

U Penn

Host

Akshay Degwekar, Pritish Kamath and Govind Ramnarayan

CSAIL MIT

Abstract: In this paper, we develop theory for using heuristics to solve computationally hard problems in differential privacy. Heuristic approaches have enjoyed tremendous success in machine learning, in which performance can be empirically evaluated. However, privacy guarantees cannot be evaluated empirically, and must be proven — without making heuristic assumptions. We show that learning problems over broad classes of functions — those that have universal identification sequences — can be solved privately, assuming the existence of a non-private oracle for solving the same problem. Our generic algorithm yields a privacy guarantee that only holds if the oracle succeeds. We then give a reduction which applies to a class of heuristics, which we call certifiable, which allows us to give a worst-case privacy guarantee that holds even when the oracle might fail in adversarial ways. Finally, we consider classes of functions for which both they and their dual classes have universal identification sequences. This includes most classes of simple boolean functions studied in the PAC learning literature, including halfspaces, conjunctions, disjunctions, and parities. We show that there is an efficient algorithm for privately constructing synthetic data for any such class, given a non-private learning oracle.

Add to Calendar 2018-11-07 16:00:00 2018-11-07 17:00:00 America/New_York How to Use Heuristics for Differential Privacy Abstract: In this paper, we develop theory for using heuristics to solve computationally hard problems in differential privacy. Heuristic approaches have enjoyed tremendous success in machine learning, in which performance can be empirically evaluated. However, privacy guarantees cannot be evaluated empirically, and must be proven — without making heuristic assumptions. We show that learning problems over broad classes of functions — those that have universal identification sequences — can be solved privately, assuming the existence of a non-private oracle for solving the same problem. Our generic algorithm yields a privacy guarantee that only holds if the oracle succeeds. We then give a reduction which applies to a class of heuristics, which we call certifiable, which allows us to give a worst-case privacy guarantee that holds even when the oracle might fail in adversarial ways. Finally, we consider classes of functions for which both they and their dual classes have universal identification sequences. This includes most classes of simple boolean functions studied in the PAC learning literature, including halfspaces, conjunctions, disjunctions, and parities. We show that there is an efficient algorithm for privately constructing synthetic data for any such class, given a non-private learning oracle. 32-G882

Organizer & Contact

Rebecca Yadegar

ryadegar@csail.mit.edu

Part of

Algorithms & Complexity Seminars 2018-2019

How to Use Heuristics for Differential Privacy

Speaker

Host

November 07 2018

Location

Organizer & Contact

Part of

November 09

Simple and Efficient Algorithm for Parallel Matchings

June 05

The Sample Complexity of Toeplitz Covariance Estimation

How to Use Heuristics for Differential Privacy

Speaker

Host

November 07 2018

Location

Organizer & Contact

Part of

Related Events

November 09

Simple and Efficient Algorithm for Parallel Matchings

June 05

The Sample Complexity of Toeplitz Covariance Estimation