User-Level Differential Privacy With Few Examples Per User

Speaker

Google Research

Host

Noah Golowich

MIT

Abstract: Previous work on user-level differential privacy (DP) [Ghazi et al., NeurIPS 2021; Bun et al., STOC 2023] obtained generic algorithms that work for various learning tasks. However, their focus was on the example-rich regime, where the users have so many examples that each user could themselves solve the problem. In this work we consider the example-scarce regime, where each user has only a few examples, and obtain the following results:
For approximate-DP, we give a generic transformation of any item-level DP algorithm to a user-level DP algorithm. Roughly speaking, the latter gives a (multiplicative) savings of O_{ε,δ}(√m) in terms of the number of users required for achieving the same utility, where m is the number of examples per user. This algorithm, while recovering most known bounds for specific problems, also gives new bounds, e.g., for PAC learning.
For pure-DP, we present a simple technique for adapting the exponential mechanism [McSherry & Talwar, FOCS 2007] to the user-level setting. This gives new bounds for a variety of tasks, such as private PAC learning, hypothesis selection, and distribution learning. For some of these problems, we show that our bounds are near-optimal.
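For readers unfamiliar with the exponential mechanism, the following is a minimal sketch of the standard mechanism together with the naive way to make it user-level private: scale the score sensitivity by m, the number of examples per user. This is only a baseline for comparison and is not the technique presented in the talk. The names `exponential_mechanism`, `naive_user_level_select`, and `correct` are hypothetical, and m is assumed to be a publicly known bound on examples per user.

```python
import math
import random


def exponential_mechanism(candidates, score, epsilon, sensitivity, rng=random):
    """Sample a candidate with probability proportional to
    exp(epsilon * score / (2 * sensitivity))."""
    scores = [score(c) for c in candidates]
    # Subtract the maximum score for numerical stability; this does not
    # change the sampling distribution.
    top = max(scores)
    weights = [math.exp(epsilon * (s - top) / (2.0 * sensitivity)) for s in scores]
    return rng.choices(candidates, weights=weights, k=1)[0]


def naive_user_level_select(candidates, users, correct, epsilon, m, rng=random):
    """Naive user-level baseline (an assumption for illustration, not the
    talk's method): the score counts correctly handled examples, so replacing
    one user's data changes it by at most m."""
    if any(len(u) > m for u in users):
        raise ValueError("every user must hold at most m examples")

    def score(h):
        return sum(1 for u in users for x in u if correct(h, x))

    return exponential_mechanism(candidates, score, epsilon, sensitivity=m, rng=rng)
```

Because the sensitivity grows linearly in m, this baseline gains nothing from users holding more examples, which is the gap that user-level techniques aim to close.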

Date

September 25, 2023, 4:00 PM to 5:00 PM (America/New_York)

Location

32-G449

Organizer & Contact

Noah Golowich

nzg@csail.mit.edu

Part of

Algorithms and Complexity Seminar 2023

Related Events

April 12

Bounded independence plus noise and derandomization

April 19

Credible Decentralized Exchange Design via Verifiable Sequencing Rules