The Secret Sharer: Evaluating and Testing Unintended Memorization in Neural Networks

Nicholas Carlini; Chang Liu; Jernej Kos; Úlfar Erlingsson; Dawn Song

The Secret Sharer: Evaluating and Testing Unintended Memorization in Neural Networks

Nicholas Carlini

Chang Liu

Jernej Kos

Úlfar Erlingsson

Dawn Song

USENIX Security (2019) (to appear)

Google Scholar

Abstract

This paper describes a testing methodology for quantitatively assessing the risk of \emph{unintended memorization} of rare or unique sequences in generative sequence models---a common type of neural network. Such models are sometimes trained on sensitive data (e.g., the text of users' private messages); our methodology allows deep-learning to choose configurations that minimize memorization during training, thereby benefiting privacy.

In experiments, we show that unintended memorization is a persistent, hard-to-avoid issue that can have serious consequences. Specifically, if not addressed during training, we show that new, efficient procedures can allow extracting unique, secret sequences such as credit card numbers from trained models. We also show that our testing strategy is practical and easy-to-apply, e.g., by describing its use for quantitatively preventing data exposure in a production, commercial neural network---a predictive email-composition assistant trained on millions of users' email messages.

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

The Secret Sharer: Evaluating and Testing Unintended Memorization in Neural Networks

Abstract

Research Areas

Meet the teams driving innovation

Google AI

Google Cloud

Google DeepMind

Google Labs