Differentially private training of neural networks with Langevin   dynamics for calibrated predictive uncertainty

Moritz Knolle; Alexander Ziller; Dmitrii Usynin; Rickmer Braren,; Marcus R. Makowski; Daniel Rueckert; Georgios Kaissis

arXiv:2107.04296·cs.LG·August 5, 2021

Differentially private training of neural networks with Langevin dynamics for calibrated predictive uncertainty

Moritz Knolle, Alexander Ziller, Dmitrii Usynin, Rickmer Braren,, Marcus R. Makowski, Daniel Rueckert, Georgios Kaissis

PDF

Open Access

TL;DR

This paper introduces a method combining Langevin dynamics with differential privacy to train neural networks that produce better-calibrated uncertainty estimates, addressing overconfidence issues in safety-critical applications.

Contribution

The paper presents a novel approach that adapts stochastic gradient Langevin dynamics for differentially private training, improving uncertainty calibration in neural networks.

Findings

01

Significantly reduces calibration error in neural networks.

02

Provides more reliable uncertainty estimates than standard DP-SGD.

03

Demonstrates effectiveness on MNIST and Pediatric Pneumonia Dataset.

Abstract

We show that differentially private stochastic gradient descent (DP-SGD) can yield poorly calibrated, overconfident deep learning models. This represents a serious issue for safety-critical applications, e.g. in medical diagnosis. We highlight and exploit parallels between stochastic gradient Langevin dynamics, a scalable Bayesian inference technique for training deep neural networks, and DP-SGD, in order to train differentially private, Bayesian neural networks with minor adjustments to the original (DP-SGD) algorithm. Our approach provides considerably more reliable uncertainty estimates than DP-SGD, as demonstrated empirically by a reduction in expected calibration error (MNIST $\sim 5$ -fold, Pediatric Pneumonia Dataset $\sim 2$ -fold).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Machine Learning and Algorithms · COVID-19 diagnosis using AI