Differentially private training of neural networks with Langevin dynamics for calibrated predictive uncertainty
Moritz Knolle, Alexander Ziller, Dmitrii Usynin, Rickmer Braren,, Marcus R. Makowski, Daniel Rueckert, Georgios Kaissis

TL;DR
This paper introduces a method combining Langevin dynamics with differential privacy to train neural networks that produce better-calibrated uncertainty estimates, addressing overconfidence issues in safety-critical applications.
Contribution
The paper presents a novel approach that adapts stochastic gradient Langevin dynamics for differentially private training, improving uncertainty calibration in neural networks.
Findings
Significantly reduces calibration error in neural networks.
Provides more reliable uncertainty estimates than standard DP-SGD.
Demonstrates effectiveness on MNIST and Pediatric Pneumonia Dataset.
Abstract
We show that differentially private stochastic gradient descent (DP-SGD) can yield poorly calibrated, overconfident deep learning models. This represents a serious issue for safety-critical applications, e.g. in medical diagnosis. We highlight and exploit parallels between stochastic gradient Langevin dynamics, a scalable Bayesian inference technique for training deep neural networks, and DP-SGD, in order to train differentially private, Bayesian neural networks with minor adjustments to the original (DP-SGD) algorithm. Our approach provides considerably more reliable uncertainty estimates than DP-SGD, as demonstrated empirically by a reduction in expected calibration error (MNIST -fold, Pediatric Pneumonia Dataset -fold).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning and Algorithms · COVID-19 diagnosis using AI
