On the Intrinsic Differential Privacy of Bagging
Hongbin Liu, Jinyuan Jia, Neil Zhenqiang Gong

TL;DR
This paper demonstrates that Bagging inherently provides differential privacy due to its intrinsic randomness, offering a privacy-preserving ensemble method with strong theoretical guarantees and superior empirical performance on standard datasets.
Contribution
The authors prove that Bagging naturally achieves differential privacy without additional noise, providing tight privacy guarantees and outperforming existing methods in accuracy.
Findings
Bagging achieves differential privacy intrinsically.
Theoretical privacy bounds are tight under general assumptions.
Empirical results show higher accuracy than state-of-the-art methods.
Abstract
Differentially private machine learning trains models while protecting privacy of the sensitive training data. The key to obtain differentially private models is to introduce noise/randomness to the training process. In particular, existing differentially private machine learning methods add noise to the training data, the gradients, the loss function, and/or the model itself. Bagging, a popular ensemble learning framework, randomly creates some subsamples of the training data, trains a base model for each subsample using a base learner, and takes majority vote among the base models when making predictions. Bagging has intrinsic randomness in the training process as it randomly creates subsamples. Our major theoretical results show that such intrinsic randomness already makes Bagging differentially private without the needs of additional noise. In particular, we prove that, for any base…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Cryptography and Data Security
