TL;DR
This paper introduces a differentially private decision forest algorithm that leverages smooth sensitivity and the Exponential Mechanism to significantly reduce noise and improve predictive accuracy over existing methods.
Contribution
It presents a novel decision forest approach that minimizes query sensitivity using smooth sensitivity and the Exponential Mechanism, extending to continuous features.
Findings
Higher predictive power than current state-of-the-art
Reduced noise due to lower sensitivity
Effective handling of continuous features
Abstract
We propose a new differentially-private decision forest algorithm that minimizes both the number of queries required, and the sensitivity of those queries. To do so, we build an ensemble of random decision trees that avoids querying the private data except to find the majority class label in the leaf nodes. Rather than using a count query to return the class counts like the current state-of-the-art, we use the Exponential Mechanism to only output the class label itself. This drastically reduces the sensitivity of the query -- often by several orders of magnitude -- which in turn reduces the amount of noise that must be added to preserve privacy. Our improved sensitivity is achieved by using "smooth sensitivity", which takes into account the specific data used in the query rather than assuming the worst-case scenario. We also extend work done on the optimal depth of random decision trees…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
