Reducing Risk of Model Inversion Using Privacy-Guided Training
Abigail Goldsteen, Gilad Ezov, Ariel Farkash

TL;DR
This paper introduces a privacy-guided training method for tree-based models that reduces the influence of sensitive features, thereby decreasing the risk of model inversion attacks without compromising accuracy.
Contribution
It presents a novel approach to mitigate model inversion risks by adjusting feature influence during training, a concept not thoroughly explored before.
Findings
Reducing sensitive feature influence lowers inversion attack success.
Model accuracy remains stable despite influence adjustments.
The method is effective against various black-box and white-box attacks.
Abstract
Machine learning models often pose a threat to the privacy of individuals whose data is part of the training set. Several recent attacks have been able to infer sensitive information from trained models, including model inversion or attribute inference attacks. These attacks are able to reveal the values of certain sensitive features of individuals who participated in training the model. It has also been shown that several factors can contribute to an increased risk of model inversion, including feature influence. We observe that not all features necessarily share the same level of privacy or sensitivity. In many cases, certain features used to train a model are considered especially sensitive and therefore propitious candidates for inversion. We present a solution for countering model inversion attacks in tree-based models, by reducing the influence of sensitive features in these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Cryptography and Data Security
