Private Counterfactual Retrieval
Mohamed Nomeir, Pasan Dissanayake, Shreya Meel, Sanghamitra Dutta, Sennur Ulukus

TL;DR
This paper introduces privacy-preserving schemes for retrieving counterfactual explanations in machine learning, ensuring user privacy and analyzing trade-offs between accuracy and privacy leakage.
Contribution
It proposes novel private information retrieval schemes for counterfactual explanations, achieving perfect user privacy and quantifying database leakage.
Findings
Exact nearest neighbor retrieval with perfect user privacy.
Quantification and reduction strategies for database leakage.
Empirical validation of privacy-accuracy trade-offs.
Abstract
Transparency and explainability are two extremely important aspects to be considered when employing black-box machine learning models in high-stake applications. Providing counterfactual explanations is one way of fulfilling this requirement. However, this also poses a threat to the privacy of both the institution that is providing the explanation as well as the user who is requesting it. In this work, we propose multiple schemes inspired by private information retrieval (PIR) techniques which ensure the \emph{user's privacy} when retrieving counterfactual explanations. We present a scheme which retrieves the \emph{exact} nearest neighbor counterfactual explanation from a database of accepted points while achieving perfect (information-theoretic) privacy for the user. While the scheme achieves perfect privacy for the user, some leakage on the database is inevitable which we quantify…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Cryptography and Data Security
