What If, But Privately: Private Counterfactual Retrieval
Shreya Meel, Mohamed Nomeir, Pasan Dissanayake, Sanghamitra Dutta, Sennur Ulukus

TL;DR
This paper introduces privacy-preserving methods for retrieving counterfactual explanations in machine learning, ensuring user privacy while providing accurate explanations, and extends the framework to immutable features and user preferences.
Contribution
It proposes new schemes for private counterfactual retrieval that guarantee perfect user privacy and reduce database leakage, including extensions for immutable features and user preferences.
Findings
Achieves perfect, information-theoretic privacy for users.
Reduces database information leakage compared to baseline schemes.
Supports retrieval with immutable features and user attribute preferences.
Abstract
Transparency and explainability are two important aspects to be considered when employing black-box machine learning models in high-stake applications. Providing counterfactual explanations is one way of catering this requirement. However, this also poses a threat to the privacy of the institution that is providing the explanation, as well as the user who is requesting it. In this work, we are primarily concerned with the user's privacy who wants to retrieve a counterfactual instance, without revealing their feature vector to the institution. Our framework retrieves the exact nearest neighbor counterfactual explanation from a database of accepted points while achieving perfect, information-theoretic, privacy for the user. First, we introduce the problem of private counterfactual retrieval (PCR) and propose a baseline PCR scheme that keeps the user's feature vector…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
