Exploiting Fairness to Enhance Sensitive Attributes Reconstruction
Julien Ferry (LAAS-ROC), Ulrich Aïvodji (ETS), Sébastien Gambs (UQAM), Marie-José Huguet (LAAS-ROC), Mohamed Siala (LAAS-ROC)

TL;DR
This paper shows that an adversary with black-box access to a fair machine learning model can exploit information about the model's fairness to better reconstruct the sensitive attributes of its training data, and it proposes a generic correction method for doing so.
Contribution
It introduces a model-agnostic reconstruction correction method that leverages fairness information to enhance sensitive attribute recovery from black-box models.
Findings
The method improves sensitive attribute reconstruction across multiple fairness metrics.
Experimental results confirm the approach's effectiveness on various datasets.
The approach is applicable to different fair learning methods and models.
Abstract
In recent years, a growing body of work has emerged on how to learn machine learning models under fairness constraints, often expressed with respect to some sensitive attributes. In this work, we consider the setting in which an adversary has black-box access to a target model and show that information about this model's fairness can be exploited by the adversary to enhance his reconstruction of the sensitive attributes of the training data. More precisely, we propose a generic reconstruction correction method, which takes as input an initial guess made by the adversary and corrects it to comply with some user-defined constraints (such as the fairness information) while minimizing the changes in the adversary's guess. The proposed method is agnostic to the type of target model, the fairness-aware learning method as well as the auxiliary knowledge of the adversary. To assess the…
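The correction step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation. It assumes a binary sensitive attribute, binary model predictions, and a single fairness constraint (a statistical parity gap of at most `epsilon`, a tolerance the adversary is assumed to know). The names `parity_gap`, `correct_guess`, and `_retarget` are hypothetical. Under these assumptions the gap depends only on how many positively and negatively predicted examples are assigned to group 1, so enumerating those two counts finds a guess that satisfies the constraint while flipping as few entries of the initial guess as possible.

```python
def parity_gap(y_pred, s):
    """Absolute difference in positive-prediction rates between groups 0 and 1."""
    g1 = [y for y, a in zip(y_pred, s) if a == 1]
    g0 = [y for y, a in zip(y_pred, s) if a == 0]
    return abs(sum(g1) / len(g1) - sum(g0) / len(g0))


def _retarget(s, idx, target):
    """Flip entries of s at positions idx until exactly `target` of them equal 1."""
    current = sum(s[i] for i in idx)
    for i in idx:
        if current == target:
            break
        if current < target and s[i] == 0:
            s[i], current = 1, current + 1
        elif current > target and s[i] == 1:
            s[i], current = 0, current - 1


def correct_guess(y_pred, s_guess, epsilon):
    """Return the assignment closest to s_guess (Hamming distance) whose
    statistical parity gap on y_pred is at most epsilon (assumed known)."""
    n = len(y_pred)
    pos = [i for i in range(n) if y_pred[i] == 1]
    neg = [i for i in range(n) if y_pred[i] == 0]
    g_pos = sum(s_guess[i] for i in pos)  # guessed group-1 members among positives
    g_neg = sum(s_guess[i] for i in neg)  # guessed group-1 members among negatives

    best = None  # (number of flips, a, b)
    for a in range(len(pos) + 1):      # positives assigned to group 1
        for b in range(len(neg) + 1):  # negatives assigned to group 1
            n1, n0 = a + b, n - a - b
            if n1 == 0 or n0 == 0:
                continue  # both groups must be non-empty
            gap = abs(a / n1 - (len(pos) - a) / n0)
            if gap > epsilon + 1e-12:
                continue  # violates the fairness constraint
            flips = abs(a - g_pos) + abs(b - g_neg)
            if best is None or flips < best[0]:
                best = (flips, a, b)

    if best is None:
        raise ValueError("no assignment satisfies the fairness constraint")

    _, a, b = best
    corrected = list(s_guess)
    _retarget(corrected, pos, a)
    _retarget(corrected, neg, b)
    return corrected


# Toy usage: the initial guess perfectly separates predictions (gap of 1.0),
# which is incompatible with a model assumed to satisfy a gap of at most 0.5.
y_pred = [1, 1, 1, 1, 0, 0, 0, 0]
s_guess = [1, 1, 1, 1, 0, 0, 0, 0]
corrected = correct_guess(y_pred, s_guess, epsilon=0.5)
```

Which entries get flipped within each prediction class is arbitrary in this sketch. An adversary with per-example confidence scores could flip the least confident entries first, and richer constraints or non-binary attributes would call for a general-purpose constraint or integer programming solver rather than this enumeration.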
