Exploiting Fairness to Enhance Sensitive Attributes Reconstruction

Julien Ferry (LAAS-ROC); Ulrich A\"ivodji (ETS); S\'ebastien Gambs; (UQAM); Marie-Jos\'e Huguet (LAAS-ROC); Mohamed Siala (LAAS-ROC)

arXiv:2209.01215·cs.LG·September 7, 2022

Exploiting Fairness to Enhance Sensitive Attributes Reconstruction

Julien Ferry (LAAS-ROC), Ulrich A\"ivodji (ETS), S\'ebastien Gambs, (UQAM), Marie-Jos\'e Huguet (LAAS-ROC), Mohamed Siala (LAAS-ROC)

PDF

TL;DR

This paper shows how fairness constraints in machine learning models can be exploited by adversaries to better reconstruct sensitive attributes, proposing a generic correction method that improves reconstruction accuracy.

Contribution

It introduces a model-agnostic reconstruction correction method that leverages fairness information to enhance sensitive attribute recovery from black-box models.

Findings

01

The method improves sensitive attribute reconstruction across multiple fairness metrics.

02

Experimental results confirm the approach's effectiveness on various datasets.

03

The approach is applicable to different fair learning methods and models.

Abstract

In recent years, a growing body of work has emerged on how to learn machine learning models under fairness constraints, often expressed with respect to some sensitive attributes. In this work, we consider the setting in which an adversary has black-box access to a target model and show that information about this model's fairness can be exploited by the adversary to enhance his reconstruction of the sensitive attributes of the training data. More precisely, we propose a generic reconstruction correction method, which takes as input an initial guess made by the adversary and corrects it to comply with some user-defined constraints (such as the fairness information) while minimizing the changes in the adversary's guess. The proposed method is agnostic to the type of target model, the fairness-aware learning method as well as the auxiliary knowledge of the adversary. To assess the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.