Towards Fairness and Privacy: A Novel Data Pre-processing Optimization Framework for Non-binary Protected Attributes

Manh Khoi Duong; Stefan Conrad

arXiv:2410.00836 · cs.LG · November 19, 2024

TL;DR

This paper introduces a flexible data pre-processing framework that uses combinatorial optimization, including genetic algorithms, to enhance fairness and privacy in datasets with non-binary protected attributes, applicable across various metrics and tasks.

Contribution

It presents a novel, adaptable framework for debiasing datasets that incorporates synthetic data and optimization techniques, improving fairness and privacy preservation.

Findings

01. Genetic algorithms effectively produce fairer datasets.
02. The framework is metric- and task-agnostic.
03. Synthetic data use enhances privacy and fairness.

Abstract

The reason behind the unfair outcomes of AI is often rooted in biased datasets. Therefore, this work presents a framework for addressing fairness by debiasing datasets containing a (non-)binary protected attribute. The framework proposes a combinatorial optimization problem where heuristics such as genetic algorithms can be used to solve for the stated fairness objectives. The framework addresses this by finding a data subset that minimizes a certain discrimination measure. Depending on a user-defined setting, the framework enables different use cases, such as data removal, the addition of synthetic data, or exclusive use of synthetic data. The exclusive use of synthetic data in particular enhances the framework's ability to preserve privacy while optimizing for fairness. In a comprehensive evaluation, we demonstrate that under our framework, genetic algorithms can effectively yield…
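The subset-selection idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and does not use the fairdo API. Everything in it is an assumption made for illustration: the discrimination measure (the largest gap in positive-outcome rate between any two groups), the binary-mask encoding of a subset, the toy data, and all function names and hyperparameters.

```python
import numpy as np

def statistical_disparity(y, z):
    """Largest gap in positive-outcome rate between any two groups in z.
    (Assumed measure for this sketch; the framework accepts other measures.)"""
    rates = [y[z == g].mean() for g in np.unique(z)]
    return max(rates) - min(rates)

def fitness(mask, y, z, n_groups):
    """Disparity of the subset selected by a 0/1 mask (lower is better)."""
    sel = mask.astype(bool)
    # A subset that drops an entire group gets the worst possible score.
    if len(np.unique(z[sel])) < n_groups:
        return 1.0
    return statistical_disparity(y[sel], z[sel])

def genetic_subset_search(y, z, pop_size=40, generations=60,
                          mutation_rate=0.02, seed=0):
    """Hypothetical helper: search for a fairer subset with a simple GA."""
    rng = np.random.default_rng(seed)
    n, n_groups = len(y), len(np.unique(z))
    pop = rng.integers(0, 2, size=(pop_size, n))
    pop[0] = 1  # the full dataset is always one of the initial candidates
    for _ in range(generations):
        scores = np.array([fitness(ind, y, z, n_groups) for ind in pop])
        elite = pop[np.argsort(scores)[: pop_size // 2]]  # keep the best half
        n_children = pop_size - len(elite)
        parents_a = elite[rng.integers(0, len(elite), size=n_children)]
        parents_b = elite[rng.integers(0, len(elite), size=n_children)]
        # Uniform crossover: each bit comes from either parent with p = 0.5.
        take_a = rng.random(parents_a.shape) < 0.5
        children = np.where(take_a, parents_a, parents_b)
        # Bit-flip mutation.
        flip = rng.random(children.shape) < mutation_rate
        children = np.where(flip, 1 - children, children)
        pop = np.vstack([elite, children])
    scores = np.array([fitness(ind, y, z, n_groups) for ind in pop])
    best = int(np.argmin(scores))
    return pop[best].astype(bool), float(scores[best])

# Toy data: a protected attribute with three groups and group-dependent labels.
data_rng = np.random.default_rng(1)
z = data_rng.integers(0, 3, size=200)
y = (data_rng.random(200) < np.array([0.3, 0.5, 0.7])[z]).astype(int)

baseline = statistical_disparity(y, z)
best_mask, best_score = genetic_subset_search(y, z)
print(f"disparity before: {baseline:.3f}, after: {best_score:.3f}, "
      f"rows kept: {best_mask.sum()} of {len(y)}")
```

Because the best half of each generation is carried over unchanged and the full dataset is in the initial population, the returned subset never scores worse than the original data on the chosen measure. The same mask encoding could in principle be applied to a pool of synthetic rows instead of the original rows, which is one way to read the synthetic-data settings mentioned in the abstract.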

Code & Models

Repositories

mkduong-ai/fairdo
Official
