Constructing Fair Latent Space for Intersection of Fairness and   Explainability

Hyungjun Joo; Hyeonggeun Han; Sehwan Kim; Sangwoo Hong; Jungwoo Lee

arXiv:2412.17523·cs.LG·January 22, 2025

Constructing Fair Latent Space for Intersection of Fairness and Explainability

Hyungjun Joo, Hyeonggeun Han, Sehwan Kim, Sangwoo Hong, Jungwoo Lee

PDF

Open Access 1 Video

TL;DR

This paper introduces a module that creates a fair latent space in generative models, improving fairness and explainability simultaneously, with minimal additional training and computational costs.

Contribution

It proposes a novel method to construct a fair latent space by disentangling labels and sensitive attributes, enhancing both fairness and explainability in generative models.

Findings

01

Effective fairness improvement demonstrated by various metrics

02

Provides faithful explanations for biased decisions

03

Reduces training time and costs by only training the module

Abstract

As the use of machine learning models has increased, numerous studies have aimed to enhance fairness. However, research on the intersection of fairness and explainability remains insufficient, leading to potential issues in gaining the trust of actual users. Here, we propose a novel module that constructs a fair latent space, enabling faithful explanation while ensuring fairness. The fair latent space is constructed by disentangling and redistributing labels and sensitive attributes, allowing the generation of counterfactual explanations for each type of information. Our module is attached to a pretrained generative model, transforming its biased latent space into a fair latent space. Additionally, since only the module needs to be trained, there are advantages in terms of time and cost savings, without the need to train the entire generative model. We validate the fair latent space…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Constructing Fair Latent Space for Intersection of Fairness and Explainability· underline

Taxonomy

TopicsExplainable Artificial Intelligence (XAI)