# Attraction-Repulsion clustering with applications to fairness

**Authors:** Eustasio del Barrio, Hristo Inouzhe, Jean-Michel Loubes

arXiv: 1904.05254 · 2021-10-27

## TL;DR

This paper introduces a novel clustering method that enhances diversity and fairness by applying attraction-repulsion perturbations to dissimilarities, improving demographic parity in various datasets.

## Contribution

It proposes a new pre-processing approach using attraction-repulsion dissimilarities to promote diversity in clustering, compatible with multiple methods and data types.

## Key findings

- Improves diversity with respect to protected attributes.
- Compatible with various clustering algorithms and non-Euclidean data.
- Demonstrated effectiveness on synthetic and real datasets.

## Abstract

We consider the problem of diversity enhancing clustering, i.e, developing clustering methods which produce clusters that favour diversity with respect to a set of protected attributes such as race, sex, age, etc. In the context of fair clustering, diversity plays a major role when fairness is understood as demographic parity. To promote diversity, we introduce perturbations to the distance in the unprotected attributes that account for protected attributes in a way that resembles attraction-repulsion of charged particles in Physics. These perturbations are defined through dissimilarities with a tractable interpretation. Cluster analysis based on attraction-repulsion dissimilarities penalizes homogeneity of the clusters with respect to the protected attributes and leads to an improvement in diversity. An advantage of our approach, which falls into a pre-processing set-up, is its compatibility with a wide variety of clustering methods and whit non-Euclidean data. We illustrate the use of our procedures with both synthetic and real data and provide discussion about the relation between diversity, fairness, and cluster structure. Our procedures are implemented in an R package freely available at https://github.com/HristoInouzhe/AttractionRepulsionClustering.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.05254/full.md

## Figures

61 figures with captions in the complete paper: https://tomesphere.com/paper/1904.05254/full.md

## References

39 references — full list in the complete paper: https://tomesphere.com/paper/1904.05254/full.md

---
Source: https://tomesphere.com/paper/1904.05254