Fair Correlation Clustering in Forests
Katrin Casel, Tobias Friedrich, Martin Schirneck, Simon Wietheger

TL;DR
This paper explores the computational complexity of Fair Correlation Clustering on forests, revealing conditions under which the problem is tractable or intractable, and emphasizing that fairness constraints, not strictness, drive complexity.
Contribution
It characterizes the distributions of sensitive attributes on forests where Fair Correlation Clustering is solvable efficiently, and shows the problem's complexity is influenced more by fairness constraints than strictness.
Findings
Fair Correlation Clustering is tractable on certain forest classes.
Hardness is influenced more by fairness constraints than strictness.
The problem's complexity varies with sensitive attribute distributions.
Abstract
The study of algorithmic fairness received growing attention recently. This stems from the awareness that bias in the input data for machine learning systems may result in discriminatory outputs. For clustering tasks, one of the most central notions of fairness is the formalization by Chierichetti, Kumar, Lattanzi, and Vassilvitskii [NeurIPS 2017]. A clustering is said to be fair, if each cluster has the same distribution of manifestations of a sensitive attribute as the whole input set. This is motivated by various applications where the objects to be clustered have sensitive attributes that should not be over- or underrepresented. We discuss the applicability of this fairness notion to Correlation Clustering. The existing literature on the resulting Fair Correlation Clustering problem either presents approximation algorithms with poor approximation guarantees or severely limits the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
