Fair Model-based Clustering

Jinwon Park; Kunwoong Kim; Jihu Lee; Yongdai Kim

arXiv:2602.21509·stat.ML·February 26, 2026

Fair Model-based Clustering

Jinwon Park, Kunwoong Kim, Jihu Lee, Yongdai Kim

PDF

Open Access 1 Video

TL;DR

This paper introduces Fair Model-based Clustering (FMC), a scalable and flexible algorithm that achieves fair clustering by modeling data with a finite mixture model, suitable for large and non-metric datasets.

Contribution

FMC is a novel fair clustering method based on finite mixture models, with parameters independent of sample size, enabling scalable and applicable to non-metric data.

Findings

01

FMC scales efficiently to large datasets.

02

FMC achieves approximately fair clustering with mini-batch learning.

03

Theoretical and empirical results demonstrate FMC's superiority.

Abstract

The goal of fair clustering is to find clusters such that the proportion of sensitive attributes (e.g., gender, race, etc.) in each cluster is similar to that of the entire dataset. Various fair clustering algorithms have been proposed that modify standard K-means clustering to satisfy a given fairness constraint. A critical limitation of several existing fair clustering algorithms is that the number of parameters to be learned is proportional to the sample size because the cluster assignment of each datum should be optimized simultaneously with the cluster center, and thus scaling up the algorithms is difficult. In this paper, we propose a new fair clustering algorithm based on a finite mixture model, called Fair Model-based Clustering (FMC). A main advantage of FMC is that the number of learnable parameters is independent of the sample size and thus can be scaled up easily. In…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Fair Model-based Clustering· underline

Taxonomy

TopicsBayesian Methods and Mixture Models · Advanced Clustering Algorithms Research · Ethics and Social Impacts of AI