Can Evolutionary Clustering Have Theoretical Guarantees?
Chao Qian

TL;DR
This paper provides the first theoretical guarantees for the approximation performance of a simple evolutionary algorithm, GSEMO, across multiple clustering formulations including fairness considerations, filling a significant gap in the understanding of evolutionary clustering methods.
Contribution
It proves that GSEMO can theoretically guarantee approximation performance for various clustering problems, including fairness-aware clustering, which was previously only supported by empirical results.
Findings
GSEMO achieves theoretical approximation guarantees for $k$-tMM, $k$-center, $k$-median, and $k$-means clustering.
GSEMO's performance guarantees extend to fairness-aware clustering under individual fairness constraints.
The work bridges the gap between empirical success and theoretical understanding of evolutionary clustering algorithms.
Abstract
Clustering is a fundamental problem in many areas, which aims to partition a given data set into groups based on some distance measure, such that the data points in the same group are similar while that in different groups are dissimilar. Due to its importance and NP-hardness, a lot of methods have been proposed, among which evolutionary algorithms are a class of popular ones. Evolutionary clustering has found many successful applications, but all the results are empirical, lacking theoretical support. This paper fills this gap by proving that the approximation performance of the GSEMO (a simple multi-objective evolutionary algorithm) for solving four formulations of clustering, i.e., -tMM, -center, discrete -median and -means, can be theoretically guaranteed. Furthermore, we consider clustering under fairness, which tries to avoid algorithmic bias, and has recently been an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Face and Expression Recognition · Gene expression and cancer classification
