Complexity and Approximation of the Fuzzy K-Means Problem
Johannes Bl\"omer, Sascha Brauer, and Kathrin Bujna

TL;DR
This paper studies the computational complexity of fuzzy K-means, showing optimal solutions are generally inexpressible with radicals, and introduces new approximation algorithms with provable guarantees.
Contribution
It provides the first approximation algorithms for fuzzy K-means, including deterministic and randomized methods, with complexity bounds and connections to hard clustering.
Findings
Optimal fuzzy K-means solutions cannot be expressed by radicals.
First polynomial-time approximation algorithms for fuzzy K-means.
Algorithms achieve (1+ε)-approximation with practical runtime bounds.
Abstract
The fuzzy -means problem is a generalization of the classical -means problem to soft clusterings, i.e. clusterings where each points belongs to each cluster to some degree. Although popular in practice, prior to this work the fuzzy -means problem has not been studied from a complexity theoretic or algorithmic perspective. We show that optimal solutions for fuzzy -means cannot, in general, be expressed by radicals over the input points. Surprisingly, this already holds for very simple inputs in one-dimensional space. Hence, one cannot expect to compute optimal solutions exactly. We give the first -approximation algorithms for the fuzzy -means problem. First, we present a deterministic approximation algorithm whose runtime is polynomial in and linear in the dimension of the input set, given that is constant, i.e. a polynomial time approximation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Criteria Decision Making · Rough Sets and Fuzzy Logic · Data Management and Algorithms
