Fuzzy Clustering with Similarity Queries
Wasim Huleihel, Arya Mazumdar, Soumyabrata Pal

TL;DR
This paper introduces a semi-supervised clustering framework that uses similarity queries to efficiently approximate fuzzy $k$-means clustering, making the problem computationally feasible and effective on real datasets.
Contribution
It proposes a novel active clustering algorithm that leverages similarity queries to solve fuzzy $k$-means efficiently, addressing nonconvexity and local minima issues.
Findings
Algorithms ask $O( ext{poly}(k) ext{log} n)$ similarity queries.
The approach achieves polynomial-time complexity.
Effective on real-world datasets.
Abstract
The fuzzy or soft -means objective is a popular generalization of the well-known -means problem, extending the clustering capability of the -means to datasets that are uncertain, vague, and otherwise hard to cluster. In this paper, we propose a semi-supervised active clustering framework, where the learner is allowed to interact with an oracle (domain expert), asking for the similarity between a certain set of chosen items. We study the query and computational complexities of clustering in this framework. We prove that having a few of such similarity queries enables one to get a polynomial-time approximation algorithm to an otherwise conjecturally NP-hard problem. In particular, we provide algorithms for fuzzy clustering in this setting that asks similarity queries and run with polynomial-time-complexity, where is the number of items. The fuzzy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Data Management and Algorithms · Face and Expression Recognition
