Fuzzy Clustering with Similarity Queries

Wasim Huleihel; Arya Mazumdar; Soumyabrata Pal

arXiv:2106.02212·cs.LG·November 5, 2021·1 cites

Fuzzy Clustering with Similarity Queries

Wasim Huleihel, Arya Mazumdar, Soumyabrata Pal

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a semi-supervised clustering framework that uses similarity queries to efficiently approximate fuzzy $k$-means clustering, making the problem computationally feasible and effective on real datasets.

Contribution

It proposes a novel active clustering algorithm that leverages similarity queries to solve fuzzy $k$-means efficiently, addressing nonconvexity and local minima issues.

Findings

01

Algorithms ask $O( ext{poly}(k) ext{log} n)$ similarity queries.

02

The approach achieves polynomial-time complexity.

03

Effective on real-world datasets.

Abstract

The fuzzy or soft $k$ -means objective is a popular generalization of the well-known $k$ -means problem, extending the clustering capability of the $k$ -means to datasets that are uncertain, vague, and otherwise hard to cluster. In this paper, we propose a semi-supervised active clustering framework, where the learner is allowed to interact with an oracle (domain expert), asking for the similarity between a certain set of chosen items. We study the query and computational complexities of clustering in this framework. We prove that having a few of such similarity queries enables one to get a polynomial-time approximation algorithm to an otherwise conjecturally NP-hard problem. In particular, we provide algorithms for fuzzy clustering in this setting that asks $O (poly (k) lo g n)$ similarity queries and run with polynomial-time-complexity, where $n$ is the number of items. The fuzzy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

omadson/fuzzy-c-means
jaxOfficial

Videos

Fuzzy Clustering with Similarity Queries· slideslive

Taxonomy

TopicsAdvanced Clustering Algorithms Research · Data Management and Algorithms · Face and Expression Recognition