Active Learning of Custering with Side Information Using $\eps$-Smooth   Relative Regret Approximations

Nir Ailon; Ron Begleiter

arXiv:1201.6462·cs.LG·February 1, 2012·1 cites

Active Learning of Custering with Side Information Using $\eps$-Smooth Relative Regret Approximations

Nir Ailon, Ron Begleiter

PDF

Open Access

TL;DR

This paper introduces an active learning approach for semi-supervised clustering that adaptively biases pair selection based on side information, leading to faster convergence to optimal clustering with fewer queries.

Contribution

It proposes a novel iterative method using $ ext{ extsterling}$-smooth relative regret approximations to improve pair selection bias and clustering quality simultaneously.

Findings

01

Method outperforms uniform pair selection in convergence speed.

02

Biasing pair selection reduces query costs.

03

Algorithm converges faster to the optimal clustering solution.

Abstract

Clustering is considered a non-supervised learning setting, in which the goal is to partition a collection of data points into disjoint clusters. Often a bound $k$ on the number of clusters is given or assumed by the practitioner. Many versions of this problem have been defined, most notably $k$ -means and $k$ -median. An underlying problem with the unsupervised nature of clustering it that of determining a similarity function. One approach for alleviating this difficulty is known as clustering with side information, alternatively, semi-supervised clustering. Here, the practitioner incorporates side information in the form of "must be clustered" or "must be separated" labels for data point pairs. Each such piece of information comes at a "query cost" (often involving human response solicitation). The collection of labels is then incorporated in the usual clustering algorithm as either…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Advanced Clustering Algorithms Research · Face and Expression Recognition