Active Learning of Custering with Side Information Using $\eps$-Smooth Relative Regret Approximations
Nir Ailon, Ron Begleiter

TL;DR
This paper introduces an active learning approach for semi-supervised clustering that adaptively biases pair selection based on side information, leading to faster convergence to optimal clustering with fewer queries.
Contribution
It proposes a novel iterative method using $ ext{ extsterling}$-smooth relative regret approximations to improve pair selection bias and clustering quality simultaneously.
Findings
Method outperforms uniform pair selection in convergence speed.
Biasing pair selection reduces query costs.
Algorithm converges faster to the optimal clustering solution.
Abstract
Clustering is considered a non-supervised learning setting, in which the goal is to partition a collection of data points into disjoint clusters. Often a bound on the number of clusters is given or assumed by the practitioner. Many versions of this problem have been defined, most notably -means and -median. An underlying problem with the unsupervised nature of clustering it that of determining a similarity function. One approach for alleviating this difficulty is known as clustering with side information, alternatively, semi-supervised clustering. Here, the practitioner incorporates side information in the form of "must be clustered" or "must be separated" labels for data point pairs. Each such piece of information comes at a "query cost" (often involving human response solicitation). The collection of labels is then incorporated in the usual clustering algorithm as either…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Advanced Clustering Algorithms Research · Face and Expression Recognition
