Crowdsourced correlation clustering with relative distance comparisons
Antti Ukkonen

TL;DR
This paper introduces a new correlation clustering method based on relative distance comparisons, enabling direct clustering from human-provided comparisons without intermediate steps, and demonstrates its effectiveness through experiments.
Contribution
It defines a novel correlation clustering variant for relative distance data and provides approximation and practical algorithms with empirical validation.
Findings
Our method outperforms existing complex methods on synthetic data.
It efficiently produces intuitive clusterings from real human comparison data.
The approach is suitable for crowdsourced clustering tasks.
Abstract
Crowdsourced, or human computation based clustering algorithms usually rely on relative distance comparisons, as these are easier to elicit from human workers than absolute distance information. A relative distance comparison is a statement of the form "item A is closer to item B than to item C". However, many existing clustering algorithms that use relative distances are rather complex. They are often based on a two-step approach, where the relative distances are first used to learn either a distance matrix, or an embedding of the items, and then some standard clustering method is applied in a second step. In this paper we argue that it should be possible to compute a clustering directly from relative distance comparisons. Our ideas are built upon existing work on correlation clustering, a well-known non-parametric approach to clustering. The technical contribution of this work is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
