Crowdsourced correlation clustering with relative distance comparisons

Antti Ukkonen

arXiv:1709.08459·cs.DS·September 26, 2017

Crowdsourced correlation clustering with relative distance comparisons

Antti Ukkonen

PDF

TL;DR

This paper introduces a new correlation clustering method based on relative distance comparisons, enabling direct clustering from human-provided comparisons without intermediate steps, and demonstrates its effectiveness through experiments.

Contribution

It defines a novel correlation clustering variant for relative distance data and provides approximation and practical algorithms with empirical validation.

Findings

01

Our method outperforms existing complex methods on synthetic data.

02

It efficiently produces intuitive clusterings from real human comparison data.

03

The approach is suitable for crowdsourced clustering tasks.

Abstract

Crowdsourced, or human computation based clustering algorithms usually rely on relative distance comparisons, as these are easier to elicit from human workers than absolute distance information. A relative distance comparison is a statement of the form "item A is closer to item B than to item C". However, many existing clustering algorithms that use relative distances are rather complex. They are often based on a two-step approach, where the relative distances are first used to learn either a distance matrix, or an embedding of the items, and then some standard clustering method is applied in a second step. In this paper we argue that it should be possible to compute a clustering directly from relative distance comparisons. Our ideas are built upon existing work on correlation clustering, a well-known non-parametric approach to clustering. The technical contribution of this work is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.