Adaptive Hierarchical Clustering Using Ordinal Queries
Ehsan Emamjomeh-Zadeh, David Kempe

TL;DR
This paper presents an adaptive algorithm for hierarchical clustering using ordinal queries, which is robust to noise and significantly more efficient than non-adaptive methods, with theoretical guarantees on correctness and query complexity.
Contribution
It introduces a robust, adaptive algorithm for learning hierarchical clusterings from ordinal queries, improving efficiency and noise tolerance over prior non-adaptive approaches.
Findings
Deterministic algorithm uses at most n log n queries in the noise-free case.
The noisy algorithm achieves correct clustering with high probability using O(n log n + n log(1/δ)) queries.
Non-adaptive algorithms require at least Ω(n^3) queries, highlighting the importance of adaptivity.
Abstract
In many applications of clustering (for example, ontologies or clusterings of animal or plant species), hierarchical clusterings are more descriptive than a flat clustering. A hierarchical clustering over elements is represented by a rooted binary tree with leaves, each corresponding to one element. The subtrees rooted at interior nodes capture the clusters. In this paper, we study active learning of a hierarchical clustering using only ordinal queries. An ordinal query consists of a set of three elements, and the response to a query reveals the two elements (among the three elements in the query) which are "closer" to each other than to the third one. We say that elements and are closer to each other than if there exists a cluster containing and , but not . When all the query responses are correct, there is a deterministic algorithm that learns the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Algorithms and Data Compression · Advanced Database Systems and Queries
