Clustering Permutations: New Techniques with Streaming Applications

Diptarka Chakraborty; Debarati Das; Robert Krauthgamer

arXiv:2212.01821 · cs.DS · February 23, 2026


TL;DR

This paper introduces a new algorithmic framework for clustering permutations under the Ulam metric, achieving approximation ratios below the folklore factor of 2 and extending to streaming models and outlier handling.

Contribution

The paper presents a novel framework that improves approximation ratios for permutation clustering under the Ulam metric and extends these results to streaming and outlier scenarios.

Findings

01. Achieved a 1.999-approximation for the metric k-median problem under the Ulam metric.

02. Developed a streaming algorithm with polylogarithmic space complexity.

03. Extended results to handle outliers in permutation clustering.

Abstract

We study the classical metric $k$-median clustering problem over a set of input rankings (i.e., permutations), which has myriad applications, from social-choice theory to web search and databases. A folklore algorithm provides a $2$-approximate solution in polynomial time for all $k = O(1)$, and works irrespective of the underlying distance measure, so long as it is a metric; however, going below the $2$-factor is a notorious challenge. We consider the Ulam distance, a variant of the well-known edit-distance metric, where strings are restricted to be permutations. For this metric, Chakraborty, Das, and Krauthgamer [SODA, 2021] provided a $(2 - \delta)$-approximation algorithm for $k = 1$, where $\delta \approx 2^{-40}$. Our primary contribution is a new algorithmic framework for clustering a set of permutations. Our first result is a $1.999$-approximation algorithm for the metric $k$-median…
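To make the baseline concrete, here is a minimal Python sketch of the folklore approach mentioned in the abstract. It is not the paper's algorithm. It assumes the common move-based convention for the Ulam distance, under which the distance between two permutations of length $n$ equals $n$ minus the length of their longest common subsequence. The function names `ulam_distance` and `best_input_centers` are hypothetical and chosen only for illustration.

```python
from bisect import bisect_left
from itertools import combinations


def ulam_distance(x, y):
    """Ulam distance under the assumed convention: n - LCS(x, y).

    x and y must be permutations of the same set of symbols.
    """
    # Relabel y by the position of each symbol in x; the LCS of x and y
    # then becomes a longest increasing subsequence of the relabeled y.
    pos = {sym: i for i, sym in enumerate(x)}
    seq = [pos[sym] for sym in y]
    tails = []  # tails[i] = smallest tail of an increasing subsequence of length i + 1
    for v in seq:
        i = bisect_left(tails, v)
        if i == len(tails):
            tails.append(v)
        else:
            tails[i] = v
    return len(x) - len(tails)


def best_input_centers(perms, k):
    """Folklore baseline: try every k-subset of the inputs as centers.

    Restricting centers to input points costs at most a factor of 2 in any
    metric, and the enumeration is polynomial when k is a constant.
    """
    best_cost, best_centers = None, None
    for centers in combinations(perms, k):
        cost = sum(min(ulam_distance(p, c) for c in centers) for p in perms)
        if best_cost is None or cost < best_cost:
            best_cost, best_centers = cost, centers
    return best_centers, best_cost
```

For example, `best_input_centers([(1, 2, 3, 4), (2, 3, 4, 1), (1, 2, 4, 3)], 1)` selects `(1, 2, 3, 4)` as the single center, since it is within one move of each of the other two inputs. The exhaustive search is meant only to illustrate the factor-2 baseline that the paper improves on.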
