Nested and outlier embeddings into trees
Shuchi Chawla, Kristin Sheridan

TL;DR
This paper introduces an efficient algorithm for embedding metric spaces into Hierarchically Separated Trees (HSTs) with outlier handling, achieving low distortion and few outliers, advancing metric embedding techniques.
Contribution
The paper presents a novel algorithm for probabilistic outlier embeddings into HSTs with controlled distortion and outlier count, combining nested embedding methods with existing approximation algorithms.
Findings
Efficient algorithm for outlier embeddings with low distortion.
Bound on outliers proportional to (k/psilon) log^2k.
Achieves probabilistic embeddings with at most O(k/psilon) outliers.
Abstract
In this paper, we consider outlier embeddings into HSTs. In particular, for metric , let be the size of the smallest subset of such that all but that subset (the ``outlier set'') can be probabilistically embedded into the space of HSTs with expected distortion at most . Our primary result is showing that there exists an efficient algorithm that takes in and a target distortion and samples from a probabilistic embedding with at most outliers and distortion at most , for any . In order to facilitate our results, we show how to find good nested embeddings into HSTs and combine this with an approximation algorithm of Munagala et al. [MST23] to obtain our results.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplexity and Algorithms in Graphs · Computational Geometry and Mesh Generation · Topological and Geometric Data Analysis
