If it ain't broke, don't fix it: Sparse metric repair
Anna C. Gilbert, Lalit Jain

TL;DR
This paper introduces three combinatorial algorithms for minimally adjusting noisy distance data to satisfy metric properties, ensuring data integrity with minimal alterations and improved computational efficiency.
Contribution
It presents novel algorithms for sparse metric repair that guarantee minimal changes and efficient computation, advancing data correction methods in metric-based applications.
Findings
Algorithms guarantee the sparsest solution in one setting.
All algorithms effectively repair the metric with minimal data alterations.
Running time is proportional to the cube of data points, reducible with prior info.
Abstract
Many modern data-intensive computational problems either require, or benefit from distance or similarity data that adhere to a metric. The algorithms run faster or have better performance guarantees. Unfortunately, in real applications, the data are messy and values are noisy. The distances between the data points are far from satisfying a metric. Indeed, there are a number of different algorithms for finding the closest set of distances to the given ones that also satisfy a metric (sometimes with the extra condition of being Euclidean). These algorithms can have unintended consequences, they can change a large number of the original data points, and alter many other features of the data. The goal of sparse metric repair is to make as few changes as possible to the original data set or underlying distances so as to ensure the resulting distances satisfy the properties of a metric. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
