Fitting Metrics and Ultrametrics with Minimum Disagreements
Vincent Cohen-Addad, Chenglin Fan, Euiwoong Lee, Arnaud de Mesmay

TL;DR
This paper introduces improved approximation algorithms for the metric violation distance and ultrametric violation distance problems, with significant theoretical advances and practical simplicity, extending to related clustering and maximization variants.
Contribution
It presents an $O( ext{log } n)$-approximation for MVD, a constant-factor approximation for UMVD, and a new correlation clustering approach with hierarchical refinement.
Findings
O(log n) approximation for MVD
Constant factor approximation for UMVD
Hardness results for maximization variants
Abstract
Given recording pairwise distances, the METRIC VIOLATION DISTANCE (MVD) problem asks to compute the distance between and the metric cone; i.e., modify the minimum number of entries of to make it a metric. Due to its large number of applications in various data analysis and optimization tasks, this problem has been actively studied recently. We present an -approximation algorithm for MVD, exponentially improving the previous best approximation ratio of of Fan et al. [ SODA, 2018]. Furthermore, a major strength of our algorithm is its simplicity and running time. We also study the related problem of ULTRAMETRIC VIOLATION DISTANCE (UMVD), where the goal is to compute the distance to the cone of ultrametrics, and achieve a constant factor approximation algorithm. The UMVD can be regarded as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Automated Road and Building Extraction
