Approximation Schemes for Clustering with Outliers
Zachary Friggstad, Kamyar Khodamoradi, Mohsen Rezapour, Mohammad R., Salavatipour

TL;DR
This paper develops approximation algorithms for clustering problems with outliers, focusing on UFL, k-Median, and k-Means, especially in doubling and minor-closed graph metrics, providing PTAS and bicriteria solutions.
Contribution
It introduces PTAS for uniform-cost UFL with outliers and bicriteria approximations for k-Median and k-Means in special metrics, extending the theoretical understanding of clustering with outliers.
Findings
PTAS for uniform-cost UFL with outliers on doubling metrics
Bicriteria approximation for k-Median and k-Means with outliers in special metrics
Local search heuristics can have unbounded gap in Euclidean metrics
Abstract
Clustering problems are well-studied in a variety of fields such as data science, operations research, and computer science. Such problems include variants of centre location problems, -median, and -means to name a few. In some cases, not all data points need to be clustered; some may be discarded for various reasons. We study clustering problems with outliers. More specifically, we look at Uncapacitated Facility Location (UFL), -Median, and -Means. In UFL with outliers, we have to open some centres, discard up to points of and assign every other point to the nearest open centre, minimizing the total assignment cost plus centre opening costs. In -Median and -Means, we have to open up to centres but there are no opening costs. In -Means, the cost of assigning to is . We present several results. Our main focus is on cases…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFacility Location and Emergency Management · Vehicle Routing Optimization Methods · Optimization and Search Problems
