Outliers Detection Is Not So Hard: Approximation Algorithms for Robust Clustering Problems Using Local Search Techniques
Yishui Wang, Rolf H. M\"ohring, Chenchen Wu, Dachuan Xu, Dongmei Zhang

TL;DR
This paper introduces new approximation algorithms for robust clustering problems like k-median and k-means with outliers, significantly improving their approximation ratios using local search techniques.
Contribution
It develops a novel analysis method for local search algorithms, achieving the best known approximation ratios for various robust clustering models.
Findings
Improved approximation ratio for k-Means with penalties from 25+ε to 9+ε.
Best known approximation ratio for k-Median with penalties.
Enhanced bi-criteria approximation algorithms for outlier models with tighter bounds.
Abstract
In this paper, we consider two types of robust models of the -median/-means problems: the outlier-version (-MedO/-MeaO) and the penalty-version (-MedP/-MeaP), in which we can mark some points as outliers and discard them. In -MedO/-MeaO, the number of outliers is bounded by a given integer. In -MedP/-MeaP, we do not bound the number of outliers, but each outlier will incur a penalty cost. We develop a new technique to analyze the approximation ratio of local search algorithms for these two problems by introducing an adapted cluster that can capture useful information about outliers in the local and the global optimal solution. For -MeaP, we improve the best known approximation ratio based on local search from to . For -MedP, we obtain the best known approximation ratio. For -MedO/-MeaO, there exists only two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFacility Location and Emergency Management · Multi-Criteria Decision Making · Advanced Statistical Methods and Models
