Approximate Algorithms For $k$-Sparse Wasserstein Barycenter With Outliers
Qingyuan Yang, Hu Ding

TL;DR
This paper introduces novel approximation algorithms for the $k$-sparse Wasserstein barycenter problem with outliers, leveraging clustering and coreset techniques to handle noise and high-dimensional data effectively.
Contribution
The paper develops the first clustering-based LP method and coreset approach for $k$-sparse Wasserstein barycenters with outliers, providing constant and $(1+ta)$-approximation guarantees.
Findings
Clustering-based LP method achieves constant approximation.
Coreset technique yields $(1+ta)$-approximation in low dimensions.
Algorithms demonstrate practical efficiency in experiments.
Abstract
Wasserstein Barycenter (WB) is one of the most fundamental optimization problems in optimal transportation. Given a set of distributions, the goal of WB is to find a new distribution that minimizes the average Wasserstein distance to them. The problem becomes even harder if we restrict the solution to be ``-sparse''. In this paper, we study the -sparse WB problem in the presence of outliers, which is a more practical setting since real-world data often contains noise. Existing WB algorithms cannot be directly extended to handle the case with outliers, and thus it is urgently needed to develop some novel ideas. First, we investigate the relation between -sparse WB with outliers and the clustering (with outliers) problems. In particular, we propose a clustering based LP method that yields constant approximation factor for the -sparse WB with outliers problem. Further, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Image Processing Techniques · Commutative Algebra and Its Applications · Topological and Geometric Data Analysis
MethodsSparse Evolutionary Training
