Dissimilarity-based Sparse Subset Selection
Ehsan Elhamifar, Guillermo Sapiro, S. Shankar Sastry

TL;DR
This paper introduces a convex optimization framework for selecting representative subsets from dissimilarity data, enabling efficient clustering, outlier handling, and applications in vision and time-series analysis.
Contribution
It proposes a novel dissimilarity-based sparse subset selection method with a convex relaxation and ADMM implementation, improving clustering and representation tasks.
Findings
Effectively identifies representatives from multiple groups.
Handles outliers robustly in dissimilarity data.
Outperforms state-of-the-art in scene categorization and time-series segmentation.
Abstract
Finding an informative subset of a large collection of data points or models is at the center of many problems in computer vision, recommender systems, bio/health informatics as well as image and natural language processing. Given pairwise dissimilarities between the elements of a `source set' and a `target set,' we consider the problem of finding a subset of the source set, called representatives or exemplars, that can efficiently describe the target set. We formulate the problem as a row-sparsity regularized trace minimization problem. Since the proposed formulation is, in general, NP-hard, we consider a convex relaxation. The solution of our optimization finds representatives and the assignment of each element of the target set to each representative, hence, obtaining a clustering. We analyze the solution of our proposed optimization as a function of the regularization parameter. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAlternating Direction Method of Multipliers
