Dissimilarity-based Sparse Subset Selection

Ehsan Elhamifar; Guillermo Sapiro; S. Shankar Sastry

arXiv:1407.6810·cs.LG·April 12, 2016

Dissimilarity-based Sparse Subset Selection

Ehsan Elhamifar, Guillermo Sapiro, S. Shankar Sastry

PDF

TL;DR

This paper introduces a convex optimization framework for selecting representative subsets from dissimilarity data, enabling efficient clustering, outlier handling, and applications in vision and time-series analysis.

Contribution

It proposes a novel dissimilarity-based sparse subset selection method with a convex relaxation and ADMM implementation, improving clustering and representation tasks.

Findings

01

Effectively identifies representatives from multiple groups.

02

Handles outliers robustly in dissimilarity data.

03

Outperforms state-of-the-art in scene categorization and time-series segmentation.

Abstract

Finding an informative subset of a large collection of data points or models is at the center of many problems in computer vision, recommender systems, bio/health informatics as well as image and natural language processing. Given pairwise dissimilarities between the elements of a `source set' and a `target set,' we consider the problem of finding a subset of the source set, called representatives or exemplars, that can efficiently describe the target set. We formulate the problem as a row-sparsity regularized trace minimization problem. Since the proposed formulation is, in general, NP-hard, we consider a convex relaxation. The solution of our optimization finds representatives and the assignment of each element of the target set to each representative, hence, obtaining a clustering. We analyze the solution of our proposed optimization as a function of the regularization parameter. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAlternating Direction Method of Multipliers