SPOT: A framework for selection of prototypes using optimal transport

Karthik S. Gurumoorthy; Pratik Jawanpuria; Bamdev Mishra

arXiv:2103.10159·cs.LG·April 6, 2021

SPOT: A framework for selection of prototypes using optimal transport

Karthik S. Gurumoorthy, Pratik Jawanpuria, Bamdev Mishra

PDF

TL;DR

This paper introduces SPOT, an optimal transport-based framework for selecting representative prototypes that efficiently summarize datasets, aiding interpretability and decision-making in machine learning applications.

Contribution

The paper presents a novel OT-based method for prototype selection that leverages submodularity and greedy algorithms for efficient and effective data summarization.

Findings

01

Outperforms existing methods on real-world benchmarks.

02

Provides a fast, deterministic approximation algorithm.

03

Effectively identifies prototypes that best represent target data.

Abstract

In this work, we develop an optimal transport (OT) based framework to select informative prototypical examples that best represent a given target dataset. Summarizing a given target dataset via representative examples is an important problem in several machine learning applications where human understanding of the learning models and underlying data distribution is essential for decision making. We model the prototype selection problem as learning a sparse (empirical) probability distribution having the minimum OT distance from the target distribution. The learned probability measure supported on the chosen prototypes directly corresponds to their importance in representing the target data. We show that our objective function enjoys a key property of submodularity and propose an efficient greedy method that is both computationally fast and possess deterministic approximation guarantees.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.