PRISM: A Rich Class of Parameterized Submodular Information Measures for Guided Subset Selection
Suraj Kothawade, Vishal Kaushal, Ganesh Ramakrishnan, Jeff Bilmes,, Rishabh Iyer

TL;DR
PRISM introduces a versatile class of parameterized submodular information measures that enable guided subset selection, balancing diversity, representation, and similarity to meet specific data-driven objectives.
Contribution
It proposes a novel, flexible framework for guided subset selection using parameterized submodular functions, generalizing previous methods and demonstrating effectiveness in real-world tasks.
Findings
PRISM outperforms state-of-the-art methods in targeted learning tasks.
PRISM achieves superior results in guided image-collection summarization.
The framework offers a broad modeling capability for diverse subset selection applications.
Abstract
With ever-increasing dataset sizes, subset selection techniques are becoming increasingly important for a plethora of tasks. It is often necessary to guide the subset selection to achieve certain desiderata, which includes focusing or targeting certain data points, while avoiding others. Examples of such problems include: i)targeted learning, where the goal is to find subsets with rare classes or rare attributes on which the model is underperforming, and ii)guided summarization, where data (e.g., image collection, text, document or video) is summarized for quicker human consumption with specific additional user intent. Motivated by such applications, we present PRISM, a rich class of PaRameterIzed Submodular information Measures. Through novel functions and their parameterizations, PRISM offers a variety of modeling capabilities that enable a trade-off between desired qualities of a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Advanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques
