RD-DPP: Rate-Distortion Theory Meets Determinantal Point Process to Diversify Learning Data Samples
Xiwen Chen, Huayu Li, Rahul Amin, Abolfazl Razi

TL;DR
This paper introduces RD-DPP, a novel method combining Rate-Distortion theory and Determinantal Point Processes to improve diverse sample selection in data-limited learning scenarios, overcoming DPP limitations.
Contribution
The paper establishes a fundamental link between DPP and RD theory, proposing a bi-modal sampling method that enhances diversity selection beyond DPP's traditional constraints.
Findings
RD-DPP outperforms existing methods across multiple datasets and models.
The approach effectively addresses DPP's rank restriction and task customization issues.
The method demonstrates high generalizability to various learning tasks.
Abstract
In some practical learning tasks, such as traffic video analysis, the number of available training samples is restricted by different factors, such as limited communication bandwidth and computation power. Determinantal Point Process (DPP) is a common method for selecting the most diverse samples to enhance learning quality. However, the number of selected samples is restricted to the rank of the kernel matrix implied by the dimensionality of data samples. Secondly, it is not easily customizable to different learning tasks. In this paper, we propose a new way of measuring task-oriented diversity based on the Rate-Distortion (RD) theory, appropriate for multi-level classification. To this end, we establish a fundamental relationship between DPP and RD theory. We observe that the upper bound of the diversity of data selected by DPP has a universal trend of ,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Point processes and geometric inequalities · Sparse and Compressive Sensing Techniques
