Fast determinantal point processes via distortion-free intermediate   sampling

Micha{\l} Derezi\'nski

arXiv:1811.03717·cs.LG·February 25, 2019·6 cites

Fast determinantal point processes via distortion-free intermediate sampling

Micha{\l} Derezi\'nski

PDF

Open Access

TL;DR

This paper introduces a fast, efficient algorithm for sampling from determinantal point processes with reduced preprocessing and sampling time, enabling scalable applications in machine learning and data analysis.

Contribution

The paper proposes a novel regularized DPP with a distortion-free intermediate distribution, achieving near-linear preprocessing and polynomial-time sampling independent of dataset size.

Findings

01

Preprocessing runs in near-linear time based on non-zero entries of the data matrix.

02

Sampling complexity is polynomial in the feature dimension, independent of dataset size.

03

The method maintains exact probabilities without distortion, ensuring accurate sampling.

Abstract

Given a fixed $n \times d$ matrix $X$ , where $n ≫ d$ , we study the complexity of sampling from a distribution over all subsets of rows where the probability of a subset is proportional to the squared volume of the parallelepiped spanned by the rows (a.k.a. a determinantal point process). In this task, it is important to minimize the preprocessing cost of the procedure (performed once) as well as the sampling cost (performed repeatedly). To that end, we propose a new determinantal point process algorithm which has the following two properties, both of which are novel: (1) a preprocessing step which runs in time $O (number-of-non-zeros (X) \cdot lo g n) + poly (d)$ , and (2) a sampling step which runs in $poly (d)$ time, independent of the number of rows $n$ . We achieve this by introducing a new regularized determinantal point process (R-DPP), which serves as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRandom Matrices and Applications · Point processes and geometric inequalities · Advanced Combinatorial Mathematics