Fast determinantal point processes via distortion-free intermediate sampling
Micha{\l} Derezi\'nski

TL;DR
This paper introduces a fast, efficient algorithm for sampling from determinantal point processes with reduced preprocessing and sampling time, enabling scalable applications in machine learning and data analysis.
Contribution
The paper proposes a novel regularized DPP with a distortion-free intermediate distribution, achieving near-linear preprocessing and polynomial-time sampling independent of dataset size.
Findings
Preprocessing runs in near-linear time based on non-zero entries of the data matrix.
Sampling complexity is polynomial in the feature dimension, independent of dataset size.
The method maintains exact probabilities without distortion, ensuring accurate sampling.
Abstract
Given a fixed matrix , where , we study the complexity of sampling from a distribution over all subsets of rows where the probability of a subset is proportional to the squared volume of the parallelepiped spanned by the rows (a.k.a. a determinantal point process). In this task, it is important to minimize the preprocessing cost of the procedure (performed once) as well as the sampling cost (performed repeatedly). To that end, we propose a new determinantal point process algorithm which has the following two properties, both of which are novel: (1) a preprocessing step which runs in time , and (2) a sampling step which runs in time, independent of the number of rows . We achieve this by introducing a new regularized determinantal point process (R-DPP), which serves as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications · Point processes and geometric inequalities · Advanced Combinatorial Mathematics
