Expectation-Maximization for Learning Determinantal Point Processes
Jennifer Gillenwater, Alex Kulesza, Emily Fox, Ben Taskar

TL;DR
This paper introduces an expectation-maximization algorithm for learning the full kernel matrix of a determinantal point process, enabling better modeling of set diversity and improving likelihood on real-world data.
Contribution
It presents a novel EM-based method for full kernel matrix learning in DPPs, overcoming previous restrictions and non-convexity issues.
Findings
Achieved up to 16.5% improvement in test log-likelihood.
Demonstrated effectiveness on a product recommendation dataset.
Provided a new optimization approach for DPP kernel learning.
Abstract
A determinantal point process (DPP) is a probabilistic model of set diversity compactly parameterized by a positive semi-definite kernel matrix. To fit a DPP to a given task, we would like to learn the entries of its kernel matrix by maximizing the log-likelihood of the available data. However, log-likelihood is non-convex in the entries of the kernel matrix, and this learning problem is conjectured to be NP-hard. Thus, previous work has instead focused on more restricted convex learning settings: learning only a single weight for each row of the kernel matrix, or learning weights for a linear combination of DPPs with fixed kernel matrices. In this work we propose a novel algorithm for learning the full kernel matrix. By changing the kernel parameterization from matrix entries to eigenvalues and eigenvectors, and then lower-bounding the likelihood in the manner of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPoint processes and geometric inequalities · Markov Chains and Monte Carlo Methods · Complexity and Algorithms in Graphs
