Approximately Optimal Subset Selection for Statistical Design and   Modelling

Yu Wang; Nhu D. Le; James V. Zidek

arXiv:1709.00151·stat.CO·July 12, 2019

Approximately Optimal Subset Selection for Statistical Design and Modelling

Yu Wang, Nhu D. Le, James V. Zidek

PDF

Open Access

TL;DR

This paper introduces an efficient polynomial-time algorithm based on Determinantal Point Processes for approximately solving the optimal subset selection problem, which maximizes the determinant of a positive definite matrix, with applications in various statistical domains.

Contribution

The paper develops a novel polynomial-time approximation algorithm for subset selection using DPPs, improving computational efficiency in statistical design and modeling.

Findings

01

Algorithm achieves near-optimal solutions efficiently

02

Demonstrated effectiveness on synthetic data

03

Validated on real-world datasets

Abstract

We study the problem of optimal subset selection from a set of correlated random variables. In particular, we consider the associated combinatorial optimization problem of maximizing the determinant of a symmetric positive definite matrix that characterizes the chosen subset. This problem arises in many domains, such as experimental designs, regression modeling, and environmental statistics. We establish an efficient polynomial-time algorithm using Determinantal Point Process for approximating the optimal solution to the problem. We demonstrate the advantages of our methods by presenting computational results for both synthetic and real data sets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPoint processes and geometric inequalities · Risk and Portfolio Optimization · Mathematical Approximation and Integration