Projection predictive variable selection for discrete response families with finite support
Frank Weber, \"Anne Glass, Aki Vehtari

TL;DR
This paper introduces an exact projection method for Bayesian variable selection in models with discrete, finite support responses, improving accuracy at the cost of increased computation, and compares it to approximate methods.
Contribution
It presents the augmented-data projection, an exact solution for all discrete finite support response families, extending previous approximate latent-space approaches.
Findings
The augmented-data projection outperforms or matches the approximate method in simulations.
It is more computationally intensive, suitable for final model selection.
The method applies to both ordinal and nominal response families.
Abstract
The projection predictive variable selection is a decision-theoretically justified Bayesian variable selection approach achieving an outstanding trade-off between predictive performance and sparsity. Its projection problem is not easy to solve in general because it is based on the Kullback-Leibler divergence from a restricted posterior predictive distribution of the so-called reference model to the parameter-conditional predictive distribution of a candidate model. Previous work showed how this projection problem can be solved for response families employed in generalized linear models and how an approximate latent-space approach can be used for many other response families. Here, we present an exact projection method for all response families with discrete and finite support, called the augmented-data projection. A simulation study for an ordinal response family shows that the proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic and phenotypic traits in livestock · Gene expression and cancer classification · Statistical Methods and Inference
