A Prediction Divergence Criterion for Model Selection
Stephane Guerrier, Maria-Pia Victoria-Feser

TL;DR
This paper introduces the Prediction Divergence Criterion (PDC), a new model selection method based on divergence measures, specifically designed for linear regression, with advantages in avoiding overfitting and handling correlated variables.
Contribution
The paper proposes a novel model selection criterion, PDC, derived from a new class of error measures, extending existing criteria and demonstrating improved performance in sparse, correlated settings.
Findings
PDC is asymptotically loss efficient and consistent.
PDC reduces overfitting compared to Mallow's Cp.
PDC performs well in sparse, correlated variable scenarios.
Abstract
The problem of model selection is inevitable in an increasingly large number of applications involving partial theoretical knowledge and vast amounts of information, like in medicine, biology or economics. The associated techniques are intended to determine which variables are "important" to "explain a phenomenon under investigation. The terms "important" and "explain" can have very different meanings according to the context and, in fact, model selection can be applied to any situation where one tries to balance variability with complexity. In this paper, we introduce a new class of error measures and of model selection criteria, to which many well know selection criteria belong. Moreover, this class enables us to derive a novel criterion, based on a divergence measure between the predictions produced by two nested models, called the Prediction Divergence Criterion (PDC). Our selection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Statistical Methods and Bayesian Inference
