A Note on Model Selection for Small Sample Regression
Masanori Kawakita, Jun'ichi Takeuchi

TL;DR
This paper analyzes the Direct Eigenvalue Estimator (DEE) for small sample regression, identifies a flaw in its derivation, and proposes an improved class of risk estimators (mDEE) that outperform DEE and ADJ in experiments.
Contribution
It reveals a flaw in DEE's derivation and introduces a new class of risk estimators (mDEE) that enhance performance in small sample regression.
Findings
mDEE often outperforms DEE and ADJ in experiments
DEE's derivation contains an inappropriate part affecting its potential
The new class of estimators improves risk estimation accuracy
Abstract
The risk estimator called "Direct Eigenvalue Estimator" (DEE) is studied. DEE was developed for small sample regression. In contrast to many existing model selection criteria, derivation of DEE requires neither any asymptotic assumption nor any prior knowledge about the noise variance and the noise distribution. It was reported that DEE performed well in small sample cases but DEE performed a little worse than the state-of-the-art ADJ. This seems somewhat counter-intuitive because DEE was developed for specifically regression problem by exploiting available information exhaustively, while ADJ was developed for general setting. In this paper, we point out that the derivation of DEE includes an inappropriate part in spite that the resultant form of DEE is valid in a sense. As its result, DEE cannot derive its potential. We introduce a class of valid risk estimators based on the idea of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Bayesian Methods and Mixture Models
