Information Criteria for Deciding between Normal Regression Models
Robert S. Maier

TL;DR
This paper discusses the use and limitations of the Akaike Information Criterion (AIC) and its small-sample correction AICc in model selection, emphasizing proper application and introducing significance testing for model comparison.
Contribution
It clarifies the correct usage of AIC and AICc in physical sciences, corrects common misapplications, and proposes a significance test for model comparison based on AIC differences.
Findings
AIC should be used when error variances are known.
AICc is only valid when error variances are unknown.
A significance test for AIC differences can replace Akaike weights.
Abstract
Regression models fitted to data can be assessed on their goodness of fit, though models with many parameters should be disfavored to prevent over-fitting. Statisticians' tools for this are little known to physical scientists. These include the Akaike Information Criterion (AIC), a penalized goodness-of-fit statistic, and the AICc, a variant including a small-sample correction. They entered the physical sciences through being used by astrophysicists to compare cosmological models; e.g., predictions of the distance-redshift relation. The AICc is shown to have been misapplied, being applicable only if error variances are unknown. If error bars accompany the data, the AIC should be used instead. Erroneous applications of the AICc are listed in an appendix. It is also shown how the variability of the AIC difference between models with a known error variance can be estimated. This yields a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Mechanics and Entropy · Statistical and numerical algorithms · Advanced Statistical Methods and Models
