A simple data-driven method to optimise the penalty strengths of penalised models and its application to non-parametric smoothing
Jens Thomas, Mathias Lipka

TL;DR
This paper introduces a generalized information criterion, AICp, for optimizing penalty strengths in penalized models, especially useful for non-parametric smoothing in astrophysics, eliminating the need for Monte Carlo simulations.
Contribution
It develops a flexible, model-agnostic criterion AICp for penalty optimization, extending traditional AIC to continuum model selection and non-parametric contexts.
Findings
AICp enables penalty optimization without Monte Carlo simulations.
The method applies to linear, non-linear, parametric, and non-parametric models.
Demonstrated in astrophysical non-parametric smoothing applications.
Abstract
Information of interest can often only be extracted from data by model fitting. When the functional form of such a model can not be deduced from first principles, one has to make a choice between different possible models. A common approach in such cases is to minimise the information loss in the model by trying to reduce the number of fit variables (or the model flexibility, respectively) as much as possible while still yielding an acceptable fit to the data. Model selection via the Akaike Information Criterion (AIC) provides such an implementation of Occam's razor. We argue that the same principles can be applied to optimise the penalty-strength of a penalised maximum-likelihood model. However, while in typical applications AIC is used to choose from a finite, discrete set of maximum-likelihood models the penalty optimisation requires to select out of a continuum of candidate models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical and numerical algorithms · Gaussian Processes and Bayesian Inference · Statistical Methods and Inference
