Bridging Information Criteria and Parameter Shrinkage for Model Selection
Kun Zhang, Heng Peng, Laiwan Chan, Aapo Hyvarinen

TL;DR
This paper proposes a novel method combining information criteria and parameter shrinkage techniques for efficient and accurate model selection, applicable to complex models like Gaussian mixtures.
Contribution
It introduces a simple approach that uses data-dependent penalties to approximate information criteria, enabling easier optimization and model selection for complex models.
Findings
The method approximates BIC-based selection with high accuracy.
It produces exactly the same model as BIC in some cases.
Applicable to complex models like Gaussian mixtures and factor analyzers.
Abstract
Model selection based on classical information criteria, such as BIC, is generally computationally demanding, but its properties are well studied. On the other hand, model selection based on parameter shrinkage by -type penalties is computationally efficient. In this paper we make an attempt to combine their strengths, and propose a simple approach that penalizes the likelihood with data-dependent penalties as in adaptive Lasso and exploits a fixed penalization parameter. Even for finite samples, its model selection results approximately coincide with those based on information criteria; in particular, we show that in some special cases, this approach and the corresponding information criterion produce exactly the same model. One can also consider this approach as a way to directly determine the penalization parameter in adaptive Lasso to achieve information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Bayesian Methods and Mixture Models · Control Systems and Identification
