Information criteria for structured parameter selection in high   dimensional tree and graph models

Maarten Jansen

arXiv:2306.14026·cs.LG·June 27, 2023

Information criteria for structured parameter selection in high dimensional tree and graph models

Maarten Jansen

PDF

Open Access

TL;DR

This paper develops refined information criteria, specifically corrected Mallows's Cp, for structured parameter selection in high-dimensional tree and graph models, aiming to better balance false positives and negatives without shrinkage.

Contribution

It introduces corrected Mallows's Cp criteria tailored for structured high-dimensional models, improving parameter selection accuracy in trees and graphical models.

Findings

01

Corrected criteria reduce false positive overestimation.

02

Enhanced selection accuracy in high-dimensional models.

03

Applicable to non-shrinkage estimators in structured models.

Abstract

Parameter selection in high-dimensional models is typically finetuned in a way that keeps the (relative) number of false positives under control. This is because otherwise the few true positives may be dominated by the many possible false positives. This happens, for instance, when the selection follows from a naive optimisation of an information criterion, such as AIC or Mallows's Cp. It can be argued that the overestimation of the selection comes from the optimisation process itself changing the statistics of the selected variables, in a way that the information criterion no longer reflects the true divergence between the selection and the data generating process. In lasso, the overestimation can also be linked to the shrinkage estimator, which makes the selection too tolerant of false positive selections. For these reasons, this paper works on refined information criteria, carefully…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Statistical Methods and Inference