Selecting Diverse Models for Scientific Insight

Laura J. Wendelberger; Brian J. Reich; Alyson G. Wilson

arXiv:2006.09157·stat.ME·December 17, 2021

Selecting Diverse Models for Scientific Insight

Laura J. Wendelberger, Brian J. Reich, Alyson G. Wilson

PDF

Open Access

TL;DR

This paper introduces a multi-model penalized regression approach to identify diverse models that reveal multiple explanatory patterns in data, addressing model uncertainty in scientific modeling.

Contribution

It proposes a novel method for selecting multiple diverse models simultaneously, explicitly limiting model similarity to uncover different underlying predictor patterns.

Findings

01

Successfully identified multiple models explaining SFE from steel composition.

02

Demonstrated the method's ability to promote model diversity and variable selection.

03

Showed potential for revealing varied scientific insights from data.

Abstract

Model selection often aims to choose a single model, assuming that the form of the model is correct. However, there may be multiple possible underlying explanatory patterns in a set of predictors that could explain a response. Model selection without regard for model uncertainty can fail to bring these patterns to light. We explore multi-model penalized regression (MMPR) to acknowledge model uncertainty in the context of penalized regression. We examine how different penalty settings can promote either shrinkage or sparsity of coefficients in separate models. The method is tuned to explicitly limit model similarity. A choice of penalty form that enforces variable selection is applied to predict stacking fault energy (SFE) from steel alloy composition. The aim is to identify multiple models with different subsets of covariates that explain a single type of response.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFault Detection and Control Systems