An information-based model selection criterion for data-driven model discovery
Michael C Chung, Alen Zacharia, Juan Guan

TL;DR
This paper introduces a new information-based criterion called SLIC for automatic model selection in data-driven discovery, improving accuracy and interpretability in complex systems.
Contribution
It proposes a novel sample-length-scaling logarithmic information criterion (SLIC) that automates model selection and outperforms existing criteria in identifying correct models.
Findings
SLIC outperforms other criteria in identifying correct models from data.
SLIC successfully discovers interpretable models in fluid dynamics and nanotechnology.
The method generates new testable predictions from experimental data.
Abstract
Data-driven model discovery (DDMD) algorithms are powerful tools for extracting interpretable symbolic models from data. However, identifying the model that best balances goodness-of-fit and sparsity is often a laborious process requiring user fine-tuning, is prone to overfitting, and results may significantly vary depending on model initialization and specific training procedure. Here, we present a sparse regression algorithm that automatically and adaptively generates candidate models, and uses a novel sample-length-scaling logarithmic information criterion (SLIC) to identify the best model from these candidates. We demonstrate that SLIC greatly outperforms other popular information criteria in extracting the correct model from the data of several nonlinear ordinary and partial differential equations. Then, we demonstrate SLIC's ability to discover interpretable models from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Machine Learning in Materials Science · Explainable Artificial Intelligence (XAI)
