GoodRegressor: A Hierarchical Inductive Bias for Navigating High-Dimensional Compositional Space

Seong-Hoon Jang

arXiv:2510.18325·cond-mat.mtrl-sci·March 30, 2026

GoodRegressor: A Hierarchical Inductive Bias for Navigating High-Dimensional Compositional Space

Seong-Hoon Jang

PDF

TL;DR

GoodRegressor is a hierarchical symbolic regression framework that balances interpretability and predictive accuracy in complex scientific datasets by controlling interaction depth.

Contribution

It introduces a depth-controlled symbolic regression method that effectively explores vast compositional spaces while maintaining interpretability and outperforming black-box models.

Findings

01

Predictive performance matches or exceeds state-of-the-art black-box models.

02

Interaction-depth evolution reveals system-dependent optimal complexity windows.

03

Hierarchical inductive bias with depth control enhances interpretability in high-dimensional spaces.

Abstract

Interpretable scientific machine learning often trades predictive performance for structural transparency. When physical targets arise from hierarchical and nonlinear descriptor entanglement, weakly interacting white-box models underfit, whereas highly expressive black-box models obscure physical insight. Here I introduce GoodRegressor, a hierarchical depth-controlled symbolic regression framework that systematically assembles nonlinear descriptor interactions through lexicographically-ordered expansion. Despite effective compositional search spaces approaching $\sim 1 0^{400}$ structures, disciplined depth control enables tractable and reproducible exploration under realistic computational constraints. Across oxygen-ion conductors, NASICONs, and superconducting oxides, as representative high-complexity testbeds, predictive performances match or exceed state-of-the-art black-box models,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.