GoodRegressor: A Hierarchical Inductive Bias for Navigating High-Dimensional Compositional Space
Seong-Hoon Jang

TL;DR
GoodRegressor is a hierarchical symbolic regression framework that balances interpretability and predictive accuracy in complex scientific datasets by controlling interaction depth.
Contribution
It introduces a depth-controlled symbolic regression method that effectively explores vast compositional spaces while maintaining interpretability and outperforming black-box models.
Findings
Predictive performance matches or exceeds state-of-the-art black-box models.
Interaction-depth evolution reveals system-dependent optimal complexity windows.
Hierarchical inductive bias with depth control enhances interpretability in high-dimensional spaces.
Abstract
Interpretable scientific machine learning often trades predictive performance for structural transparency. When physical targets arise from hierarchical and nonlinear descriptor entanglement, weakly interacting white-box models underfit, whereas highly expressive black-box models obscure physical insight. Here I introduce GoodRegressor, a hierarchical depth-controlled symbolic regression framework that systematically assembles nonlinear descriptor interactions through lexicographically-ordered expansion. Despite effective compositional search spaces approaching structures, disciplined depth control enables tractable and reproducible exploration under realistic computational constraints. Across oxygen-ion conductors, NASICONs, and superconducting oxides, as representative high-complexity testbeds, predictive performances match or exceed state-of-the-art black-box models,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
