Prior-Guided Symbolic Regression: Towards Scientific Consistency in Equation Discovery
Jing Xiao, Xinhai Chen, Jiaming Peng, Qinglin Wang, Menghan Jia, Zhiquan Lai, Guangping Yu, Dongsheng Li, Tiejun Li, Jie Liu

TL;DR
This paper introduces PG-SR, a prior-guided symbolic regression framework that incorporates domain priors and a novel evaluation mechanism to discover scientifically consistent equations, improving interpretability and reliability.
Contribution
We propose PG-SR, a three-stage symbolic regression method that explicitly encodes domain priors and uses a prior annealing evaluation to ensure scientific consistency.
Findings
PG-SR outperforms state-of-the-art methods across multiple domains.
PG-SR maintains robustness to noisy data and limited data scenarios.
Theoretically, PG-SR reduces hypothesis space complexity, providing generalization guarantees.
Abstract
Symbolic Regression (SR) aims to discover interpretable equations from observational data, with the potential to reveal underlying principles behind natural phenomena. However, existing approaches often fall into the Pseudo-Equation Trap: producing equations that fit observations well but remain inconsistent with fundamental scientific principles. A key reason is that these approaches are dominated by empirical risk minimization, lacking explicit constraints to ensure scientific consistency. To bridge this gap, we propose PG-SR, a prior-guided SR framework built upon a three-stage pipeline consisting of warm-up, evolution, and refinement. Throughout the pipeline, PG-SR introduces a prior constraint checker that explicitly encodes domain priors as executable constraint programs, and employs a Prior Annealing Constrained Evaluation (PACE) mechanism during the evolution stage to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Evolutionary Algorithms and Applications · Model Reduction and Neural Networks
