FePySR: A Neural Feature Extraction Framework for Efficient and Scalable Symbolic Regression
Zhiming Yu, Wangtao Lu, Xin Lai

TL;DR
FePySR is a two-stage neural feature extraction framework that improves the efficiency and accuracy of symbolic regression, especially for complex equations and noisy data, outperforming existing methods.
Contribution
Introduces FePySR, a novel two-stage framework combining neural networks and structural optimization to enhance symbolic regression performance and scalability.
Findings
Outperforms state-of-the-art methods on benchmark datasets.
Recovers 36 complex equations out of 75 synthesized cases.
Maintains performance under noise and varying feature selection levels.
Abstract
A fundamental challenge in symbolic regression (SR) is efficiently recovering complex mathematical expressions from observational data. Although this problem is NP-hard, many expressions of practical interest decompose naturally into combinations of nonlinear feature modules, concentrating structural complexity into a small number of reusable components. Here, we introduce FePySR, a two-stage framework that reduces the SR search space by extracting valid features prior to equation search. FePySR first employs a heterogeneous neural network to constrain observational data to a set of candidate expressions, then performs structural optimization within this refined expression space using PySR. Across five standard benchmarks, FePySR outperforms state-of-the-art methods by achieving higher equation recovery rates. On a set of 75 highly complex synthesized equations, FePySR recovers 36…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
