Breaking the Simplification Bottleneck in Amortized Neural Symbolic Regression
Paul Saegert, Ullrich K\"othe

TL;DR
This paper introduces SimpliPy, a rule-based simplification engine that significantly accelerates amortized neural symbolic regression, enabling it to scale to complex scientific problems and outperform existing methods in accuracy and expression conciseness.
Contribution
We develop SimpliPy, a fast simplification engine that enhances amortized SR scalability and efficiency, and demonstrate its effectiveness within the Flash-ANSR framework on benchmark datasets.
Findings
SimpliPy achieves a 100-fold speed-up over SymPy.
Flash-ANSR outperforms amortized baselines in accuracy.
Flash-ANSR matches state-of-the-art direct optimization results.
Abstract
Symbolic regression (SR) aims to discover interpretable analytical expressions that accurately describe observed data. Amortized SR promises to be much more efficient than the predominant genetic programming SR methods, but currently struggles to scale to realistic scientific complexity. We find that a key obstacle is the lack of a fast reduction of equivalent expressions to a concise normalized form. Amortized SR has addressed this by general-purpose Computer Algebra Systems (CAS) like SymPy, but the high computational cost severely limits training and inference speed. We propose SimpliPy, a rule-based simplification engine achieving a 100-fold speed-up over SymPy at comparable quality. This enables substantial improvements in amortized SR, including scalability to much larger training sets, more efficient use of the per-expression token budget, and systematic training set…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Algorithms and Applications · Machine Learning and Data Classification · Machine Learning in Materials Science
