Current Challenges of Symbolic Regression: Optimization, Selection, Model Simplification, and Benchmarking
Guilherme Seidyo Imai Aldeia

TL;DR
This paper reviews key challenges in symbolic regression, introduces novel methods for parameter optimization, parent selection, and model simplification, and demonstrates improved performance on benchmarks.
Contribution
It presents new techniques for parameter tuning, parent selection, and model simplification, integrated into a multi-objective SR library that outperforms existing approaches.
Findings
Improved predictive accuracy with trade-offs in runtime and model size.
Enhanced parent selection using $oldsymbol{ ext{ extepsilon}}$-lexicase.
Simpler, more accurate models via memoization and hashing.
Abstract
Symbolic Regression (SR) is a regression method that aims to discover mathematical expressions that describe the relationship between variables, and it is often implemented through Genetic Programming, a metaphor for the process of biological evolution. Its appeal lies in combining predictive accuracy with interpretable models, but its promise is limited by several long-standing challenges: parameters are difficult to optimize, the selection of solutions can affect the search, and models often grow unnecessarily complex. In addition, current methods must be constantly re-evaluated to understand the SR landscape. This thesis addresses these challenges through a sequence of studies conducted throughout the doctorate, each focusing on an important aspect of the SR search process. First, I investigate parameter optimization, obtaining insights into its role in improving predictive accuracy,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Algorithms and Applications · Metaheuristic Optimization Algorithms Research · Language and cultural evolution
