Benchmarking symbolic regression constant optimization schemes
L.G.A dos Reis, V.L.P.S. Caminha, T.J.P.Penna

TL;DR
This paper evaluates eight constant optimization methods in symbolic regression using genetic programming across ten benchmarks, introduces Tree Edit Distance as a new metric, and analyzes their performance and biases.
Contribution
It provides a comparative analysis of constant optimization schemes, introduces Tree Edit Distance for symbolic accuracy, and discusses metric biases in symbolic regression.
Findings
Different methods excel in different scenarios.
No single method is best for all problems.
Common metrics may bias model evaluation.
Abstract
Symbolic regression is a machine learning technique, and it has seen many advancements in recent years, especially in genetic programming approaches (GPSR). Furthermore, it has been known for many years that constant optimization of parameters, during the evolutionary search, greatly increases GPSR performance However, different authors approach such tasks differently and no consensus exists regarding which methods perform best. In this work, we evaluate eight different parameter optimization methods, applied during evolutionary search, over ten known benchmark problems, in two different scenarios. We also propose using an under-explored metric called Tree Edit Distance (TED), aiming to identify symbolic accuracy. In conjunction with classical error measures, we develop a combined analysis of model performance in symbolic regression. We then show that different constant optimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
