Transformer Semantic Genetic Programming for d-dimensional Symbolic Regression Problems
Philipp Anthes, Dominik Sobania, Franz Rothlauf

TL;DR
Transformer Semantic Genetic Programming (TSGP) introduces a semantic search method using a pre-trained transformer to generate diverse, high-semantic similarity offspring, outperforming traditional methods on symbolic regression tasks.
Contribution
The paper presents TSGP, a novel semantic search approach leveraging a pre-trained transformer as a variation operator for symbolic regression, demonstrating superior performance and solution compactness.
Findings
TSGP outperforms standard GP, SLIM_GSGP, Deep Symbolic Regression, and Denoising Autoencoder GP.
A single transformer trained on millions of programs generalizes across various symbolic regression problems.
Target semantic distance effectively balances exploration and exploitation, influencing convergence and solution size.
Abstract
Transformer Semantic Genetic Programming (TSGP) is a semantic search approach that uses a pre-trained transformer model as a variation operator to generate offspring programs with high semantic similarity to a given parent. Unlike other semantic GP approaches that rely on fixed syntactic transformations, TSGP aims to learn diverse structural variations that lead to solutions with similar semantics. We find that a single transformer model trained on millions of programs is able to generalize across symbolic regression problems of varying dimension. Evaluated on 24 real-world and synthetic datasets, TSGP significantly outperforms standard GP, SLIM_GSGP, Deep Symbolic Regression, and Denoising Autoencoder GP, achieving an average rank of 1.58 across all benchmarks. Moreover, TSGP produces more compact solutions than SLIM_GSGP, despite its higher accuracy. In addition, the target semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
