On the Generalization Bounds of Symbolic Regression with Genetic Programming

Masahiro Nomura; Ryoki Hamano; Isao Ono

arXiv:2604.17402·cs.LG·April 21, 2026


TL;DR

This paper provides a theoretical analysis of symbolic regression with genetic programming, deriving generalization bounds that explain how structural constraints and stability mechanisms influence model performance.

Contribution

The paper introduces a learning-theoretic generalization bound for GP-based SR. The bound links practical design choices, such as limits on tree size and depth, to complexity measures, and uses that link to explain empirically observed behavior.

Findings

01. Structural restrictions reduce hypothesis class complexity.

02. Stability mechanisms control prediction sensitivity.

03. Theoretical bounds explain practices like parsimony pressure and depth limits.
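
To make the practices named in the findings concrete, here is a minimal sketch of parsimony pressure combined with a depth limit, as commonly used in GP. It is not taken from the paper: the nested-tuple tree encoding, the function names, the linear size penalty, and the default values of `parsimony_coeff` and `max_depth` are all illustrative assumptions.

```python
# Illustrative sketch only; names and defaults are assumptions, not the paper's method.
# An expression is either a leaf (variable name or constant) or a tuple
# (operator, child_1, ..., child_k), e.g. ("add", "x", ("mul", "x", 2.0)).

def tree_size(expr):
    """Return the number of nodes in the expression tree."""
    if isinstance(expr, tuple):
        return 1 + sum(tree_size(child) for child in expr[1:])
    return 1


def tree_depth(expr):
    """Return the depth of the expression tree (a single leaf has depth 1)."""
    if isinstance(expr, tuple):
        return 1 + max(tree_depth(child) for child in expr[1:])
    return 1


def penalized_fitness(mse, expr, parsimony_coeff=0.01, max_depth=5):
    """Lower is better: training error plus a linear size penalty.

    Trees deeper than max_depth are rejected outright (hard depth limit).
    """
    if tree_depth(expr) > max_depth:
        return float("inf")
    return mse + parsimony_coeff * tree_size(expr)
```

Both mechanisms restrict which trees can be selected: the depth limit removes structures from the search space, while the size penalty makes larger structures win only when they reduce training error enough to pay for their extra nodes.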

Abstract

Symbolic regression (SR) with genetic programming (GP) aims to discover interpretable mathematical expressions directly from data. Despite its strong empirical success, the theoretical understanding of why GP-based SR generalizes beyond the training data remains limited. In this work, we provide a learning-theoretic analysis of SR models represented as expression trees. We derive a generalization bound for GP-style SR under constraints on tree size, depth, and learnable constants. Our result decomposes the generalization gap into two interpretable components: a structure-selection term, reflecting the combinatorial complexity of choosing an expression-tree structure, and a constant-fitting term, capturing the complexity of optimizing numerical constants within a fixed structure. This decomposition provides a theoretical perspective on several widely used practices in GP, including…
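
The combinatorial complexity behind the structure-selection term can be illustrated by counting how many expression-tree structures a size limit admits. The sketch below is not the paper's bound. It assumes a hypothetical primitive set (the counts `n_leaves`, `n_unary`, `n_binary` are made up), treats operand order as significant, and plugs the count into the textbook finite-class bound for a loss bounded in [0, 1], which ignores learnable constants entirely.

```python
import math

# Illustrative sketch only; the primitive-set sizes are assumptions and
# the bound is the generic finite-class (union bound) rate, not the paper's result.

def count_tree_structures(max_nodes, n_leaves=2, n_unary=2, n_binary=4):
    """Count labeled expression trees with at most max_nodes nodes.

    exact[n] is the number of trees with exactly n nodes, where each leaf is
    one of n_leaves symbols and each internal node is a unary or binary operator.
    Commutative operators are counted with ordered operands, so this overcounts.
    """
    exact = [0] * (max_nodes + 1)
    if max_nodes >= 1:
        exact[1] = n_leaves
    for n in range(2, max_nodes + 1):
        # Root is a unary operator over a subtree with n - 1 nodes.
        total = n_unary * exact[n - 1]
        # Root is a binary operator; split the remaining n - 1 nodes.
        for left in range(1, n - 1):
            total += n_binary * exact[left] * exact[n - 1 - left]
        exact[n] = total
    return sum(exact)


def structure_term(max_nodes, n_samples, delta=0.05, **primitives):
    """Finite-class bound sqrt((ln|S| + ln(1/delta)) / (2 m)) over structures S."""
    count = count_tree_structures(max_nodes, **primitives)
    return math.sqrt((math.log(count) + math.log(1.0 / delta)) / (2 * n_samples))
```

Under these assumptions the number of structures grows exponentially with the node limit, so the logarithm in `structure_term` grows roughly linearly in `max_nodes`. Tightening the size limit or collecting more samples both shrink the term, which matches the qualitative role the abstract assigns to structural constraints.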
