Decomposable Neuro Symbolic Regression

Giorgio Morales; John W. Sheppard

arXiv:2511.04124 · cs.LG · March 31, 2026

TL;DR

This paper introduces a decomposable neuro-symbolic regression method that combines transformers, genetic algorithms, and genetic programming to produce interpretable, accurate multivariate symbolic models from opaque regressors.

Contribution

The paper presents an explainable symbolic regression (SR) approach that distills a trained opaque regression model into structured mathematical expressions, improving interpretability and structure recovery.

Findings

01. Achieves errors lower than or comparable to those of existing methods on noisy data.

02. Consistently recovers the original mathematical structures.

03. Achieves a high symbolic solution recovery rate on the Feynman dataset.

Abstract

Symbolic regression (SR) models complex systems by discovering mathematical expressions that capture underlying relationships in observed data. However, most SR methods prioritize minimizing prediction error over identifying the governing equations, often producing overly complex or inaccurate expressions. To address this, we present a decomposable SR method that generates interpretable multivariate expressions leveraging transformer models, genetic algorithms (GAs), and genetic programming (GP). In particular, our explainable SR method distills a trained "opaque" regression model into mathematical expressions that serve as explanations of its computed function. Our method employs a Multi-Set Transformer to generate multiple univariate symbolic skeletons that characterize how each variable influences the opaque model's response. We then evaluate the generated skeletons' performance…
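To make the idea of characterizing how a single variable influences an opaque model's response more concrete, here is a minimal Python sketch. It is not the authors' implementation: the function names (`opaque_model`, `univariate_probe_sets`), the sampling ranges, and the toy target function are all assumptions made for illustration. The sketch only builds several input-response sets in which one variable sweeps a range while the others are held at random fixed values; the Multi-Set Transformer that would map such sets to a symbolic skeleton is not shown.

```python
import numpy as np


def opaque_model(X):
    # Hypothetical stand-in for a trained black-box regressor.
    # Assumed toy function: x0^2 * sin(x1) + 3 * x2.
    return X[:, 0] ** 2 * np.sin(X[:, 1]) + 3.0 * X[:, 2]


def univariate_probe_sets(model, n_vars, var_idx, n_sets=5, n_points=50,
                          low=-5.0, high=5.0, seed=0):
    """Build several (x, y) sets that isolate the effect of one variable.

    In each set, every variable except `var_idx` is frozen at a random
    value, so the model's response depends only on the swept variable.
    """
    rng = np.random.default_rng(seed)
    sets = []
    for _ in range(n_sets):
        fixed = rng.uniform(low, high, size=n_vars)      # frozen values
        X = np.tile(fixed, (n_points, 1))                # repeat the frozen row
        xs = np.sort(rng.uniform(low, high, size=n_points))
        X[:, var_idx] = xs                               # sweep one variable
        sets.append((xs, model(X)))
    return sets


# Five sets describing how x0 alone influences the response.
sets = univariate_probe_sets(opaque_model, n_vars=3, var_idx=0)
```

Under these assumptions, every set for `var_idx=0` follows the same functional form in x0 (a scaled square plus an offset) with different constants, which is the kind of shared structure a univariate skeleton is meant to capture.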
