Additive Atomic Forests for Symbolic Function and Antiderivative Discovery
Reda Belaiche

TL;DR
This paper introduces a novel symbolic framework that simultaneously recovers functions and their antiderivatives from data using additive atomic forests, enabling interpretable models that outperform some machine learning methods.
Contribution
The framework combines a derivative algebra, primitive function seeds, and additive atomic forests to automatically discover symbolic functions and their derivatives from data.
Findings
Sparse atom combinations match or exceed XGBoost on 13 out of 17 datasets.
The method produces interpretable symbolic formulas.
The framework is theoretically complete and self-constructing.
Abstract
We present a framework for the simultaneous symbolic recovery of a function and its antiderivative from data. The framework rests on three ideas. First, a derivative algebra: the observation that the product rule and the chain rule, applied to a seed set of elementary functions, generate a self-expanding system of function-derivative pairs -- a living library that grows each time a new function is discovered. Second, two complementary primitives -- EML, which is theoretically complete for all elementary functions, and SOL, introduced here, which makes trigonometric atoms available at depth~1 instead of depth~8 -- that seed the library with core atoms cheaply. Third, additive atomic forests: finite sums of primitive trees, optionally composed via multiplicative nodes, whose derivatives are fitted to data by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
