SymbolFit: Automatic Parametric Modeling with Symbolic Regression
Ho Fung Tsoi, Dylan Rankin, Cecile Caillol, Miles Cranmer, Sridhara Dasu, Javier Duarte, Philip Harris, Elliot Lipeles, Vladimir Loncar

TL;DR
SymbolFit is an automated framework that uses symbolic regression to efficiently generate parametric models for complex data distributions, providing uncertainty estimates without predefined functional forms, demonstrated on high-energy physics datasets.
Contribution
We develop SymbolFit, a novel symbolic regression-based framework that automates parametric modeling and uncertainty estimation for data without known functional forms, applicable across diverse distributions.
Findings
Successfully modeled complex distributions in LHC data
Reduced manual effort in functional form selection
Achieved comparable fits with minimal configuration
Abstract
We introduce SymbolFit, a framework that automates parametric modeling by using symbolic regression to perform a machine-search for functions that fit the data while simultaneously providing uncertainty estimates in a single run. Traditionally, constructing a parametric model to accurately describe binned data has been a manual and iterative process, requiring an adequate functional form to be determined before the fit can be performed. The main challenge arises when the appropriate functional forms cannot be derived from first principles, especially when there is no underlying true closed-form function for the distribution. In this work, we develop a framework that automates and streamlines the process by utilizing symbolic regression, a machine learning technique that explores a vast space of candidate functions without requiring a predefined functional form because the functional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Advanced Database Systems and Queries · Graph Theory and Algorithms
