LLM-Based Scientific Equation Discovery via Physics-Informed Token-Regularized Policy Optimization
Boxiao Wang, Kai Li, Tianyi Liu, Chen Li, Junzhe Wang, Yifan Zhang, Jian Cheng

TL;DR
This paper introduces PiT-PO, a reinforcement learning framework that fine-tunes LLMs to generate physically valid and concise scientific equations, improving discovery accuracy and efficiency in symbolic regression tasks.
Contribution
The work presents a novel physics-informed, token-regularized reinforcement learning method to adapt LLMs for more accurate and physically consistent equation discovery.
Findings
Achieves state-of-the-art results on benchmark symbolic regression tasks.
Successfully discovers new turbulence models in fluid dynamics.
Enables small models to outperform larger, closed-source models.
Abstract
Symbolic regression aims to distill mathematical equations from observational data. Recent approaches have successfully leveraged Large Language Models (LLMs) to generate equation hypotheses, capitalizing on their vast pre-trained scientific priors. However, existing frameworks predominantly treat the LLM as a static generator, relying on prompt-level guidance to steer exploration. This paradigm fails to update the model's internal representations based on search feedback, often yielding physically inconsistent or mathematically redundant expressions. In this work, we propose PiT-PO (Physics-informed Token-regularized Policy Optimization), a unified framework that evolves the LLM into an adaptive generator via reinforcement learning. Central to PiT-PO is a dual-constraint mechanism that rigorously enforces hierarchical physical validity while simultaneously applying fine-grained,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Model Reduction and Neural Networks · Generative Adversarial Networks and Image Synthesis
