Learning to Unscramble: Simplifying Symbolic Expressions via Self-Supervised Oracle Trajectories
David Shih

TL;DR
This paper introduces a self-supervised learning method using oracle trajectories and transformer networks to effectively simplify complex symbolic mathematical expressions, outperforming prior approaches.
Contribution
The authors develop a novel self-supervised approach with oracle trajectories and transformer policies for symbolic expression simplification, applied successfully to physics problems.
Findings
Near-perfect solve rates on high-energy physics problems.
Outperforms reinforcement learning and regression methods.
Achieves 100% full simplification on complex amplitudes.
Abstract
We present a new self-supervised machine learning approach for symbolic simplification of complex mathematical expressions. Training data is generated by scrambling simple expressions and recording the inverse operations, creating oracle trajectories that provide both goal states and explicit paths to reach them. A permutation-equivariant, transformer-based policy network is then trained on this data step-wise to predict the oracle action given the input expression. We demonstrate this approach on two problems in high-energy physics: dilogarithm reduction and spinor-helicity scattering amplitude simplification. In both cases, our trained policy network achieves near perfect solve rates across a wide range of difficulty levels, substantially outperforming prior approaches based on reinforcement learning and end-to-end regression. When combined with contrastive grouping and beam search,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
