Learning to Unscramble: Simplifying Symbolic Expressions via Self-Supervised Oracle Trajectories

David Shih

arXiv:2603.11164·hep-th·April 14, 2026

Learning to Unscramble: Simplifying Symbolic Expressions via Self-Supervised Oracle Trajectories

David Shih

PDF

TL;DR

This paper introduces a self-supervised learning method using oracle trajectories and transformer networks to effectively simplify complex symbolic mathematical expressions, outperforming prior approaches.

Contribution

The authors develop a novel self-supervised approach with oracle trajectories and transformer policies for symbolic expression simplification, applied successfully to physics problems.

Findings

01

Near-perfect solve rates on high-energy physics problems.

02

Outperforms reinforcement learning and regression methods.

03

Achieves 100% full simplification on complex amplitudes.

Abstract

We present a new self-supervised machine learning approach for symbolic simplification of complex mathematical expressions. Training data is generated by scrambling simple expressions and recording the inverse operations, creating oracle trajectories that provide both goal states and explicit paths to reach them. A permutation-equivariant, transformer-based policy network is then trained on this data step-wise to predict the oracle action given the input expression. We demonstrate this approach on two problems in high-energy physics: dilogarithm reduction and spinor-helicity scattering amplitude simplification. In both cases, our trained policy network achieves near perfect solve rates across a wide range of difficulty levels, substantially outperforming prior approaches based on reinforcement learning and end-to-end regression. When combined with contrastive grouping and beam search,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.