Nested-GPT for variable-multiplicity parton showers: A case study in the resummation of non-global logarithms
Wanchen Li, Ding Yu Shao, Hao-Zhe Shi, Yu-Xuan Sun

TL;DR
Nested-GPT is a hierarchical Transformer architecture designed to simulate variable-multiplicity parton-shower histories, effectively modeling complex emission processes in high-energy physics.
Contribution
It introduces Nested-GPT, a novel autoregressive Transformer framework that enforces physical constraints and dynamically predicts emissions, advancing surrogate modeling of parton showers.
Findings
Nested-GPT accurately reproduces reference shower observables within uncertainties.
It outperforms flow-matching baseline in modeling emission sequences.
The approach is validated for leading-logarithmic resummation of non-global logarithms.
Abstract
We introduce Nested-GPT, a hierarchical autoregressive Transformer architecture for simulating the variable-multiplicity parton-shower histories. As a controlled benchmark, we study the leading-logarithmic resummation of non-global logarithms in the large- limit, utilizing a stochastic Monte Carlo dipole shower to generate reference training data. We systematically evaluate Nested-GPT against a Transformer flow-matching baseline. The flow-matching framework successfully parameterizes the joint distribution of emission kinematics at fixed multiplicity. Its phase-space representation, however, requires the final number of emissions to be specified externally rather than generated dynamically. Conversely, Nested-GPT strictly enforces the ordered Markovian branching structure, predicting emissions sequentially and dynamically evaluating a learned sequence-termination condition. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
