Nested-GPT for variable-multiplicity parton showers: A case study in the resummation of non-global logarithms

Wanchen Li; Ding Yu Shao; Hao-Zhe Shi; Yu-Xuan Sun

arXiv:2605.18360·hep-ph·May 21, 2026

TL;DR

Nested-GPT is a hierarchical autoregressive Transformer that simulates variable-multiplicity parton-shower histories, generating emissions one at a time and learning when to terminate the sequence.

Contribution

The paper introduces Nested-GPT, an autoregressive Transformer framework that enforces the ordered Markovian branching structure of the shower and decides dynamically when to stop emitting, advancing surrogate modeling of parton showers.

Findings

01

Nested-GPT accurately reproduces reference shower observables within uncertainties.

02

It outperforms a Transformer flow-matching baseline in modeling emission sequences.

03

The approach is validated for leading-logarithmic resummation of non-global logarithms.

Abstract

We introduce Nested-GPT, a hierarchical autoregressive Transformer architecture for simulating variable-multiplicity parton-shower histories. As a controlled benchmark, we study the leading-logarithmic resummation of non-global logarithms in the large-$N_c$ limit, utilizing a stochastic Monte Carlo dipole shower to generate reference training data. We systematically evaluate Nested-GPT against a Transformer flow-matching baseline. The flow-matching framework successfully parameterizes the joint distribution of emission kinematics at fixed multiplicity. Its phase-space representation, however, requires the final number of emissions to be specified externally rather than generated dynamically. Conversely, Nested-GPT strictly enforces the ordered Markovian branching structure, predicting emissions sequentially and dynamically evaluating a learned sequence-termination condition. We…
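The sequential generation with a learned stopping condition described in the abstract can be illustrated with a toy sampling loop. This is a minimal sketch under stated assumptions, not the authors' implementation: `sample_history`, `toy_stop_prob`, and `toy_emission` are hypothetical names, and simple hand-written functions stand in for the Transformer's learned termination probability and emission distribution. The emission tuple `(t, y, phi)` is likewise an illustrative choice of coordinates.

```python
import math
import random


def sample_history(stop_prob, sample_emission, max_emissions=50, rng=None):
    """Generate one variable-length emission history autoregressively.

    stop_prob(history) -> probability of terminating given the history so far.
    sample_emission(history, rng) -> next emission given the history so far.
    Both callables are placeholders for what a trained model would provide.
    """
    if rng is None:
        rng = random.Random()
    history = []
    while len(history) < max_emissions:
        # Evaluate the termination condition before every candidate emission.
        if rng.random() < stop_prob(history):
            break
        history.append(sample_emission(history, rng))
    return history


def toy_stop_prob(history):
    # Assumption: stopping becomes more likely as the history grows.
    return min(1.0, 0.2 + 0.1 * len(history))


def toy_emission(history, rng):
    # Assumption: each emission is (ordering variable t, rapidity y, azimuth phi).
    # t is built on top of the previous t, so emissions stay ordered and each
    # step depends only on the current state (Markovian).
    t_prev = history[-1][0] if history else 0.0
    t = t_prev + rng.expovariate(1.0)
    y = rng.uniform(-1.0, 1.0)
    phi = rng.uniform(0.0, 2.0 * math.pi)
    return (t, y, phi)


if __name__ == "__main__":
    rng = random.Random(0)
    for _ in range(3):
        history = sample_history(toy_stop_prob, toy_emission, rng=rng)
        print(len(history), [round(e[0], 3) for e in history])
```

The multiplicity is never passed in: it emerges from when the stop condition fires. A fixed-multiplicity generator, by contrast, would need the number of emissions supplied as an external input.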
