Optimizing the parton shower model in PYTHIA with pp collision data at $\sqrt{s}$ = 13 TeV
S. K. Kundu, T. Sarkar, M. Maity

TL;DR
This paper describes the optimization of the PYTHIA8 parton shower model using recent 13 TeV proton-proton collision data from the LHC, improving the simulation of quark and gluon evolution into jets.
Contribution
The paper introduces a new tuning of the PYTHIA8 parton shower model based on recent experimental data, enhancing its accuracy in simulating hadron collision events.
Findings
Optimized PYTHIA8 model better matches experimental data
Improved simulation of event shape variables
Enhanced prediction of jet cross-sections
Abstract
Production of quarks and gluons in hadron collisions tests Quantum Chromodynamics (QCD) over a wide range of energy. Models of QCD are implemented in event generators to simulate hadron collisions and evolution of quarks and gluons into jets of hadrons. PYTHIA8 uses the parton shower model for simulating particle collisions and is optimized using experimental observations. Recent measurements of event shape variables and jet cross-sections in pp collisions at = 13 TeV at the Large Hadron Collider have been used to optimize the parton shower model as used in PYTHIA8.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7| pythia8 | Monash | Sampling range | Optimized |
|---|---|---|---|
| Parameters set | values | values | |
| SpaceShower:alphaSvalue | 0.1365 | ||
| TimeShower:alphaSvalue | 0.1365 | ||
| SpaceShower:PTmaxFudge | 1.0 |
| range | ||||||||
|---|---|---|---|---|---|---|---|---|
| in GeV | Monash | This | Monash | This | Monash | This | Monash | This |
| tune | study | tune | study | tune | study | tune | study | |
| 73-93 | 8.49 | 16.40 | 38.11 | 16.86 | 69.84 | 22.12 | 12.32 | 20.19 |
| 93-165 | 3.90 | 9.96 | 19.44 | 7.31 | 23.79 | 6.49 | 4.16 | 7.82 |
| 165- 225 | 1.14 | 5.39 | 13.63 | 4.38 | 21.30 | 7.17 | 2.56 | 4.60 |
| 225-298 | 1.81 | 4.59 | 11.44 | 3.58 | 23.51 | 7.42 | 2.86 | 3.43 |
| 298-365 | 1.54 | 3.53 | 6.29 | 1.68 | 12.19 | 3.78 | 2.03 | 3.17 |
| 365-452 | 1.36 | 3.95 | 7.42 | 2.29 | 8.53 | 5.15 | 1.07 | 3.68 |
| 452-557 | 1.72 | 2.40 | 6.29 | 2.09 | 10.10 | 4.56 | 2.61 | 1.88 |
| 1.16 | 3.88 | 6.38 | 2.76 | 8.46 | 6.23 | 3.43 | 3.91 | |
| CMS(R) | CMS(R) | ATLAS(R) | ||||
| Rapidity | Monash | This | Monash | This | Monash | This |
| range | tune | study | tune | study | tune | study |
| 2.10 | 0.63 | 2.17 | 0.68 | 9.38 | 1.13 | |
| 1.73 | 0.41 | 2.11 | 0.50 | 8.20 | 0.96 | |
| 2.03 | 0.46 | 3.02 | 0.97 | 9.33 | 2.05 | |
| 0.89 | 0.25 | 1.30 | 0.40 | 6.92 | 1.56 | |
| 0.72 | 0.58 | 1.21 | 0.76 | 5.64 | 1.39 | |
| 0.36 | 0.31 | 0.27 | 0.15 | 3.75 | 1.23 | |
| 0.28 | 0.14 | 0.28 | 0.23 | - | - | |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Optimizing the parton shower model in PYTHIA with pp collision data at
S. K. Kundu, T. Sarkar, M. Maity [email protected]@[email protected]
(Department of Physics, Visva-Bharati University
Santiniketan, India
)
Abstract
Production of quarks and gluons in hadron collisions tests Quantum Chromodynamics (QCD) over a wide range of energy. Models of QCD are implemented in event generators to simulate hadron collisions and evolution of quarks and gluons into jets of hadrons. pythia8 uses the parton shower model for simulating particle collisions and is optimized using experimental observations. Recent measurements of event shape variables and jet cross-sections in pp collisions at at the Large Hadron Collider have been used to optimize the parton shower model as used in pythia8.
1 Introduction
High energy collisions of hadrons produce quarks and gluons, collectively known as partons, which evolve into a large number of stable hadrons. Successive stages of this process involve lower energy and hence increasing value of the strong coupling . This makes analytical calculations based on perturbation theory prohibitively difficult as well as unreliable. Thus, a combination of analytical calculations in the early stages, including the hard scattering, and approximate numerical calculations later, is used to describe such events.
A number of event shape variables (ESVs) [1, 2] have been defined to reflect the different aspects of QCD over a wide range of energy as well as theoretical complexity. Their measurements[3, 4, 5, 6, 7, 8, 9, 10] at the colliders have been used to improve our understanding of QCD - parton distribution function (PDF), multi-parton interaction (MPI), initial state radiation(ISR), final state radiation (FSR), fragmentation, hadronization, etc. Although data from LEP, Tevatron and RunI of the Large Hadron Collider(LHC) have been used to this end, RunII data of LHC has so far not been used exhaustively. This study attempts to improve the understanding of QCD using four ESVs measured in RunII data [3].
Monte Carlo (MC) simulation programs such as pythia8 [11], herwig++ [12], MadGraph5_amc@nlo [13], etc. combine analytical calculations and approximate models to describe the collisions. pythia8 is a popular MC event generator and is extensively used by the particle physics community. Event generators which use matrix element (ME) calculations for the hard scattering process often use pythia8 for emulating the subsequent fragmentation and hadronization process. Data from colliders have been used to optimize pythia [14, 15, 16]and it is imperative that data from the RunII of LHC is used to optimize it.
The plan of the paper is as follows. Section 2 briefly reviews the theory relevant to parton shower. Section 3 examines the agreement of pythia8 with measurement of ESVs by the CMS experiment [3] in pp collisions at TeV. Section 4 describes the studies of the different parameters of pythia8 and optimization of the parameter set. It also describes the application of the improved parameter set to compare with results of other analyses of CMS [17] and ATLAS [18]. Section 5 contains the summary and outlook.
2 Parton Shower description of hadron collisions
In high energy collisions, it is intensely complicated to achieve an exclusive picture of the final state partons using matrix elements calculation and only fixed order treatment is not sufficient. However, for comparison with experimental analyses, a fully exclusive description of the final states using calculations based on the shower evolution and hadronization is more suitable. Such methods are described through phenomenological models embedded in the shower MC codes. MC event generators such as pythia8, herwig++ and sherpa use their own parton shower treatment which have their own merits. herwig++ uses the coherent branching algorithm [19] for angular order branching whereas Catani-Seymour dipole factorisation [20] is used in sherpa.
pythia8 uses leading order(LO) calculations for generating the hard scattering processes. It uses ‘transverse momentum’ ordered parton shower[21] with as evolution variable for the generation of () final states by taking account ISR and FSR. pythia8 also emulates MPI and evolution of the partons into hadrons. In this showering scheme, resummation is done to all orders with a certain logarithmic accuracy.
For the splitting of a parton , pythia8 uses the branching probability expressed by the DGLAP evolution equations:
[TABLE]
where is the DGLAP splitting function and represents the scale of the branching; represents the sharing of of between the two daughters, with taking a fraction and the rest, . Here the summation goes over all allowed branching, e.g. and and so on. Now, these probabilities become larger than unity due to divergence when which is taken care by introducing a term known as Sudakov form factor [22]. This Sudakov factor ensures that there will be no emission between scale to a given .
Considering lightcone kinematics, evolution variables for at virtuality scale for space-like branching (ISR) and time-like branching (FSR) are given by equations 2.2 and 2.3, respectively.
[TABLE]
Finally, equations 2.4 and 2.5 describe the evolutions for ISR and FSR respectively [21].
[TABLE]
Currently both the running re-normalisation and factorisation shower scales, i.e. the scales at which and the PDFs are evaluated, are chosen to be [23]. The general methodology of pythia8 for ISR, FSR, MPI is to start from some maximum scale and evolve downward in energies towards next branching untill the daughter partons reach some cut-off.
3 Event Shape Variables at RunII of LHC
Event shape variables[1] are defined in terms of the four-momenta of the objects with multi-jet final states. They are sensitive to the topology of the primary hard scattering and also, the evolution of the primary partons into stable hadrons. With a judicious choice of the ESVs it is possible to use experimental data to confront QCD predictions - from perturbative analytical calculation in the early phase to the nonperturbative models in the later phase of the event. These variables also have the advantage that, they are safe from collinear and infrared(IRC) divergences [2]. The ESVs may also be used to look for signature of physics beyond the Standard Model [24, 25]. LHC experiments ATLAS, CMS, and ALICE have studied various ESVs, evaluating them with jets and charged particles in proton-proton (pp) collision [3, 4, 5, 6, 7, 8, 9, 10].
Most recently the CMS has evaluated four ESVs in multi-jet events (with atleast three jets) in pp collisions at TeV [3]. These are - the complement of transverse thrust (), total jet mass (), total transverse jet mass () and total jet broadening () and defined as ratios of momenta of the jets in an event and hence many uncertainties cancel out. These ESVs have higher values for multijet, spherical events and lower values for two-jet, pencil like events. The complement of transverse thrust () is expected to be sensitive to the initial hard scattering, multi-parton interaction, and emissions of high gluons in the form of ISR and FSR. The other three - , and - have additional dependence on the details of fragmentation and hadronization.
The distributions of the ESVs observed in data are unfolded to remove the effects of the efficiencies and acceptances of the CMS detector and a broad agreement between data and the predictions of pythia8, herwig++ and MadGraph5_amc@nlo is observed. In case of pythia8, it is noted that the agreement is better for the and which are computed using of the jets. However, in case of and , which are computed using of the jets, pythia8 overestimates the multijet nature of the events. This indicates that the flow of energy in the transverse plane is better modelled by pythia8 while the overall three-dimensional modelling is not adequate.
CMS also observes [3] that ISR has a large effect on the distributions of the ESVs and increases the spherical nature of the multijet events. The effect of FSR is much smaller and that of MPI is negligible.
For all the four ESVs, the overall agreement improves with the energy scale of the event. The strong coupling is smaller for higher energy and reduces the emission of hard gluons in the early stage of the evolution of the partons. This makes the higher order calculations less important for describing such event. CMS has used average of the two leading jets of an event, , to represent its energy scale.
4 Optimizing the Parton Shower Model of PYTHIA8
Recently, both CMS and ATLAS have done several optimizations of pythia8 around its Monash tune [14]. CMS has used underlying event (UE) and charged particle data from the CDF ( collisions, Fermilab) and CMS (pp collisions, LHC) for tuning the strong coupling and MPI related parameters for different PDF sets [26, 27]. ATLAS has optimized ISR, FSR and MPI related parameters using a number of observables [28]. The present study also uses the Monash tune, pythia v8.235 with NNPDF2.3 PDF (LO) set, as the default. For each parameter, variation has been done around the default value of the Monash tune.
Since Monash tune overestimates the multijet nature of the events [3], the role of ISR and FSR need to be examined along with the choice of a suitable value of . pythia8 has the provision to use separate values of for ISR and FSR and the corresponding parameters are SpaceShower:alphaSvalue and TimeShower:alphaSvalue respectively. These will be referred to as and . The Monash tune uses for both. The maximum evolution scale involved in the showering is set to match the scale of the hard process itself. In pythia8 it set equal to the factorization scale, but allows its modification by multiplicative factors SpaceShower:PTmaxFudge for ISR and TimeShower:PTmaxFudge for FSR. Both parameters in the Monash tune have 1 as their default values.
For each point in the parameter space, events have been generated to ensure that the statistical uncertainty is smaller than the experimental uncertainty. The resulting distributions have been compared with data and has been calculated to check the goodness of fit. The optimization has been done using professor v2.3.0 [29] along with RIVET v2.6 [30] with the complete set of ESVs distributions from pythia8 as available from [3]. Post optimization, the new parameter set is checked using other relevant results from the CMS [17] and ATLAS [18].
4.1 Optimization of the Strong Coupling for ISR and FSR
The Monash tune of pythia8 uses the same value of for both ISR and FSR as the default. It is a choice of convenience but not mandatory. It has been argued that the effective scales for ISR and FSR are different which could be absorbed by effective values of the corresponding strong coupling and this is acceptable. This freedom to set nonidentical for ISR and FSR within the validity of the PS approach gives MC tuning studies a space to best describe the observables. The observation of CMS [3] that ISR has a rather large effect on the ESVs compared to FSR adds to the importance of this issue. So it is instructive and important to probe if pythia8 prefers to have different values of for both ISR and FSR.
First, is varied, keeping and other parameters fixed at their default values. The value of is varied by 20% about the default value in steps of 2%. The results, in terms of are shown in figure 1. It is seen that and are not sensitive to while and is mildly sensitive preferring a lower value.
Next, is varied, keeping other parameters including fixed at their default values. The value of is also varied by 20% in steps of 2%. The results, in terms of are shown in figure 2. It is seen that is not sensitive to , prefers the default value, , and prefer a higher value.
The effect of variation of either , or, on the four ESVs conform to the observations of CMS regarding the effects of ISR and FSR. The disagreements for the first two ranges for all of them are very large and appear to be beyond the scope of parameter tuning and left out of the next study, where both and are varied simultaneously and over the same ranges as above. Each plot in figure 3 shows the summed up for the six ranges for the same variable. This gives us an overall idea of the preferable values of and .
4.2 Optimization of Maximum scale of the Shower Evolution
The sensitivity of the ESVs to maximum scale of the ISR shower evolution SpaceShower:PTmaxFudge () has been checked. Figure 4 shows the variation of for the four ESVs with this parameter. It is observed that and have similar dependence on , preferring a value of . On the other hand, and prefer a value . Similar study for TimeShower:PTmaxFudge, related to FSR, shows no significant variation for the ESVs.
4.3 Simultaneous Optimization of the Strong Coupling and Maximum scale of the Shower Evolution
Finally, professor v2.3.0 [29] is used to simultaneously optimize , , and keeping other parameters fixed to their default values in the Monash tune. CMS data for the four ESVs are made available through RIVET v2.6 [30] and HEP Data [31]. Overall, 120 different combinations of the three parameters are randomly sampled by professor in the ranges mentioned in table 1. professor was instructed to perform a third order polynomial fit to optimize parameter response for the observables, the four ESVs in this case. This goodness-of-fit minimization process gives in return best favourable values of the three parameters as listed in table 1. The uncertainty for the optimized parameters are obtained from Minuit [32] package interfaced with professor.
The values obtained for , , and are 0.11409, 0.15052 and 0.9323 respectively, see table 1. It is noted that the model of QCD implemented in pythia prefers a lower value of for ISR compared to the default value used in the Monash tune but prefers a higher value of the same for FSR. It is also observed that the optimized values obtained here for and are very close to their values obtained while is fixed to 1.
4.4 Validation of the Optimized Set of Parameters of PYTHIA8
The optimized values of the three parameters (see, table 1) are used to calculate the ESVs. It is seen that the agreement with data deteriorates slightly for (figure 5) and (figure 8) compared to the good agreement with the Monash tune. But, there is significant improvement in agreement with data for (figure 6) and (figure 7) compared to the Monash tune. Since and had a rather poor agreement between data and the Monash tune, overall this new set of parameters is better.
It is imperative that the optimized set of parameters is tested against some other QCD results at the LHC. Inclusive jet cross-section is a very important QCD measurement at the LHC and is sensitive to PDF of protons and and both CMS [17] and ATLAS [18] have studied this with the 13 TeV data. The CMS has measurements of inclusive cross-sections for jets with R = 0.4, 0.7. Figures 9 and 10 show that the new parameter set improves the agreement with data compared to the Monash tune. Similar improvement is seen for the ATLAS measurement of jets with R = 0.4 (figure 11).
5 Summary
The strong coupling and maximum shower evolution scale used in the parton shower model of pythia8 have been optimized using the CMS measurement of four event shape variables over a wide range of energy scale of the events. pythia8 with the optimized parameters shows better agreement with inclusive jet cross-section measurements by CMS and ATLAS experiments. This study suggests that the models of initial and final state radiations in pythia8 can be improved to better represent various Quantum Chromodynamics related studies in the context of the Large Hadron Collider.
Acknowledgments
The authors sincerely thank Prof. Peter Z. Skands (Monash University) for useful discussions and suggestions. Financial support by the Department of Science & Technology, Government of India, through research grants, SR/MF/PS-02/2014-VB(G) and SR/MF/PS-03/2014-VB(G) is thankfully acknowledged.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Banfi, G. P. Salam and G. Zanderighi, JHEP 08 , 062 (2004), ar Xiv:hep-ph/0407287 [hep-ph] , doi: 10.1088/1126-6708/2004/08/062 .
- 2[2] A. Banfi, G. P. Salam and G. Zanderighi, JHEP 06 , 038 (2010), ar Xiv:1001.4082 [hep-ph] , doi: 10.1007/JHEP 06(2010)038 .
- 3[3] CMS Collaboration (A. M. Sirunyan et al. ), JHEP 12 , 117 (2018), ar Xiv:1811.00588 [hep-ex] , doi: 10.1007/JHEP 12(2018)117 .
- 4[4] ATLAS Collaboration (G. Aad et al. ), Eur. Phys. J. C 76 , 375 (2016), ar Xiv:1602.08980 [hep-ex] , doi: 10.1140/epjc/s 10052-016-4176-8 .
- 5[5] CMS Collaboration (V. Khachatryan et al. ), JHEP 10 , 087 (2014), ar Xiv:1407.2856 [hep-ex] , doi: 10.1007/JHEP 10(2014)087 .
- 6[6] CMS Collaboration (S. Chatrchyan et al. ), Phys. Lett. B 722 , 238 (2013), ar Xiv:1301.1646 [hep-ex] , doi: 10.1016/j.physletb.2013.04.025 .
- 7[7] ATLAS Collaboration (G. Aad et al. ), Eur. Phys. J. C 72 , 2211 (2012), ar Xiv:1206.2135 [hep-ex] , doi: 10.1140/epjc/s 10052-012-2211-y .
- 8[8] ATLAS Collaboration (G. Aad et al. ), Phys. Rev. D 88 , 032004 (2013), ar Xiv:1207.6915 [hep-ex] , doi: 10.1103/Phys Rev D.88.032004 .
