Improved description of the HERA data with a new simple PDF parametrization
Francesco Giuli, Marco Bonvini

TL;DR
This paper introduces a new, more flexible PDF parametrization that significantly improves the fit quality to HERA data, especially at small-x, highlighting the importance of parametrization choices in collider physics.
Contribution
It presents a novel PDF parametrization with higher small-x flexibility, implemented in xFitter, resulting in a better fit to HERA data and revealing biases in previous models.
Findings
Reduced chi-squared by over 60 units with only four additional parameters
Inclusion of small-x resummation further improves fit quality by 30 units
Identifies parametrization bias in default xFitter at small-x
Abstract
A new parametrization for the parton distribution functions with a higher flexibility in the small- region is presented. It has been implemented in the xFitter open-source PDF fitting tool, and compared to the default xFitter parametrization, used for the determination of the HERAPDF set. It has been found that the combined inclusive HERA I+II data can be described using NNLO theory with a significantly higher quality than HERAPDF2.0: the is reduced by more than 60 units, having used only four more parameters. Our result highlights a significant parametrization bias in the default xFitter parametrization at small , which would lead to even more dramatic effects when used for higher energy colliders, where the small- region is more relevant. We also find that the inclusion of small- resummation leads to a further reduction by approximately 30 extra units in .ā¦
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4| Contribution to | Old parametrizationĀ [10] | New parametrization |
|---|---|---|
| subset NC 920 | ||
| subset NC 820 | ||
| subset NC 575 | ||
| subset NC 460 | ||
| subset NC | ||
| subset CC | ||
| subset CC | ||
| correlation term + log term | ||
| Total |
| Contribution to | HELL3.0 (NLL) | HELL3.0 (LLā²) | HELL2.0 (LLā²) |
|---|---|---|---|
| subset NC 920 | |||
| subset NC 820 | |||
| subset NC 575 | |||
| subset NC 460 | |||
| subset NC | |||
| subset CC | |||
| subset CC | |||
| correlation term + log term | |||
| Total |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Improved description of the HERA data with a new simple PDF parametrization
University of Rome Tor Vergata and INFN, Sezione di Roma 2,
Via della Ricerca ScientificaĀ 1, 00133 Roma, Italy
āā
Marco Bonvini
INFN, Sezione di Roma 1,
Piazzale Aldo MoroĀ 5, 00185 Roma, Italy
Abstract:
A new parametrization for the parton distribution functions with a higher flexibility in the small- region is presented. It has been implemented in the xFitter open-source PDF fitting tool, and compared to the default xFitter parametrization, used for the determination of the HERAPDF set. It has been found that the combined inclusive HERAĀ I+II data can be described using NNLO theory with a significantly higher quality than HERAPDF2.0: the is reduced by more than 60 units, having used only four more parameters. Our result highlights a significant parametrization bias in the default xFitter parametrization at small , which would lead to even more dramatic effects when used for higher energy colliders, where the small- region is more relevant. We also find that the inclusion of small- resummation leads to a further reduction by approximately 30 extra units in . In this contribution, we review the results of the recent paper āA new simple PDF parametrization: improved description of the HERA dataā (arXiv:1902.11125).
1 The new proposed parametrization
Among many others, parton distribution functions (PDFs) represent a fundamental aspect of perturbative QCD (pQCD) in presence of incoming protons. These object describe the longitudinal momentum fraction x of partons within the proton. Currently the most accurate and reliable way to determine PDFs is through fits to data; thus the resulting fitted distributions depend on various aspects of this procedure e.g. the perturbative order of partonic cross sections or DGLAP splitting functions, the way heavy quarks are treated, definition, minimizations methods, the choice of the PDF parametrisation at the initial scale , etc. Here we review the results of Ref.Ā [1], where a new flexible and simple parametrization has been proposed and successfully used to determine PDFs.
Starting from the default parametrization used in xFitterĀ [2, 3] (which is the one used for the HERAPDF set), namely
[TABLE]
we propose a new extension, designed to add more flexibility in the small- region, while keeping the number of fitted parameters small. A polynomial in , which gives flexibility in the low- region, is added on top of the polynomial in , which gives flexibility in the high- region. These two polynomials can be combined considering a multiplicative option
[TABLE]
or an additive option
[TABLE]
These two options have been tested and it has been found that the additive parametrization results in smoother shapes and smaller in the fit. So, the actual parametrization used in our fits to the inclusive HERA data is
[TABLE]
This new parametrization depends on 18 free parameters to be fitted at the starting scale, which is to be compared with the HERAPDF2.0 parametrization, which depends on 14 free parameters.
2 PDF determination at NNLO
In order to directly compare our fit results with HERAPDF2.0, we use the same definition of the , namelyĀ [4]
[TABLE]
where the measured data are represented by , the corresponding theoretical prediction by , and represent the uncorrelated systematic and the statistical uncertainties on the measured data respectively, describe the correlated systematics which are accounted for using the nuisance parameters . The sums over extend over all data points, while the sum over runs over the various sources of correlated systematics.
Instead of the āoptimizedā versionĀ [5] of the Thorne-Roberts schemeĀ [6, 7] used in HERAPDF2.0Ā [4], the FONLL schemeĀ [8] is used as heavy quark mass scheme. The differences between the two fit setups are summarized in Tab.Ā 1. The results of the fit in terms of , switching from the old parametrization to the new one in FONLL scheme, are presented in Tab.Ā 2. A significant reduction of 76 units of the total is observed, which is much larger than the increase of 4 units in the number of parameters.
Moving to the PDF comparison, the gluon, and distributions at the scale 3 GeV2 are shown in Fig.Ā 1, where our fit results are plotted along with the HERAPDF2.0. It can be noticed that the shape is generally smoother for HERAPDF2.0, while a richer structure in the medium- and small- region is present in our PDFs. The comparison between our PDFs and a NNPDF3.0 set obtained fitting only HERA dataĀ [9] is also shown in Fig.Ā 1. This choice has been made because NNPDF has the msot flexible parametrization available on the market. It is noticeable that our PDFs lie inside the NNPDF uncertainty bands in most regions of , while the HERAPDF2.0 PDFs lie outside in many more cases. Furthermore, we observe that the gluon shape predicted by NNPDF is very similar to ours and instead quite different from HERAPDF2.0.
The comparison of HERA data with theoretical predictions using both our fit and HERAPDF2.0 has been inspected in detail. It can been seen in Tab.Ā 2 that the agreement is at the same level, apart from the low- and low- data (contained in the first dataset of the list), where a clear improvement is manifest. Fig.Ā 2 shows the two lowest bins included in the fit, namely at 3.5 GeV2 and 4.5 GeV2, which contain the data at lowest ; here, the flexibility of our parametrization in the small- regime allows to better describe this region.
3 Reducing correlations for a stabler fit
A strong correlation between the parameters of the fit may lead to instabilities. In our parametrization, the parameters governing the small- region () turned out to have significant correlations. In order to reduce such correlation, it is useful to redefine the parameters such that they each multiply a function whose contribution is dominant in a restricted region. Bernstein polynomials provide an easy way to achieve this goal. For instance, a generic polynomial of degree in in the range can be conveniently expressed as a linear combination of the Bernstein basis polynomials
[TABLE]
each of which is peaked in a different region of . The variable can also be replaced by a function of which still ranges from 0 to 1. For instance in Ref.Ā [11] this basis was used in CT fits, but replacing with . In our case, the most obvious choice is to use , which however ranges from 0 to infinity. To circumvent this difficulty, we simply consider a limited range in in which we reparametrize our polynomial in in terms of Bernstein polynomials. Since the data only extend to a small but finite value of , we consider the range , with . Therefore, we can use as a basis for our parametrization the polynomials
[TABLE]
In our specific case, we actually mix a polynomial in and in , Eq.Ā (3). These two variables, or better and , tend to zero in opposite limits, and therefore describe opposite regions. The best option to separate off the two regions described by these two polynomials is to consider two different Bernstein polynomials, one in and one in , suppressing each with the contribution of the other. However, this option does no longer correspond to the polynomial we used in our fits, due to the presence of contributions with both and , which are absent in Eq.Ā (3). Therefore, we propose a simpler choice, in which the polynomial is treated as a ācorrectionā to the polynomial. Our most generic parametrization Eq.Ā (3) then becomes
[TABLE]
where the new āprimedā parameters should be much less correlated among each other, thereby leading to a stabler minimization procedure. Simpler versions with less coefficients can be constructed in similar ways. For instance, when the term is not used, as in our default parametrization Eq.Ā (4), we simply have
[TABLE]
Similarly, one can switch off either the or the term, which leaves the other unmodified. We plan to test this new form of the parametrization once the new version of xFitter will be released.
4 PDF determination with small- resummation
It has been observed that much of the improvement in the when using our new parametrization comes from a better description of the low- low- data which are also responsible for the success of small- resummationĀ [12, 10]. Moreover, the reduction in obtained using our new parametrization is of the same size as the one obtained when including small- resummation effect in theoryĀ [10]. In order to understand the interplay between the inclusion of small- resummation and the use of our new parametrization, PDF fits including small- resummation with our new parametrization have been performed.
The inclusion of small- resummation corrections is achieved using the HELL codeĀ [13, 14, 15, 16]. Here, three different variants of the resummed NNLO+NLL fit have been performed; these fits differ from each other in the treatment of subleading logarithmic contributions. Tab.Ā 3 present the various contributions; it is immediately noticeable that the three fits are of the same quality and in all the cases the reduction with respect to the NNLO fit (third column in Tab.Ā 2) is about 30 units less.
Moving to the PDFs comparison, the gluon PDF is shown at 3 GeV2 and in form of ratios at 104 GeV2 in Fig.Ā 3. We conclude that even though subleading logarithmic contributions may change the size of the effect of resummation on the PDFs, the resummed version of the gluon and the quark-singlet PDFs are always significantly larger at small- than at NNLO.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Bonvini, M. and Giuli, F., A new simple PDF parametrization: improved description of the HERA data , [1902.11125]
- 2[2] Alekhin, S. et al., HERA Fitter , Eur. Phys. J. C 75 (2015) 7 [1410.4412]
- 3[3] Bertone, V. et al., x Fitter 2.0.0: An Open Source QCD Fit Framework , Po S DIS 2017 (2018) 203 [1709.01151]
- 4[4] ZEUS, H 1 collaboration, Combination of measurements of inclusive deep inelastic e±p scattering cross sections and QCD analysis of HERA data , Eur. Phys. J. C 75 (2015) 580 [1506.06042]
- 5[5] Thorne, R.S., The Effect of Changes of Variable Flavour Number Scheme on PD Fs and Predicted Cross Sections , Phys. Rev. D 86 (2012) 074017 [1201.6180]
- 6[6] Thorne, R. S. and Roberts, R. G., An Ordered analysis of heavy flavor production in deep inelastic scattering , Phys. Rev. D 57 (1998) 6871-6898 [hep-ph/9709442]
- 7[7] Thorne, R.S., A Variable-flavor number scheme for NNLO , Phys. Rev. D 73 (2006) 054019 [hep-ph/0601245]
- 8[8] Forte, S. et al., Heavy quarks in deep-inelastic scattering , Nucl. Phys. B 834 (2010) 116-162 [1001.2312]
