Fractal based observables to probe jet substructure of quarks and gluons

Joe Davighi; Philip Harris

arXiv:1703.00914·hep-ph·May 10, 2018

TL;DR

This paper introduces Extended Fractal Observables (EFOs), new infrared safe measures based on fractal properties of jets, which improve quark-gluon discrimination by capturing scale-dependent hadron distribution features.

Contribution

The paper proposes a novel set of fractal-based jet observables (EFOs) that enhance quark-gluon discrimination and are weakly correlated with existing variables.

Findings

1. EFOs are individually effective in discrimination.
2. Inclusion of EFOs improves discrimination performance.
3. EFOs are weakly correlated with existing variables.

Abstract

New jet observables are defined which characterize both fractal and scale-dependent contributions to the distribution of hadrons in a jet. These infrared safe observables, named Extended Fractal Observables (EFOs), have been applied to quark-gluon discrimination to demonstrate their potential utility. The EFOs are found to be individually discriminating and only weakly correlated to variables used in existing discriminators. Consequently, their inclusion improves discriminator performance, as here demonstrated with particle level simulation from the parton shower.

Equations

$$p_{T}D=\frac{\sum_{i}p_{T,i}^{2}}{\sum_{i}p_{T,i}},$$

$$\sigma_{2}=\left(\lambda_{2}/\sum_{i}p_{T,i}^{2}\right)^{1/2},$$

Full text

Joe Davighi (1), Philip Harris (2)

(1) Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Wilberforce Road, Cambridge, UK
(2) CERN, European Organization for Nuclear Research, Geneva, Switzerland

1 Introduction

A hadronic jet is produced from an initial parton via a sequence of perturbative QCD branching interactions (the parton shower), followed by the non-perturbative conversion of partons to the hadrons we observe in experiments (hadronization). A Markov chain description of the parton shower suggests the spatial distribution of partons will exhibit some fractal character Gustafson:1991ru ; PhysRevD.45.4077 ; Andersson:1988ee ; Larkoski:2012eh ; Jankowiak:2012na ; Soper:2011cr , and this will be inherited by the final hadron distribution (invoking local parton-hadron duality 0954-3899-17-10-017 ). However, true scale invariance of the hadron distribution within a jet is broken by the running of the branching probability, termination of the shower due to hadronization, and finite detector resolution. Here we define new observables to characterize jet branching structure, named Extended Fractal Observables (EFOs), which accommodate deviations from fractal structure through simple parametrizations. The idea is to apply box-counting techniques, used widely in the study of dynamical systems and scale invariant objects, to the substructure of QCD jets. Box counting has previously been employed in particle physics to calculate the fractal dimension of electromagnetic showers Ruan:2013iaa for highly granular calorimetric reconstruction. Here, we extend the generality and information content of this technique in our characterization of QCD jets.

The motivation for this study is two-fold. Firstly, we would like to characterize the spatial substructure of jets into a set of new observables. Secondly, we would like to demonstrate the use of such observables in the discrimination of quark and gluon jets. Quark and gluon discrimination has long been used as a tool to enhance the sensitivity of signatures with additional quarks Aad:2014gea ; CMS-PAS-JME-13-002 ; Badger:2016bpw ; Gras:2017jty . In particular, weak boson fusion induced Higgs-production is enhanced due to the distinct signature of two additional hard quark jets in the gluon-dominated forward region of the detector FerreiradeLima:2016gcz ; Gallicchio:2012ez ; Gallicchio:2011xq ; Gallicchio:2011xc ; Abreu:1999rs ; Briere:2007ch ; Pumplin:1992bv ; Seymour:1996np ; Aad:2014gea ; Kilic:2008ub . Quark and gluon tagging are also expected to be useful for physics searches beyond the Standard Model, including the detection of supersymmetric particles Bhattacherjee:2016bpy ; Joshi:2012pu . Additionally, if well designed, these taggers can be further extended to the subjets of boosted boson signatures CMS-PAS-JME-14-002 . We demonstrate that modest improvements can be made to existing quark-gluon taggers by incorporating the new jet observables defined in this paper.

Finally, our construction of pixel-based jet observables resonates with the recent development of the jet image paradigm Komiske:2016rsd ; Cogan:2014oua , in which the energy measured in each detector cell is interpreted as the intensity of a pixel in a 2D image. Within this approach, powerful machine-learning algorithms for classifying images have been brought to bear on a range of jet classification problems. This has included tagging boosted weak bosons deOliveira:2015xxd ; Cogan:2014oua , boosted top quarks Almeida:2015jua , and heavy-flavors Baldi:2016fql ; Guest:2016iqz .

We define EFOs in the following section. In section 3 we analyze the performance of these observables in quark-gluon discrimination, before concluding.

2 Extended Fractal Observables

The computation of the EFOs is performed on a jet by jet basis using a variation of the Minkowski-Bouligand (box-counting) dimension, as follows.

2.1 Variable definitions

To define our variables we implement a two-stage recipe. First, the jet cone is divided in the familiar $\left(\eta,\phi\right)$ angular coordinates into a square grid of cells, each cell having side-length $\epsilon$. For a given scale $\epsilon$, we count the number of cells $N_{hits}\left(\epsilon\right)$ which register particle hits with a total transverse momentum greater than some pixel-level soft cutoff, in this study chosen to be $p_{T}>1.0$ GeV. This low energy cut represents a limiting threshold due to detector resolution. This counting is iterated over a range of scales, as is illustrated in Figure 1. The second stage is to fit smooth functions to the variation of $y=\log N_{hits}\left(\epsilon\right)$ with $x=\log\left(1/\epsilon\right)$, and to extract the parameters of the fit as a set of (correlated) jet observables, which we call Extended Fractal Observables (EFOs). This is a generalization of the traditional box-counting method, in which only linear functions $y=mx+c$ are fitted, with the gradient $m$ identified as the fractal dimension Ruan:2013iaa .
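The first stage of this recipe can be sketched in a few lines of Python. This is a minimal illustration rather than the authors' implementation: the function name `count_hits`, the tuple layout of the input, and the choice to anchor the grid at the corner $(-R,-R)$ relative to the jet axis are all assumptions made here.

```python
import math
from collections import defaultdict

def count_hits(particles, eps, radius=0.4, pt_cut=1.0):
    """Count grid cells of side eps whose summed pT exceeds pt_cut.

    particles: iterable of (d_eta, d_phi, pt), with angles measured
    relative to the jet axis (an assumption of this sketch).
    """
    cell_pt = defaultdict(float)
    for d_eta, d_phi, pt in particles:
        # Integer cell indices on a grid anchored at (-radius, -radius).
        ix = math.floor((d_eta + radius) / eps)
        iy = math.floor((d_phi + radius) / eps)
        cell_pt[(ix, iy)] += pt
    # Pixel-level cut: compare the summed pT of each cell with the threshold.
    return sum(1 for total in cell_pt.values() if total > pt_cut)

# Toy jet: two collinear particles, one hard particle, one soft particle.
jet = [(0.02, 0.02, 0.8), (0.02, 0.02, 0.7), (0.25, -0.25, 5.0), (-0.25, 0.25, 0.3)]
n_hits = count_hits(jet, eps=0.1)
```

Repeating the call over a range of `eps` values gives the points $(x, y)$ to which the fit functions are applied.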

Indeed, in Figure 2 there is no distinct region of linear scaling, as would be needed to extract a fractal dimension. Rather, $\log N_{hits}\left(\epsilon\right)$ levels off smoothly from large to small scales as saturation is approached, motivating a non-linear fit to extract whatever information this curve might encode about the jet. In particular, the hadronization region (i.e. at small $\epsilon$) carries non-perturbative information sensitive to the flavor of the jet. The observed curves are distinct between quarks, gluons and b-quarks, as summarized in Figures 2 and 3. This scaling is a fundamental property of QCD resulting from the differences in the splitting of quarks and gluons. Further measurements of this scaling would allow for an alternative approach to extracting QCD properties such as the strong coupling constant Bolzoni:2013rsa ; Bolzoni:2012ii .

The generic plateauing curves in Figure 2 can be fitted by almost any non-linear function (given a suitably restricted range in $x$ ), so we studied fit functions with at most three parameters, for speed and robustness of fitting. Fits were carried out simply by a binned $\chi^{2}$ minimization of the chosen function. Example fit functions included the following:

1. logarithmic fits of the form $y=p_{0}+p_{1}x+p_{2}\log x$.
2. quadratic fits: $y=p_{0}+p_{1}x+p_{2}x^{2}$.
3. hyperbolic tangent fits: $y=p_{0}+p_{1}\tanh(x-p_{2})$.

The values of the best fit parameters $\left\{p_{i}\right\}$ for each fitting function constitute three possible sets of EFOs. For a polynomial in $x=\log(1/\epsilon)$ , like the quadratic fit function, the fit reduces to a matrix inversion and thus has a well-defined convergence. The other two parametrizations are not polynomials, hence we perform a $\chi^{2}$ minimization.
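As an illustration of the second stage, the logarithmic form can be fitted as follows. This is a sketch under stated assumptions: the function name `fit_logarithmic` is hypothetical, and because this form is linear in $(p_{0},p_{1},p_{2})$ the sketch uses an unweighted least-squares solve with `numpy.linalg.lstsq` as a stand-in for the binned $\chi^{2}$ minimization described in the text.

```python
import numpy as np

def fit_logarithmic(eps, n_hits):
    """Fit y = p0 + p1*x + p2*log(x), with x = log(1/eps) and y = log(N_hits).

    Requires eps < 1 (so that x > 0) and n_hits > 0 at every scale.
    """
    x = np.log(1.0 / np.asarray(eps, dtype=float))
    y = np.log(np.asarray(n_hits, dtype=float))
    # The model is linear in the parameters, so build a design matrix
    # with one column per term: 1, x, log(x).
    design = np.column_stack([np.ones_like(x), x, np.log(x)])
    params, *_ = np.linalg.lstsq(design, y, rcond=None)
    return params  # array [p0, p1, p2]
```

A weighted fit, closer to a true $\chi^{2}$ minimization, would scale each row of `design` and each entry of `y` by the inverse uncertainty on that point before solving.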

Functions which actually saturate, such as the hyperbolic tangent parametrization above, are more physically motivated because they can model the saturation itself (asymptoting to the jet multiplicity). However, for the range of box scales used in our study (of width $\epsilon\geq 0.05$; see 2.2 below), and for all but the lowest $p_{T}$ jets, the non-saturating fit functions also provide adequate models for the observed scaling. For the purpose of quark-gluon discrimination (see section 3), the logarithmic fitting function was found to give the best discrimination performance of the three functions above (see Figure 6 to compare the performance between the logarithmic and hyperbolic tangent fitting functions).

2.2 The range of box-counting scales

The range of angular scales $\epsilon$ has been chosen by paving the jet cone with a square grid of $N\times N$ cells, where the splitting scale $N$ ranges in integer steps from 3 to 16. For each $N$, the angular scale is $\epsilon=2R/N$, where $R$ is the jet radius, in this study $R=0.4$. The coarsest $\epsilon$ scale chosen, corresponding to $N=3$, is essentially the coarsest scale carrying potentially discriminating information (for $N=2$ the jet cone would be divided into four quarters, all of which will register a hit for realistic jet shapes). The finest $\epsilon$ scale chosen is $\epsilon_{min}=0.8/16=0.05$, because this is approximately the angular detector resolution in both LHC experiments, CMS and ATLAS Aad:2008zzm ; Chatrchyan:2008aa . For the $p_{T}\geq 100$ GeV jets studied here, the number of hits is just beginning to saturate at this scale (see Figure 2), so we are probing into the hadronization region prior to the flat plateau.
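The set of scales described above is simple to enumerate. The helper below is a hypothetical illustration (the name `box_scales` is not from the paper) that lists each $N$ with its $\epsilon=2R/N$.

```python
def box_scales(radius=0.4, n_min=3, n_max=16):
    """Return (N, eps) pairs with eps = 2R/N for N = n_min..n_max."""
    return [(n, 2.0 * radius / n) for n in range(n_min, n_max + 1)]

scales = box_scales()
# 14 scales in total, from eps = 0.8/3 (coarsest) down to eps = 0.05 (finest).
```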

Finally, we would like to highlight that these fractal-based observables are similar in spirit to calculating subjet rates of jets Gallicchio:2011xq ; Bhattacherjee:2015psa , given subjets clustered using the $p_{T}$ -independent Cambridge-Aachen algorithm Dokshitzer:1997in . Both observables compute $p_{T}$ -independent branching information on a succession of angular scales down to some threshold. And both observables perform what is essentially a further clustering on the substructure of the jet to extract this information pertaining to the branching history of the jet. In light of this, the EFO approach could be extended to utilize subjet counts (instead of hit grid cell counts) to assign scale-dependent multiplicities $N(\epsilon)$ .

2.3 Infrared and Collinear safety

Preserving infrared and collinear safety ensures calculability in perturbative QCD. An observable is infrared (collinear) safe if its value is unchanged by the emission of soft (co-moving) particles. The EFOs, as defined in 2.1 with a pixel-level soft cutoff, are fully IRC safe.

Firstly, the box counting procedure is intrinsically collinear safe: if one particle splits into two particles with the same $\left(\eta,\phi\right)$ coordinates, we still count just one cell hit by both daughter particles, at any finite scale of probing. Hence collinear splittings will not affect the number of cells $N_{hits}\left(\epsilon\right)$ to register particle hits at any choice of scale. On the other hand, infrared safety of the EFOs can only be engineered by imposing some low momentum cutoff to cleanse the jet of its soft constituents. However, this soft cutoff must be implemented consistently with collinear safety. If we simply discarded all soft hadrons with, say, $p_{T}<1$ GeV, this would spoil collinear safety. To see this, consider the following pathological example: if a particle with $p_{T}=1.5$ GeV splits into two comoving particles with $p_{T}=0.8$ GeV and $p_{T}=0.7$ GeV, then both would be discarded by a particle-level soft cut, and so $N_{hits}\left(\epsilon\right)$ would not be invariant under this collinear splitting.

This is remedied by defining a pixel-level (rather than particle-level) soft cutoff. That is, we only consider a cell to register a hit if it measures a total $p_{T}$ greater than our soft cutoff of $1$ GeV. This way, if the troublesome $1.5$ GeV particle in the example above splits collinearly into any number of daughters, the pixel still measures a total $p_{T}$ of $1.5$ GeV, and so registers a hit regardless of these splittings. Thus, box-counting with a pixel-level soft cutoff is fully IRC safe. In addition, a pixel-level rather than particle-level cut is more naturally realized experimentally since a pixel hit is consistent with an LHC detector cell.
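The difference between the two cuts can be checked directly on the example above. The two helper functions below are hypothetical names introduced for this sketch; each takes the list of particle $p_{T}$ values falling in a single cell.

```python
def hit_particle_level(cell_pts, cut=1.0):
    """Particle-level cut: discard soft particles, then ask if any survive."""
    return any(pt > cut for pt in cell_pts)

def hit_pixel_level(cell_pts, cut=1.0):
    """Pixel-level cut: sum the pT in the cell, then compare with the cut."""
    return sum(cell_pts) > cut

before = [1.5]       # one particle in the cell
after = [0.8, 0.7]   # the same particle after a collinear splitting

# The particle-level decision changes under the splitting; the pixel-level one does not.
particle_level = (hit_particle_level(before), hit_particle_level(after))
pixel_level = (hit_pixel_level(before), hit_pixel_level(after))
```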

Numerically, the performance of a quark-gluon discriminant built using the EFOs was found to be essentially insensitive to varying the value of this $p_{T}$ cut (over values between $0.1$ and $1.0$ GeV), suggesting the variables are not strongly shaped by the IR emission, at least in simulations. In the following section, a $p_{T}$ cut of $1$ GeV is used throughout. Finally, we acknowledge that pixel-level cutoffs have been used previously to ensure IRC safety in the context of jet image analyses (for example in Komiske:2016rsd ).

3 Performance in Quark-Gluon Discrimination

We now investigate whether these observables might be a useful new tool in the important and challenging problem of distinguishing light quarks from gluon jets.

3.1 Event generation and setup

In this study, we use QCD dijet samples at a center-of-mass energy of 13 TeV. Because previous quark-gluon studies have revealed that discrimination performance varies considerably between the different generators Larkoski:2014pca ; Aad:2014gea ; CMS-PAS-JME-13-002 ; Gallicchio:2012ez ; Badger:2016bpw [footnote 1], we here produce and shower events (at leading order) using both Herwig++ (version 2.7.0 with tune UE-EE-5C) Bahr:2008pv ; Seymour:2013qka and Pythia 8 (version 8.185 with tune CUETP8M1) Khachatryan:2015pea , with of order 150k events in each. Jets are clustered with the anti-$k_{T}$ algorithm using the final state particles following showering and hadronization; a cone size of $R=0.4$ and the FastJet code package Cacciari:2011ma are used for the jet clustering. The EFOs (here computed using the logarithmic fitting function), along with a set of other established jet observables, have been computed for the highest $p_{T}$ jet in each event. We define the flavor of that jet by matching to the highest-$p_{T}$ parton within $R<0.3$ of the jet axis, and classify the event as signal (background) if matched to a light quark (gluon) [footnote 2].

Footnote 1: Herwig has been consistently seen to give the more conservative estimates of discrimination power, both with respect to Pythia and real LHC data.

Footnote 2: Note that b(bottom)-jets may be efficiently identified using a secondary vertex tagger, and separately vetoed.
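The flavor-matching step can be sketched as follows. Everything here is illustrative: the function names, the dictionary layout of the partons, and the choice to treat only u, d and s (PDG codes 1 to 3) as light quarks are assumptions of this sketch, since the text does not specify them. The gluon code 21 follows the standard PDG Monte Carlo numbering.

```python
import math

def delta_r(eta1, phi1, eta2, phi2):
    """Angular distance in the (eta, phi) plane, with phi wrapped to [-pi, pi)."""
    d_phi = (phi1 - phi2 + math.pi) % (2.0 * math.pi) - math.pi
    return math.hypot(eta1 - eta2, d_phi)

def match_flavor(jet_eta, jet_phi, partons, max_dr=0.3):
    """Label a jet by the highest-pT parton within max_dr of its axis.

    partons: list of dicts with keys 'eta', 'phi', 'pt', 'pdg_id'.
    Returns 'quark', 'gluon', or None if no suitable parton is matched.
    """
    nearby = [p for p in partons
              if delta_r(jet_eta, jet_phi, p["eta"], p["phi"]) < max_dr]
    if not nearby:
        return None
    leading = max(nearby, key=lambda p: p["pt"])
    if leading["pdg_id"] == 21:
        return "gluon"
    if abs(leading["pdg_id"]) in (1, 2, 3):  # assumed light-quark definition
        return "quark"
    return None  # e.g. heavy flavor, handled separately
```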

As a baseline for comparison, we shall consider the variables currently used by the Compact Muon Solenoid (CMS) quark-gluon tagger, which are CMS-PAS-JME-13-002 : i) the total number of reconstructed particles in the jet (the multiplicity) Alexander:1995bk ; ii) the $p_{T}D$ variable ( $C_{1}^{\beta=0}$ )Larkoski:2013eya ,

$$p_{T}D=\frac{\sum_{i}p_{T,i}^{2}}{\sum_{i}p_{T,i}},$$

where the sum over $i$ runs over the constituents of the jet; this variable describes how the transverse momentum is distributed among the particles in the jet; and iii) $\sigma_{2}$, the ($p_{T}$-weighted) semi-minor axis of the jet in the $(\eta,\phi)$ plane CMS-PAS-JME-13-002 , defined by

$$\sigma_{2}=\left(\lambda_{2}/\sum_{i}p_{T,i}^{2}\right)^{1/2},$$

where $\lambda_{2}$ is the smaller eigenvalue of the $2\times 2$ symmetric matrix with components $M_{11}=\Sigma_{i}p_{T,i}^{2}\Delta\eta_{i}^{2}$, $M_{22}=\Sigma_{i}p_{T,i}^{2}\Delta\phi_{i}^{2}$, and $M_{12}=-\Sigma_{i}p_{T,i}^{2}\Delta\eta_{i}\Delta\phi_{i}$. Throughout this study, we build multi-variable quark-gluon discriminants using a boosted decision tree (BDT) with adaptive boosting, implemented using the Toolkit for Multivariate Analysis (TMVA). The $p_{T}$ spectra of the quark and gluon samples are reweighted to match each other, so as to avoid selection biases induced by kinematic differences in the simulation.
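The semi-minor axis $\sigma_{2}$ follows directly from the matrix components given above. The sketch below uses a hypothetical function name and assumes that $\Delta\eta_{i}$ and $\Delta\phi_{i}$ are supplied as offsets from the jet axis; the smaller eigenvalue of the symmetric $2\times 2$ matrix is obtained in closed form.

```python
import math

def sigma2(constituents):
    """Semi-minor axis of a jet from (d_eta, d_phi, pt) constituents."""
    m11 = m22 = m12 = sum_pt2 = 0.0
    for d_eta, d_phi, pt in constituents:
        w = pt * pt  # pT-squared weight
        m11 += w * d_eta * d_eta
        m22 += w * d_phi * d_phi
        m12 -= w * d_eta * d_phi
        sum_pt2 += w
    # Eigenvalues of [[m11, m12], [m12, m22]] are mean +/- half_gap.
    mean = 0.5 * (m11 + m22)
    half_gap = math.hypot(0.5 * (m11 - m22), m12)
    lam2 = max(mean - half_gap, 0.0)  # guard against tiny negative rounding
    return math.sqrt(lam2 / sum_pt2)

# Four equal-pT particles on the axes of an ellipse with semi-axes 0.1 and 0.05.
toy = [(0.1, 0.0, 1.0), (-0.1, 0.0, 1.0), (0.0, 0.05, 1.0), (0.0, -0.05, 1.0)]
value = sigma2(toy)
```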

3.2 Results

We first compare the discriminator performance of single variables and the correlations between them, before going on to compare multi-variable taggers built with and without inclusion of the new EFO observables.

We can measure discriminator performance by receiver operating characteristic (ROC) curves, which plot background rejection against signal efficiency. Roughly speaking, the more convex the curve, the better the performance. The left plot of Figure 4, made using the Herwig samples, shows that the EFOs [footnote 3] are individually well-discriminating, particularly if we seek high signal efficiency. Their performance is significantly better than that of the jet multiplicity variable.

Footnote 3: We use a BDT discriminator built from the combination of the three EFOs, $p_{0}$, $p_{1}$ and $p_{2}$. While the combination of all three EFOs adds little discrimination beyond that of a single EFO due to their near-perfect correlation, the selection of any single $p_{i}$ would be arbitrary for the sake of this comparison.
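For concreteness, a ROC curve of the kind described above can be built from per-jet discriminator scores as sketched below. The function name is hypothetical, and the convention that higher scores are more quark-like (signal-like) is an assumption of this sketch.

```python
def roc_points(signal_scores, background_scores):
    """Return (signal efficiency, background rejection) pairs.

    One point is produced per distinct score used as a threshold;
    jets with score >= threshold are tagged as signal.
    """
    thresholds = sorted(set(signal_scores) | set(background_scores))
    points = []
    for t in thresholds:
        sig_eff = sum(s >= t for s in signal_scores) / len(signal_scores)
        bkg_rej = sum(b < t for b in background_scores) / len(background_scores)
        points.append((sig_eff, bkg_rej))
    return points

points = roc_points([0.9, 0.8, 0.4], [0.1, 0.2, 0.7])
```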

The right plot of Figure 4 presents the linear correlation coefficients (calculated using the TMVA toolkit) between the EFOs and the existing CMS quark-gluon tagger variables: multiplicity, $p_{T}D$ and $\sigma_{2}$. We also include a computation of the fractal dimension, which has been calculated from a linear fit over a small range of box scales. Strong correlations are present amongst the EFOs, as is natural given they are parameters derived from the same fit. However, their correlations with the other variables are no greater than 43% (for either quarks or gluons) [footnote 4]. Interestingly, the EFOs are most highly correlated with $\sigma_{2}$, not multiplicity as might have been expected. This evidence suggests the discrimination power of the EFOs is not simply a result of higher multiplicities in gluon jets, and therefore that the addition of these parameters to a quark-gluon discriminator might improve performance.

Footnote 4: Note that the traditional fractal dimension is more strongly correlated to existing QGD variables, particularly multiplicity.

We find that replacing the multiplicity variable in the existing CMS quark-gluon tagger with the EFO variables yields a gain in discriminator performance, albeit only a modest one. This gain is seen using both Herwig and Pythia event generators (with the setup described above) in the ROCs presented in Figure 5, which are for jets with $p_{T}\geq 100$ GeV. We see the performance in Pythia is significantly better than Herwig for each combination of variables, consistent with previous studies Aad:2014gea ; CMS-PAS-JME-13-002 ; Gallicchio:2012ez ; Badger:2016bpw .

Moreover, the incremental gain upon replacing multiplicity with the EFOs is larger in Pythia than Herwig, so Herwig gives the more conservative estimate of the impact of including the EFOs. We see the gain in performance (relative to a baseline tagger using just $p_{T}D$ and $\sigma_{2}$) more clearly in Figure 6, with the left panel for Herwig and the right for Pythia. The gain is at the level of $1$ to $2\%$ in the more conservative Herwig setup, and slightly larger in Pythia (note the different scaling of the y-axis). To emphasize a previous point, these gains were found to be stable across different values of the soft $p_{T}$ cut. Finally, we investigated how the performance varies with energy scale, by performing the analysis in $p_{T}$ bins of $50$ to $100$ GeV, $100$ to $200$ GeV, and $200$ to $500$ GeV. Discrimination was found to increase with $p_{T}$ in both Herwig and Pythia (see Figure 7 for the Herwig results).

Combining all four variables (multiplicity, $p_{T}D$, $\sigma_{2}$ and the EFOs) was seen to give no further improvement. This suggests all the information from multiplicity is captured by the EFOs [footnote 5], while the converse is not true. In summary, we have presented evidence in this study that the Extended Fractal Observables provide an additional handle that captures the salient features of jet multiplicity, incorporates new information from showering and hadronization, and which is also better behaved under IRC emission (see 2.3).

Footnote 5: This is unsurprising, because jet multiplicity is simply the asymptotic number of hits as we approach the saturation region.

4 Conclusions

In this study we defined new jet observables, the Extended Fractal Observables, by a generalization of the box-counting method used in the study of fractal systems. Defined with a pixel-level low momentum cutoff, these observables are infrared and collinear safe. We have then sought to apply the EFOs to improve quark-gluon discrimination. At the generator level, we find some modest improvement in discrimination by gluon rejection when we replace multiplicity with the EFOs in the existing CMS tagger, across both Herwig++ and Pythia 8. Extending the performance of these new variables to include detector effects can naturally be performed in the LHC environment with the CMS Particle Flow algorithm CMS-PAS-PFT-09-001 in conjunction with the PUPPI algorithm Bertolini:2014bba to reconstruct particle candidates in the presence of high pile-up.

5 Outlook

This method of studying jet substructure is a new approach. As such, there are many directions in which we would like to proceed, including:

1. Exploring particle hits in a 3-dimensional coordinate space spanned by $\eta$, $\phi$ and $z^{-1}$, where $z$ is the fractional transverse momentum of the jet constituent.

2. Applying the EFOs beyond quark-gluon discrimination, for example to the identification of pile-up jets, or initial state radiation.

3. These box-counting methods extend very naturally from the substructure of a single jet to a whole-event analysis. Such a novel approach may provide new insight into searches for new physics topologies such as those in supersymmetry or top quark pair production Soper:2014rya .

4. Furthermore, box-counting analyses could provide a useful characterization of event shapes in heavy ion collisions, where studies of jet properties beyond jet reconstruction are traditionally difficult, but well motivated Chatrchyan:2013kwa ; CMS-PAS-HIN-15-004 ; CMS-PAS-HIN-16-006 .

5. Finally, we would like to emphasize that the calculation of EFOs on quark and gluon jets probes parton shower scaling that results from the QCD color factor ratio. Calculating EFOs on cosmic ray air shower profiles Brooijmans:2016lfv could therefore help discriminate QCD-induced air showers from more interesting signals; of particular interest are showers induced by electroweak sphalerons. Experimentally, the calculation of EFOs in this air shower context is conceptually appealing: the 1660 individual Cherenkov detectors (spread over 3000 km²) of the Pierre Auger Observatory in Argentina ThePierreAuger:2015rma would naturally function as the finest-scale cells in our box-counting algorithm. These techniques could therefore be useful in probing physics at energies far beyond that of the LHC.

6 Acknowledgments

JD’s work has been supported by The Cambridge Trust, and by the STFC consolidated grant ST/L000385/1. We thank the CERN summer student program where this work was initiated. We also thank Andrew Larkoski for his insightful comments when performing these studies, and Bryan Webber for helpful discussions. Finally, we thank Eric Metodiev for helpful comments.

Bibliography

[1] G. Gustafson and A. Nilsson, Multifractal dimensions in QCD cascades, Z. Phys. C 52 (1991) 533–542.
[2] J. D. Bjorken, Fractal phase space as a diagnostic tool for high-energy multijet processes, Phys. Rev. D 45 (1992) 4077–4087.
[3] B. Andersson, P. Dahlkvist, and G. Gustafson, An Infrared Stable Multiplicity Measure on QCD Parton States, Phys. Lett. B 214 (1988) 604–608.
[4] A. J. Larkoski, QCD Analysis of the Scale-Invariance of Jets, Phys. Rev. D 86 (2012) 054004, [1207.1437].
[5] M. Jankowiak and A. J. Larkoski, Angular Scaling in Jets, JHEP 04 (2012) 039, [1201.2688].
[6] D. E. Soper and M. Spannowsky, Finding physics signals with shower deconstruction, Phys. Rev. D 84 (2011) 074002, [1102.3480].
[7] Y. L. Dokshitzer, V. A. Khoze, and S. I. Troyan, On the concept of local parton-hadron duality, Journal of Physics G: Nuclear and Particle Physics 17 (1991), no. 10, 1585.
[8] M. Ruan, D. Jeans, V. Boudry, J.-C. Brient, and H. Videau, Fractal Dimension of Particle Showers Measured in a Highly Granular Calorimeter, Phys. Rev. Lett. 112 (2014), no. 1, 012001, [1312.7662].