Measurement of the top quark mass in the dileptonic t t-bar decay channel using the mass observables M[bl], M[T2], and M[blnu] in pp collisions at sqrt(s) = 8 TeV
CMS Collaboration

TL;DR
This paper presents a precise measurement of the top quark mass in dileptonic decay channels using innovative kinematic observables and Gaussian process modeling, based on 8 TeV proton-proton collision data from the CMS experiment.
Contribution
The study introduces a combined analysis of multiple mass observables with Gaussian process regression to improve top quark mass measurement accuracy.
Findings
Measured top quark mass as 172.22 GeV with high precision.
Utilized novel kinematic observables M[bl], M[T2], and M[blnu].
Achieved detailed modeling of observable shapes and sensitivities.
Abstract
A measurement of the top quark mass (M[t]) in the dileptonic t t-bar decay channel is performed using data from proton-proton collisions at a center-of-mass energy of 8 TeV. The data was recorded by the CMS experiment at the LHC and corresponding to an integrated luminosity of 19.7 +/- 0.5 inverse femtobarns. Events are selected with two oppositely charged leptons (l = e, mu) and two jets identified as originating from b quarks. The analysis is based on three kinematic observables whose distributions are sensitive to the value of M[t]. An invariant mass observable, M[bl], and a `stransverse mass' observable, M[T2], are employed in a simultaneous fit to determine the value of M[t] and an overall jet energy scale factor (JSF). A complementary approach is used to construct an invariant mass observable, M[blnu], that is combined with M[T2] to measure M[t]. The shapes of the observables,âŚ
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\cmsNoteHeader
TOP-15-008
\RCS
\RCS \RCS
\cmsNoteHeader
TOP-15-008
Measurement of the top quark mass in the dileptonic \ttbar decay channel using the mass observables , , and in pp collisions at \TeV
Abstract
A measurement of the top quark mass () in the dileptonic decay channel is performed using data from proton-proton collisions at a center-of-mass energy of 8\TeV. The data was recorded by the CMS experiment at the LHC and corresponds to an integrated luminosity of . Events are selected with two oppositely charged leptons () and two jets identified as originating from \PQbquarks. The analysis is based on three kinematic observables whose distributions are sensitive to the value of . An invariant mass observable, , and a âstransverse massâ observable, , are employed in a simultaneous fit to determine the value of and an overall jet energy scale factor (JSF). A complementary approach is used to construct an invariant mass observable, , that is combined with to measure . The shapes of the observables, along with their evolutions in and JSF, are modeled by a nonparametric Gaussian process regression technique. The sensitivity of the observables to the value of is investigated using a Fisher information density method. The top quark mass is measured to be .
0.1 Introduction
The top quark mass is a fundamental parameter of the standard model (SM), and an important component in global electroweak fits evaluating the self-consistency of the SM [1]. In addition, the value of has implications for the stability of the SM electroweak vacuum due to the role of the top quark in the quartic term of the Higgs potential [2]. Measurements of have been conducted by the CDF and \DZEROexperiments at the Tevatron, and by the ATLAS and CMS experiments at the CERN LHC. These measurements are typically calibrated against the top quark mass parameter in Monte Carlo (MC) simulation. Studies suggest that this parameter can be related to the top quark mass in a theoretically well-defined scheme with a precision of about 1\GeV[3]. A combination of measurements including all four experiments and \ttbar decay channels with zero, one, or two high-\ptelectrons or muons (all-hadronic, semileptonic, and dileptonic, respectively) gives a value of [4] for the top quark mass. Currently, the most precise experimental determination of is provided by CMS using a combination of measurements in all \ttbar decay channels, yielding a value of  [5]. In the dileptonic \ttbar decay channel, the ATLAS [6] and CMS [5] Collaborations have recently determined to be and , respectively. This paper presents a reanalysis of the dileptonic \ttbardata set recorded in 2012, with a primary motivation of reducing the systematic uncertainties in determination.
The dileptonic top quark pair (\ttbar) decay topology, , with , presents a challenge in mass measurement arising primarily from the presence of two neutrinos in the final state. While the undetected \ptvec of a single final-state neutrino in a semileptonic \ttbar decay can be inferred from the momentum imbalance in the event, the allocation of momentum imbalance between the two neutrinos in a dileptonic \ttbar decay is unknown a priori. For this reason, the dileptonic \ttbar system is kinematically underconstrained, and mass determination cannot be easily conducted on an event-by-event basis. Instead, the mass of the parent top quarks in the dileptonic \ttbarsystem can be extracted from kinematic features over an ensemble of events, with the help of appropriate observables and reconstruction techniques.
The measurement reported in this paper is based on a set of observables that have been proposed specifically for mass reconstruction in underconstrained decay topologies. These observables include the invariant mass, , of a  system, a âstransverse massâ variable, , constructed with the and daughters of the \ttbar system [7, 8, 9], and the invariant mass of a  system, , where the neutrino momentum is estimated by the -assisted on-shell (MAOS) reconstruction technique [10]. The MAOS reconstruction technique builds on  by exploiting the neutrino momenta estimates that are by-products of the  algorithm. The sensitivity of the , , and  observables to the value of is investigated using a Fisher information density method. Distributions of  and  in dileptonic events contain a sharp edge descending to a kinematic endpoint, the location of which is sensitive to the value of . Recently, masses of the top quark, W boson (), and neutrino () were extracted in a simultaneous fit using the endpoints of these distributions in dileptonic \ttbar events [11]. The , , and MAOS  observables are described in more detail in Section 0.4.
One of the dominant sources of systematic uncertainty limiting the precision of this measurement comes from the overall uncertainty in jet energy scale (JES). To address the JES uncertainty, we introduce a technique that uses the  and  observables to determine an overall jet energy scale factor (JSF) simultaneously with the top quark mass, where the JSF is defined as a multiplicative factor scaling the four-vectors of all jets in the event. Similar techniques have been developed for the all-hadronic and semileptonic \ttbar channels, where the jet pair originating from a \PW boson decay is used to determine the JSF [5]. Because light-quark jets from the \PW boson decay are used to calibrate the energy scale of \PQbjets arising from the \cPqt and decays, these methods are sensitive to flavor-dependent uncertainties that emerge from differences in the response of \PQbjets and light-quark jets. In the method featured here, the JSF is determined in the dileptonic \ttbar channel without relying on a \PW boson decaying to jets. Instead, it achieves sensitivity to the JSF through the kinematic differences between \PQbjets, which are subject to JSF scaling, and leptons, which are not. Because it does not use light quarks from a hadronic W boson decay, this approach is insensitive to flavor-dependent JES uncertainties.
To model the , , and MAOS Â distribution shapes, we use a Gaussian process (GP) regression technique [12, 13]. This technique is nonparametric, and thus largely model-independent. It is effective in modeling distribution shapes when no theoretical guidance is available to specify a functional form. The distribution shapes can conveniently be modeled as functions of multiple variables. In this analysis, three variables are used: the value of the relevant observable (, , or ), , and the JSF. The shapes are determined using simulated events generated with seven different values of ranging from to , and with five values of JSF, ranging from to , applied to the jets in each event. Each shape ultimately models the distributions of the observables together with their evolution in and in JSF.
0.2 The CMS detector
The central feature of the CMS apparatus is a superconducting solenoid of 6\unitm internal diameter, providing a magnetic field of 3.8\unitT. Within the solenoid volume are a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter (ECAL), and a brass and scintillator hadron calorimeter (HCAL), each composed of a barrel and two endcap sections. The tracker has a track-finding efficiency of more than 99% for muons with transverse momentum and pseudorapidity . The ECAL is a fine-grained hermetic calorimeter with quasi-projective geometry, and is distributed in the barrel region of and in two endcaps that extend up to . The HCAL barrel and endcaps similarly cover the region . In addition to the barrel and endcap detectors, CMS has extensive forward calorimetry. Muons are measured in gas-ionization detectors, which are embedded in the steel flux-return yoke outside of the solenoid. The silicon tracker and muon systems play a crucial role in the identification of jets originating from the hadronization of \PQbquarks [14]. Events of interest are selected using a two-tiered trigger system [15]. The first level, composed of custom hardware processors, uses information from the calorimeters and muon detectors to select events at a rate of around 100\unitkHz within a time interval of less than 4\mus. The second level, known as the high-level trigger, consists of a farm of processors running a version of the full event reconstruction software optimized for fast processing, and reduces the event rate to less than 1\unitkHz before data storage. A more detailed description of the CMS detector, together with a definition of the coordinate system used, can be found in Ref. [16].
0.3 Data sets and event selection
We select dileptonic \ttbar events from a data set recorded at \TeVduring 2012 corresponding to an integrated luminosity of  [17]. Events are required to pass one of several triggers that require at least two leptons, , , or , where the leading (higher-\pt) lepton satisfies and the subleading lepton satisfies .
A particle-flow (PF) algorithm [18, 19] is used to reconstruct and identify each individual particle in an event by combining information from various subdetectors of CMS. Each event is required to have at least one reconstructed collision vertex, with the primary vertex selected as the one containing the largest of associated tracks. Electron candidates are reconstructed by matching a cluster of energy deposits in the ECAL to a reconstructed track [20]. They are required to satisfy and . Muon candidates are reconstructed in a global fit that combines information from the silicon tracker and muon system [21], and must have and . A requirement on the relative isolation is imposed inside a cone around each lepton candidate, where is the azimuthal angle in radians. A parameter is defined, where the sum includes all reconstructed PF candidates inside the cone (excluding the lepton itself), and is the lepton \pt. Electron (muon) candidates are required to have with . Events selected offline are required to contain exactly two such leptons, , , or , with opposite charge. For events containing an \EEÂ or \MMÂ pair, contributions from low-mass resonances are suppressed by requiring an invariant mass of the lepton pair , while contributions from \PZÂ boson decays are suppressed by requiring that , where [22].
Hadronic jets are clustered from PF candidates with the infrared and collinear safe anti-\ktalgorithm [23], with a distance parameter of 0.5, as implemented in the \FASTJETpackage [24]. The jet momentum is determined as the vectorial sum of all particle momenta in this jet. Corrections to the JES and jet energy resolution (JER) are derived using MC simulation, and are confirmed with measurements of the energy balance in quantum chromodynamics (QCD) dijet, QCD multijet, photon+jet, and Z+jet events [25]. Muons, electrons, and charged hadrons originating from multiple collisions within the same or nearby bunch crossings (pileup), are not included in the jet reconstruction. Contributions from neutral hadrons originating from pileup are estimated and subtracted from the JES. Jets originating from the hadronization of \PQbquarks are identified with a combined secondary vertex (CSV) \PQbtagging algorithm [14], combining information from the jet secondary vertex with the impact parameter significances of its constituent tracks. The algorithm yields a tagging efficiency of approximately and a misidentification rate of . Events are required to contain at least two jets that pass the \PQbtagging algorithm and satisfy and . In this analysis, the two jets satisfying these requirements that have the highest CSV discriminator values are referred to as \PQbjets.
The missing transverse momentum vector is defined as , where the sum includes all reconstructed PF candidates in an event [26]. Its magnitude is referred to as \ptmiss. Corrections to the JES and JER are propagated into \ptmiss, as well as an offset correction that accounts for pileup interactions. An additional correction mitigates a mild azimuthal dependence, arising from imperfect detector alignment and other effects, which is observed in the reconstructed \ptmiss. To further suppress contributions from DrellâYan processes, events containing an \EEÂ or \MMÂ pair are required to have .
Simulated \ttbar signal events are generated with the \MADGRAPH5.1.5.11 matrix-element generator [27], combined with MadSpin to include spin correlations of the top quark decay products [28], \PYTHIA6.426 with the tune for parton showering [29], and \TAUOLAfor the decay of leptons [30]. Parton distribution functions (PDFs) are described by the CTEQ6L1 set [31]. The \ttbar signal events are generated with seven different values of ranging from to . The contribution from the \PW associated single top quark production (t\PW) is simulated with \POWHEG1.380 [32, 33, 34, 35], where the value of is assumed to be . Background events from \PW+jets and Z+jets production are generated with \MADGRAPH 5.1.3.30, and contributions from \PW\PW, \PW\PZ, processes are simulated with \PYTHIA. The CMS detector response to the simulated events is modelled with \GEANTfour[36]. All background processes are normalized to their predicted cross sections [37, 38, 39, 40, 41].
With the requirements outlined previously,  \ttbar candidate events are selected in data. The sample composition is estimated in simulation to be 95% dileptonic \ttbar, 4% single top quark, and 1% other processes including diboson, \PW+jets, and DrellâYan production, as well as semileptonic and all-hadronic \ttbar.
0.4 Observables
The observables featured in this study have been developed for physics scenarios where undetected particles, such as neutrinos, carry away a portion of the kinematic information necessary for full event reconstruction. In the dileptonic \ttbar system, distributions in these observables contain endpoints, edges, and peak regions that are sensitive to the top quark mass. The observables are described in more detail below.
0.4.1 The observable
The  observable is defined as
[TABLE]
where and are four-vectors corresponding to a \PQbjet and lepton, respectively. The  pairs underlying each value of  are chosen out of four possible combinations by an algorithm described below. The  observable contains a kinematic endpoint that occurs when the \PQbjet and lepton are directly back-to-back in the top quark rest frame. The location of this endpoint, , is a function of the masses involved in the decay:
[TABLE]
With \GeV, \GeV[22], and , we have \GeV. Although this endpoint is a theoretical maximum on the value of  at leading order, events are still observed beyond this value due to background contamination, resolution effects, and nonzero particle widths.
The  distribution is shown in data and MC simulation in Fig. 1 (left), with a breakdown of signal and background events shown in the simulation. The âsignalâ category includes \ttbar dilepton decays where both \PQbjets are correctly identified by the \PQbtagging algorithm. The background categories include: âmistagâ dilepton decays where a light quark or gluon jet is incorrectly selected by the \PQbtagging algorithm; â decaysâ where dilepton events include at least one lepton in the final state subsequently decaying leptonically; and âhadronic decaysâ that include events where at least one of the top quarks decays hadronically. The ânon-\ttbar bkgâ category consists of single top quark, diboson, \PW+jets, and DrellâYan processes. Events in which a top quark decays through a lepton contain extra neutrinos stemming from the leptonic decay. Although the extra neutrinos cause a small distortion to the kinematic distributions, these events still contribute to the sensitivity of the measurement.
The sensitivity of the  observable to the value of is demonstrated in Fig. 1 (right), where  shapes corresponding to three values of the top quark mass in MC simulation () are shown. The variation between these shapes reveals regions of the  distribution that are sensitive to the value of , such as the edges to the left and right of the  peak, and regions that are not sensitive, such as the stationary point where the three shapes intersect. To provide a quantitative description of these effects, we introduce a âlocal shape sensitivityâ function, also known as the Fisher information density, shown in Figs. 1, 3, and 4. This function conveys the sensitivity of an observable at a specific point on its shape. For the  observable, the local shape sensitivity function peaks near the kinematic endpoint (), and has a zero value at the stationary point (). The integral of this function over its range is proportional to , where is the statistical uncertainty on a measurement of . A full description of the local shape sensitivity function is given in Appendix .11.
\PQb jet and lepton combinatorics
The two \PQbjets and two leptons stemming from each \ttbar decay give rise to a two-fold matching ambiguity, with two correct and two incorrect  pairings possible in each event. Pairings in which the \PQbjet and lepton emerge from different top quarks do not necessarily obey the upper bound described in Eq. (2), and thus do not have a clean kinematic endpoint in . Although a priori it is experimentally difficult to distinguish between correct and incorrect pairings, one possible approach is to select the smallest two  values in each event. This way, the kinematic endpoint of the distribution is preserved â even if the smallest two  values do not correspond to the correct pairings, they are guaranteed to fall below the correct pairings, which do respect the endpoint. In this analysis, we employ a slightly more sophisticated matching technique, introduced in Ref. [11], where either two or three  pairs are selected in each event.
By selecting either two or three  pairs in each event, the technique employed in this analysis has the benefit of increased statistical power, while preserving the kinematic endpoint of . Although they are not necessarily the correct pairs, the corresponding  values are guaranteed by construction to be less than or equal to those of the correct pairs. The matching technique is based on the following prescription:
match each \PQbjet with the lepton that produces the lower  value; 2. 2.
match each lepton with the \PQbjet that produces the lower  value.
This recipe produces either two or three values of . In the latter case, two different leptons may be successfully paired with the same \PQbjet, and vice versa. Such a configuration highlights the difference between this recipe and the simpler approach of choosing the smallest two values of , which do not necessarily incorporate both \PQbjets and both leptons in the event. For example, this could occur if both \PQbjets are matched to a single lepton. In these cases, the next largest  value is also needed to ensure both \PQbjets and both leptons from the event are used.
0.4.2 The observable
The  âstransverse massâ observable [7, 8] is based on the transverse mass, . The transverse mass of the W boson in a decay is given by
[TABLE]
where for , is the particle mass, and is the particle momentum projected onto the plane perpendicular to the beams. This quantity exhibits a kinematic endpoint at the parent mass, , which occurs in configurations when both the lepton and neutrino momenta lie entirely in the transverse plane (up to a common longitudinal boost).
The dileptonic \ttbar system has two layers of decays, with in the first step followed by in the second. The result is an event topology with two identical branches, and , each with a visible () and invisible () component. In this case, one value of  can be computed for each branch. The invisible particle momentum associated with each branch, however, is not known. While for a semileptonic \ttbar decay, with only one decay, the neutrino \ptvec is estimated from the \ptvecmiss in the event, a dileptonic \ttbar decay includes two neutrinos, for which the allocation of \ptvecmiss between them is unknown.
The  observable is an extension of  for a system with two identical decay branches, âaâ and âbâ, such as those in the dileptonic \ttbar system. Here, the invisible particle momenta,  and , must add up to the total \ptvecmiss. The strategy of  is to impose this constraint on the invisible particle momenta, while also performing a minimization in order to preserve the kinematic endpoint of . For a general event with a symmetric decay topology,  is defined as
[TABLE]
where and correspond to the two decay branches. If the invisible particle mass is known, it can be incorporated into the  calculation as well, yielding an endpoint at the parent particle mass. Although the final values of  and  are typically treated as intermediate quantities in the  algorithm, they are employed as neutrino \ptvec estimates in the MAOS reconstruction technique described in Section 0.4.3.
The subsystems
In the \ttbar system, there are several ways in which  can be computed, depending on how the decay products are grouped together. The  algorithm classifies them into three categories: upstream, visible, and child particles [42]. The child particles are those at the end of the decay chain that are unobservable or simply treated as unobservable. In the latter case, the child particle momenta are added to the \ptvecmiss vector. The visible particles are those whose \ptvec values are measured and used in the calculations; and the upstream particles are those from further up in the decay chain, including any initial-state radiation (ISR) accompanying the hard collision.
In general, the child, visible, and upstream particles may actually be collections of objects, creating three possible subsystems in the dileptonic \ttbar event topology. These subsystems are illustrated in Fig. 2. For simplicity, we refer to the corresponding  observables as , , and , where:
- â˘
The  observable uses the two leptons as visible particles, treating the neutrinos as invisible child particles, and combining the \PQbjets with all other upstream particles in the event.
- â˘
The  observable uses the \PQbjets as visible particles, and treats the W bosons as child particles, ignoring the fact that their charged daughter leptons are indeed observable. It considers only ISR jets as generators of upstream momentum.
- â˘
The  observable combines the \PQbjet and the lepton to form a single visible system, and takes the neutrinos as the invisible particles. A two-fold matching ambiguity results from the matching of \PQbjets to leptons in each event. In order to preserve the kinematic endpoint of the  distribution, the pair with the smallest value of  is used in each event.
These observables are identical, respectively, to , , of Ref. [42], and , , of Ref. [11].
The subsystem observable  is employed in this study to complement the observable . The  observable contains an endpoint at the value of , and can be combined with  to mitigate uncertainties due to the JES. This feature is discussed further in Section 0.5. The distribution of  and its sensitivity to the value of are shown in Fig. 3. Although  is not directly sensitive to , the neutrino \ptvec estimates that are a by-product of its computation are used as an input into the MAOS  reconstruction technique described in Section 0.4.3.
The  distribution employed in this analysis includes a kinematic requirement on the upstream momentum, defined as , where the sums are conducted over all reconstructed PF candidates, \PQbjets, and leptons in each event, respectively. The direction of  is required to lie outside the opening angle between the two \PQbjet \ptvec vectors in the event. This requirement primarily impacts events at low values of , and its effect on the statistical sensitivity of the observable is small.
0.4.3 The MAOS observable
The MAOS reconstruction technique employed in this analysis is based on the subsystem observable . In the  algorithm, an  variable, defined in Eq. (3), is constructed from the and pairs corresponding to each of the \ttbar decay branches. Because the values of neutrino \ptvec are unknown, a minimization is conducted in Eq. (4) over possible values consistent with the measured \ptvecmiss in each event.
The MAOS technique employs the neutrino \ptvecvalues that are determined by the  minimization to construct full  invariant mass estimates corresponding to each of the \ttbardecay branches. Given the neutrino \ptvec values, the remaining -components of their momenta are obtained by enforcing the W mass on-shell requirement [22]
[TABLE]
This yields a longitudinal momentum for each neutrino given by
[TABLE]
where [10]. Given these estimates for the neutrino three-momenta together with , we have the required four vectors to construct an  invariant mass corresponding to the decay products of each top quark.
The quadratic equations in Eq. (6) underlying the W mass on-shell requirement provide up to two solutions for each value of , yielding a two-fold ambiguity for each neutrino momentum. In addition, there is a two-fold ambiguity resulting from the matching of \PQbjets to pairs in the construction of  invariant masses. No matching ambiguity exists between leptons and neutrinos, since the and pairs have been fixed by the  algorithm. The combined four-fold ambiguity, along with the two top quark decays in each event, gives up to eight possible values of . In the measurement, all of the available values are used: for each pair, this includes up to two neutrino solutions, and two - matches. The distribution of MAOS  and its sensitivity to the value of are shown in Fig. 4.
0.5 Simultaneous determination of  and JSF
To mitigate the impact of JES uncertainties on the precision of this measurement, we introduce a technique that allows a JSF parameter to be fit simultaneously with . The JSF is a constant multiplicative factor that calibrates the overall energy scale of reconstructed jets. It is applied in addition to the standard JES calibration, which corrects the jet response as a function of \ptand . The dominant component of uncertainty in the JES calibration can be attributed to a global factor in jet response, which is captured in the JSF.
The challenge in determining the JSF simultaneously with stems from the large degree of correlation between these parameters. In the top quark decay, , the JSF directly affects the momentum of the \PQbjet, and indirectly, the inferred momentum of the neutrino, by scaling all jets entering the \ptmiss sum. The parameter affects the momenta of these two particles in addition to the lepton produced in the top quark decay. In the context of observables and distribution shapes, variations in the and JSF parameters cause shape changes that are difficult to distinguish. For this reason, a shape-based analysis using a single observable can be implemented to determine either or JSF, but not both simultaneously.
To determine the and JSF parameters simultaneously, we construct a likelihood function that contains two distributions corresponding to the  and  observables. In this configuration, variations in the parameters produce shifts in each individual distribution. They also create a relative shift between the distributions that provides the additional constraint needed for a simultaneous fit of and JSF. The dependence of the  and  distribution shapes on is shown in Figs. 1 and 3, and their dependence on the JSF is shown in Fig. 5. The difference in response between the  and  shapes to the JSF parameter is rooted in the reconstructed objects underlying the  and  observables â while each value of  uses one \PQbjet and one lepton, each value of  uses two \PQbjets and no leptons for the visible system. Thus,  exhibits a stronger dependence on the JSF. The likelihood fit used in this measurement is described in more detail in Section 0.7.
0.6 Gaussian processes for shape estimation
In this analysis, the , , and  distribution shapes are modeled with a GP regression technique that has two main advantages over other commonly-used shape estimate methods. First, the GP shape is nonparametric, determined only by a set of training points and hyperparameters that regulate smoothing; and second, it can be easily trained as a function of several variables simultaneously. The latter feature allows one to capture the smooth evolution of the distribution shapes as the and JSF parameters are varied. A detailed introduction to GPs can be found in Refs. [12, 13]. Here, we give a brief overview of the GP regression technique, with further discussion provided in Appendix .12.
The likelihood fit described in Section 0.7 uses distribution shapes of the form , where is the value of an observable (, , or ), and and  are free parameters in the fit. The shapes are shown in Figs. 1, 3, and 4 for each observable, where the free parameters are set to or and . In Fig. 5, shapes corresponding to the  and  observables are shown with the free parameters set to and or . In the figures, these shapes are represented as functions of a single variable (the observable ) with and  fixed. In GP regression, however, each shape is treated as a function of all three quantities (, , and ), and can be described as a probability density in three dimensions.
Each GP shape is trained using binned distributions of the observable in MC simulation. For each observable, binned distributions are used, corresponding to seven values of  ranging from to \GeVand five values of  ranging from to . Each distribution has 75 bins in , yielding a total of training points at which the value of is known and used as an input into the GP regression process. Each training point is specified by its values of , , and . The GP regression technique interpolates between the discrete values of , , and  covered by these training points to provide a shape that is smooth over its range. The smoothness properties of each shape are determined by a kernel function that is set by the analyzer. The GP shapes in this analysis correspond to the kernel function given in Eq. 18 of Appendix .12.
The binned distributions used to construct each GP shape are normalized to unity. However, the normalization of the GP shape itself may deviate slightly from unity due to minor imperfections in shape modeling. To mitigate this effect, the GP shape normalization is recomputed for each value of and JSF at which the shape is evaluated. In a likelihood fit, the normalization is recomputed for every variation of the fit parameters.
0.7 Fit strategy
This measurement employs an unbinned maximum-likelihood fit using the , , and MAOS  observables described in Section 0.4, along with the GP shape estimate technique described in Section 0.6. The MC samples used to train the GP shapes include the \ttbar signal and background processes described in Section 0.3.
The likelihood constructed from a single observable, , is given by:
[TABLE]
Here, the distribution shape depends on the value of the free parameters and , and expresses the likelihood of drawing some event where the value of the observable is . It is normalized to unity over its range for all values of and . The parameters and  are varied in the fit to maximize the value of the likelihood.
A likelihood containing two observables, and , is constructed as a product of individual likelihoods:
[TABLE]
This analysis employs three different versions of the likelihood fit:
the 1D fit uses the  and  observables to determine , and JSF is constrained to be unity; 2. 2.
the 2D fit also uses  and â but imposes no constraint on the JSF and determines and JSF simultaneously; 3. 3.
the MAOS fit uses the  and  observables to determine , and JSF is constrained to be unity.
Among these versions, the 1D fit provides the best precision on the value of . The 2D fit mitigates the JES uncertainties, which are the largest source of systematic error in the 1D approach. The MAOS fit is expected to yield results similar to the 1D fit, and is presented as a viable alternative that substitutes the  observable for MAOS . The best overall precision on is given by a combination of the 1D and 2D fits, which is discussed below. The fit results are discussed in Section 0.9.
The central value and statistical uncertainty on and JSF are determined using the bootstrapping technique [43]. This method is based on pseudo-experiments rather than the shape of the total likelihood defined in Eq. (8) near its maximum, and thus mitigates the effects of correlation between the two observables, and , in the likelihood. The technique also mitigates possible correlations within the  and  observables when multiple values of the observable occur in a single event. The bootstrapping technique is primarily relevant for statistical uncertainty determination, which may otherwise be affected by correlations in the likelihood. The technique has a negligible impact on the central values of and JSF. The bootstrap pseudo-experiments are constructed by resampling the full data set with replacement, where the size of each pseudo-experiment is fixed to have the number of events in data ( events). Events are selected at random from the full data set, so that a particular event has the same probability of being chosen at any stage during the sampling process. In this procedure, a single event may be selected more than once for any given pseudo-experiment. In data, all events have an equal probability to be selected. In simulation, the probability of selecting a particular event is proportional to its weight, containing the relevant cross sections, as well as corrections for MC modeling and object reconstruction efficiencies.
The performance of the likelihood fitting approach described above is evaluated using events in simulation, where the true values of and JSF are known. The fit is conducted using seven different values of  ranging from to \GeVfor each version of the likelihood fit. The results of this performance study are shown in Fig. 6. The likelihood fits are consistent with zero bias, showing that the GP shape modeling technique accurately captures the distribution shapes and their evolution over several values of . For this reason, no calibration of the fit is necessary for an unbiased determination of the and JSF parameters.
Combination of 1D and 2D fits
The 1D and 2D fits discussed above have differing sensitivities to various sources of systematic uncertainty in this measurement. Although the 2D fit successfully mitigates the JES uncertainties, which dominate in the 1D fit, other uncertainties in the 2D method are larger and cause the total precision to worsen (Section 0.8). The best overall precision on the value of is provided by a hybrid fit, defined as a linear combination of the 1D and 2D fits. The measured value of in the hybrid fit is given by:
[TABLE]
where the parameter  determines the relative weight between the 1D and 2D fits in the combination. The value of  and its statistical uncertainty are extracted using bootstrap pseudo-experiments, as described above. In each pseudo-experiment, the measured value of  is given by the linear combination in Eq. (9) of the measured  and  values. A value of is found to achieve the best precision on when both statistical and systematic uncertainties are taken into account. The performance of the hybrid fit, evaluated using MC samples corresponding to seven values of , is shown in Fig. 6.
0.8 Systematic uncertainties
The systematic uncertainties evaluated in this measurement are given in Table 0.8. The uncertainties include experimental effects from detector calibration and object reconstruction, and modeling effects mostly arising from the simulation of QCD processes. All uncertainties are determined by conducting the likelihood fit using events from MC simulation with the relevant parameters varied by , where is the uncertainty on a particular parameter. The difference in the measured top quark mass () or JSF () is taken to be the corresponding systematic uncertainty. For uncertainties that are evaluated by comparing two or more independent MC samples, the values of and may be subject to statistical fluctuations. For this reason, if the value of or is smaller than its statistical uncertainty in a particular systematic variation, the statistical uncertainty is quoted as the systematic uncertainty. Finally, if a systematic uncertainty is one-sided, where both and variations produce or shifts of the same sign, the larger shift is taken as the symmetric systematic uncertainty.
In the hybrid fit, the systematic uncertainties are evaluated according to the linear combination in Eq. (9). For each systematic variation, this gives . This approach provides the smallest overall uncertainty, with the largest contributions stemming from the JES, \PQbquark fragmentation modeling, and hard scattering scale. The next most precise result is given by the 1D fit, also dominated by the same sources of uncertainty. The JES uncertainties are successfully mitigated in the 2D fit. The 2D fit, however, is more sensitive to the uncertainties in the top quark \ptspectrum, matching scale, and underlying event tune, so the total systematic uncertainty for the 2D fit is larger than that of the 1D fit. The MAOS fit has a larger total systematic uncertainty than the 1D fit due to its sensitivity to the JES, top quark \ptspectrum, and \PQbquark fragmentation modeling uncertainties. Further details on each source of systematic uncertainty are given below.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Gfitter Group, âThe global electroweak fit at NNLO and prospects for the LHC and ILCâ, Eur. Phys. J. C 74 (2014) 3046, 10.1140/epjc/s 10052-014-3046-5 , ar Xiv:1407.3792 . ¡ doi â
- 2[2] G. Degrassi et al., âHiggs mass and vacuum stability in the Standard Model at NNLOâ, JHEP 08 (2012) 098, 10.1007/JHEP 08(2012)098 , ar Xiv:1205.6497 . ¡ doi â
- 3[3] A. H. Hoang and I. W. Stewart, âTop Mass Measurements from Jets and the Tevatron Top-Quark Massâ, Nucl. Phys. Proc. Suppl. 185 (2008) 220, 10.1016/j.nuclphysbps.2008.10.028 , ar Xiv:0808.0222 . ¡ doi â
- 4[4] ATLAS, CDF, CMS, and D 0 Collaborations, âFirst combination of Tevatron and LHC measurements of the top-quark massâ, (2014). ar Xiv:1403.4427 .
- 5[5] CMS Collaboration, âMeasurement of the top quark mass using proton-proton data at s đ {\sqrt{s}} = 7 and 8 Te Vâ, Phys. Rev. D 93 (2016) 072004, 10.1103/Phys Rev D.93.072004 , ar Xiv:1509.04044 . ¡ doi â
- 6[6] ATLAS Collaboration, âMeasurement of the top quark mass in the \ttbar â â \ttbar absent \ttbar\to dilepton channel from s = 8 đ 8 \sqrt{s}=8 Te V ATLAS dataâ, Phys. Lett. B 761 (2016) 350, 10.1016/j.physletb.2016.08.042 , ar Xiv:1606.02179 . ¡ doi â
- 7[7] A. J. Barr and C. G. Lester, âA review of the mass measurement techniques proposed for the Large Hadron Colliderâ, J. Phys. G 37 (2010) 123001, 10.1088/0954-3899/37/12/123001 , ar Xiv:1004.2732 . ¡ doi â
- 8[8] C. G. Lester and D. J. Summers, âMeasuring masses of semi-invisibly decaying particle pairs produced at hadron collidersâ, Phys. Lett. B 463 (1999) 99, 10.1016/S 0370-2693(99)00945-4 , ar Xiv:hep-ph/9906349 . ¡ doi â
