Search for supersymmetry in the all-hadronic final state using top quark tagging in pp collisions at sqrt(s) = 13 TeV
CMS Collaboration

TL;DR
This paper reports a search for supersymmetry in all-hadronic final states with missing transverse momentum and top quark tagging using CMS data at 13 TeV, setting limits on new particle masses.
Contribution
It introduces a novel search strategy combining top quark tagging and jet properties to improve sensitivity to supersymmetric particles.
Findings
No significant excess observed beyond standard model expectations.
Excluded top squark masses up to 740 GeV and neutralino masses up to 240 GeV.
Excluded gluino masses up to 1550 GeV and neutralino masses up to 900 GeV.
Abstract
A search is presented for supersymmetry in all-hadronic events with missing transverse momentum and tagged top quarks. The data sample was collected by the CMS detector at the LHC and corresponds to an integrated luminosity of 2.3 inverse femtobarns of proton-proton collisions at a center-of-mass energy of 13 TeV. Search regions are defined using the properties of reconstructed jets, the multiplicity of bottom and top quark candidates, and an imbalance in transverse momentum. With no statistically significant excess of events observed beyond the expected contributions from the standard model, we set exclusion limits at 95% confidence level on the masses of new particles in the context of simplified models of direct and gluino-mediated top squark production. For direct top squark production with decays to a top quark and a neutralino, top squark masses up to 740 GeV and neutralino masses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\cmsNoteHeader
SUS-16-009
\RCS
\RCS \RCS
\cmsNoteHeader
SUS-16-009
Search for supersymmetry in the all-hadronic final state using top quark tagging in collisions at
Abstract
A search is presented for supersymmetry in all-hadronic events with missing transverse momentum and tagged top quarks. The data sample was collected with the CMS detector at the LHC and corresponds to an integrated luminosity of 2.3\fbinvof proton-proton collisions at a center-of-mass energy of 13\TeV. Search regions are defined using the properties of reconstructed jets, the multiplicity of bottom and top quark candidates, and an imbalance in transverse momentum. With no statistically significant excess of events observed beyond the expected contributions from the standard model, we set exclusion limits at 95% confidence level on the masses of new particles in the context of simplified models of direct and gluino-mediated top squark production. For direct top squark production with decays to a top quark and a neutralino, top squark masses up to 740\GeVand neutralino masses up to 240\GeVare excluded. Gluino masses up to 1550\GeVand neutralino masses up to 900\GeVare excluded for a gluino-mediated production case, where each of the pair-produced gluinos decays to a top-antitop quark pair and a neutralino.
0.1 Introduction
The standard model (SM) of fundamental particles and their interactions has been extremely successful in describing phenomena in the atomic and subatomic realms. The discovery of a boson with properties consistent with the SM Higgs boson [1, 2, 3] at the CERN LHC [4] further strengthened this model. Assuming that the Higgs boson is a fundamental spin-0 particle, however, the low value of its measured mass, around 125\GeV [5], implies that there is a fine-tuned cancellation of large quantum corrections to its mass, which is referred to as the hierarchy problem and is currently unexplained [6, 7, 8, 9, 10]. Supersymmetry (SUSY) [11, 12, 13, 14, 15, 16, 17, 18, 19, 20] is one of the most compelling models of new physics as it provides an elegant mechanism to mitigate the hierarchy problem by introducing a symmetry between fermions and bosons.
Supersymmetry proposes a superpartner for each SM particle with the same quantum numbers, except for spin, which differs by a half-integer. The SM particles and their corresponding superpartners contribute to the loop corrections to the Higgs boson mass with opposite sign [21], and are therefore capable of controlling these corrections. This behavior can persist despite the breaking of SUSY, which is required to accommodate the lack of observation of superpartners with exactly the same masses as their SM counterparts. To solve the hierarchy problem in a “natural” way, Refs. [22, 23, 24, 25, 26, 27] suggest models in which the higgsino mass parameter is of the order of 100\GeVand the masses of the top squark , the bottom squark , and the gluino are near the TeV scale, while the masses of the other sparticles can be beyond the reach of the LHC. The mass of the top squark is particularly constrained in “natural” SUSY models as it is the most important factor in cancelling the top quark contribution to the Higgs boson mass. In -parity conserving models [28], superpartners are produced in pairs, and the lightest SUSY particle (LSP) is stable. Models with a weakly interacting neutralino () as the LSP are especially attractive because the can have properties consistent with dark matter [29].
Based on these considerations, we perform a search for top squarks, produced either directly or through gluino decays, with each top squark decaying into a stable and SM particles. Previous searches at the LHC in proton-proton collisions at have found no evidence for physics beyond the SM, and lower limits have been placed on the top squark mass within the framework of simplified models of the SUSY particle spectrum (SMS) [30, 31, 32, 33, 34]. The particle spectra in such models are typically restricted to states that are required for natural SUSY scenarios. Lower limits on the top squark mass, , extend up to [35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45], and those on the gluino mass, , extend up to 1400\GeV [46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57]. Lower limits on the neutralino mass, , extend up to for models with direct top squarks production and up to for models with gluino-mediated production. Recent searches in proton-proton collisions at have further extended these lower limits, reaching up to [58, 59, 60] for the top squark mass, up to for the gluino mass, and up to for the neutralino mass [61, 62, 63, 64, 65].
The search presented in this paper is performed on data collected with the CMS detector at the LHC and corresponding to an integrated luminosity of of proton-proton collisions at a center-of-mass energy of 13\TeV. The search strategy closely follows the one reported in Ref. [41] with several improvements. We select events containing large missing transverse momentum, at least four jets, at least one jet identified as originating from the hadronization of a quark (“ jet”), and no identified leptons. The analysis relies on a highly efficient algorithm to tag groups of jets consistent with top quark decay. This top quark tagging algorithm is improved relative to the one described in Ref. [41], to enhance the sensitivity for selecting top quarks with large Lorentz boosts that cause the merging of jets among the top decay products. The analysis categorizes each event according to the number of identified top quark candidates, in order to both discriminate signal from background and to distinguish among signal hypotheses such as direct top squark production and gluino-mediated top squark production, which contain different multiplicities of top quarks in the final state. In addition, the kinematic properties of top quark candidates are used as input to the computation of the “stransverse” mass () variable [66, 67], which is used to estimate the mass of pair-produced particles in the presence of invisible particles. Exclusive search regions are defined using several event properties, including the number of identified jets, the number of top quark candidates, the missing transverse momentum , and .
One of the major sources of SM background originates from either top-antitop quark pair () or \PW+jets events in which leptonic \PW boson decay produces a charged lepton that is not reconstructed or identified, and a high momentum neutrino, generating true missing transverse momentum. Events in which a \Z boson, produced in association with jets, decays to neutrinos () also provide a significant contribution to the SM background. The SM backgrounds are estimated using control samples in the data that are disjoint from the signal regions but have similar kinematic properties and composition.
This paper is structured as follows. Event reconstruction and simulation are described in Sec. 0.2. Sec. 0.3 presents details of the optimization of the analysis, including signal models, the top quark tagging algorithm, and event categorization. The strategy used to estimate the SM background is detailed in Sec. 0.4. The results and their interpretation in the context of SUSY are discussed in Sec. 0.5, followed by a summary in Sec. 0.6.
0.2 Detector, event reconstruction, and simulation
0.2.1 Detector and event reconstruction
The CMS detector is built around a superconducting solenoid of 6\unitm internal diameter, providing a magnetic field of 3.8\unitT. Within the solenoid volume are a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter (ECAL), and a brass and scintillator hadron calorimeter (HCAL). The tracking detectors cover . The ECAL and HCAL, each composed of a barrel and two endcap sections, extend over a pseudorapidity range . Forward calorimeters on each side of the interaction point encompass . Muons are identified and measured within by gas-ionization detectors embedded in the steel flux-return yoke outside the solenoid. The first level of the CMS trigger system, composed of custom hardware processors, uses information from the calorimeters and muon detectors to select the most interesting events in a fixed time interval of less than 4\mus. The high-level-trigger processor farm further decreases the event rate from around 100\unitkHz to less than 1\unitkHz before data storage. A more detailed description of the CMS detector, together with a definition of the coordinate system used and the relevant kinematic variables, can be found in Ref. [68].
The recorded events are reconstructed using the particle-flow (PF) algorithm [69]. Using the information from the tracker, calorimeters, and muon system, this algorithm reconstructs PF candidates that are classified as charged hadrons, neutral hadrons, photons, muons, or electrons. The \ptvecmissis defined as the negative of the vector sum of the transverse momentum \ptof all PF candidates in the event, and its magnitude is denoted by \MET. The PF candidates in an event are clustered into jets using the anti-\ktclustering algorithm [70] with size parameter (AK4 jets). Charged particles from additional collisions (“pileup”) from the same or adjacent beam crossing to the one that produced the primary hard-scattering process are excluded if they do not originate from the primary interaction vertex, i.e., the vertex with the largest calculated from all its associated tracks. The momentum of neutral particles from pileup interactions, and from the underlying event, is subtracted using the \FASTJETtechnique, which is based on the calculation of the -dependent transverse momentum density, evaluated event by event [71, 72]. The energy and momentum of each jet are corrected using factors derived from simulation, and, for jets in data, an additional residual energy-momentum correction is applied to account for differences in the jet energy-momentum scales [73] between simulations and data. Only jets with and or , depending on the use case, are considered in this search. The scalar sum of the jet \ptfor all jets within is denoted by \HTin the following.
A jet is considered to be a jet (“-tagged”) if it passes the medium operating point requirements of the combined secondary vertex algorithm [74, 75], has , and is within . The corresponding quark identification efficiency is 70% on average per jet in \ttbarevents. The probability of a jet originating from a light quark or gluon to be misidentified as a quark jet is 1.4%, averaged over jet \ptin \ttbarevents [74].
Muons are reconstructed by matching tracks in the muon detectors to compatible track segments in the silicon tracker [76] and are required to be within . Electron candidates are reconstructed starting from clusters of energy deposited in the ECAL that are then matched to a track in the silicon tracker [77]. Electron candidates are required to have or to avoid the transition region between the ECAL barrel and the endcap. Muon and electron candidates are required to originate from within 2\unitmm of the primary vertex in the transverse plane and within 5\unitmm along the axis.
To obtain a sample of all-hadronic events, events with isolated electrons and muons are vetoed. The isolation of electron and muon candidates is defined as the of all additional PF candidates in a cone around the lepton candidate’s trajectory with a radius . The cone size depends on the lepton \ptas follows:
[TABLE]
The cone radius for higher-\ptcandidates is reduced because highly boosted objects, which may include high-\ptleptons in their decay, are contained in a cone of smaller radius than low-\ptobjects. The isolation sum is corrected for contributions originating from pileup interactions using an estimate of the pileup energy in the cone. A relative isolation is defined as the ratio of the isolation sum to the candidate \pt, and is required to be less than 0.1 (0.2) for electron (muon) candidates. Events with isolated electrons (muons) that have and (2.4) are rejected.
In order to further reduce the contribution from background events originating from leptonic \PW boson decays that feature low-\ptelectrons, muons, or hadronically decaying taus (), an additional veto on the presence of isolated tracks is used. These tracks are required to have , , and relative track isolation less than 0.2 (0.1) when they are identified by the PF algorithm as electrons or muons (charged hadrons). The isolation sum used to compute the relative track isolation is the of all additional charged PF candidates within a fixed cone of around the track. To preserve signal efficiency, this veto is applied only if the transverse mass () of the isolated track and system is consistent with a \PW boson decay. The is defined as
[TABLE]
with the \ptof the track and the azimuthal separation between the track and \ptvecmissvector. Specifically, we require .
0.2.2 Event simulation
Monte Carlo (MC) simulated event samples are used to study the properties of the SM background processes, as well as the signal models. The \MADGRAPH5_a\MCATNLOv2.2.2 generator [78] is used in leading-order (LO) mode to simulate events originating from production, \PW+jets with decays, \cPZ+jets with decays, Drell-Yan (DY)+jets, +jets, quantum chromodynamics (QCD) multijet, gluino pair production, and top squark pair production processes. The generation of these processes is based on LO parton distribution functions (PDFs) using NNPDF3.0 [79]. Single top quark events produced in the channel are generated with the next-to-leading-order (NLO) \POWHEGv1.0 [80, 81, 82, 83] generator. Rare SM processes, such as and , are generated at NLO accuracy with the \MADGRAPH5_a\MCATNLOv2.2.2 program. Both the single top quark and rare SM processes are generated using NLO NNPDF3.0 PDFs. The parton showering and hadronization is simulated with \PYTHIA v8.205 [84] using underlying-event tune CUETP8M1 [85].
The CMS detector response is simulated using a \GEANTfour-based model [86] in the case of SM background processes and a dedicated fast simulation package [87] for the case of signal processes, where a large number of signal model scenarios are needed. The fast simulation is tuned to provide results that are consistent with those obtained from the full \GEANTfour-based simulation. Event reconstruction is performed in the same manner as for collision data.
The signal production cross sections are calculated using NLO plus next-to-leading-logarithm (NLL) calculations [88]. The most precise available cross section calculations are used to normalize the SM simulated samples, corresponding to NLO or next-to-NLO accuracy in most cases [78, 89, 90, 91, 92, 93, 94, 95].
The simulation is corrected to account for discrepancies between data and simulation in the lepton selection efficiency and the tagging efficiency. The uncertainties corresponding to these corrections are propagated to the predicted SM yields in the search regions. Differences in the efficiencies for selecting isolated electrons and muons are measured in events. Correction factors and their uncertainties for the tagging efficiency are derived using multijet- and \ttbar-enriched event samples and are parametrized by the jet kinematics [74].
0.3 Analysis strategy
The analysis is designed for maximum sensitivity to models in which top quarks are produced in the SUSY decay chains discussed in Sec. 0.1. The data are first divided into regions based upon the numbers of tagged top quarks () and jets () found in each event. The search regions are defined by further subdivision of each , bin in several \METand bins.
0.3.1 Benchmark signal models
For direct top squark pair production, we consider two decay scenarios within the SMS framework. In the scenario denoted by “T2tt,” each decays via a top quark: , in which is the LSP. The second decay scenario considered here, denoted by “T2tb,” involves two decay modes, (as in T2tt) and , each with a 50% branching fraction. In the latter case, the lightest chargino decays with 100% branching fraction to a virtual boson and a . A natural simplified SUSY spectrum is assumed in which the is 5\GeVheavier than the [24, 25, 26]. As a result of the mixed decay modes, the T2tb scenario consists of three different final states containing either two quarks and no top quarks (25%), one quark and one top quark (50%), or two top quarks and no quarks (25%). Figure 1 shows the diagrams representing these two simplified models.
Two scenarios are considered for gluino-mediated top squark production, as shown in Fig. 2. In the main model, denoted by “T1tttt,” the gluino decays to top quarks via an off-shell top squark: . This model is complementary to the direct top squark production because it gives sensitivity to the scenario where the gluino is kinematically accessible but the top squark is too heavy for direct production. The second scenario, denoted by “T5ttcc,” features on-shell top squarks in the decay chain with a mass difference between top squark and LSP assumed to be . For this model, the gluino decays to a top quark and a top squark, , and the top squark decays to a charm quark and the LSP, . This model again serves as a complement to the direct search by providing sensitivity to very light top squarks, which would not decay to on-shell top quarks.
All scenarios described above share similar final states, containing two neutralinos and up to four top quarks. Given that the is stable and only interacts weakly, it does not produce a signal in the detector. Therefore, is one of the most important discriminators between signal and SM background, especially for models with large mass differences between the top squark or gluino and the . Since top quarks decay almost exclusively to a quark and a boson, each hadronically decaying top quark can result in up to three identified jets, depending on the top quark \ptand jet size. For certain signal scenarios, there may be additional bottom, charm, or light-flavor quarks, which increase the expected jet and -tagged jet multiplicities.
0.3.2 Top quark reconstruction and identification
The procedure to reconstruct and identify the hadronically decaying top quarks (top quark tagging or “ tagging”) presented here is similar to the one used in Ref. [41], where reconstruction of the hadronically decaying top quarks from resolved jets is performed as described in Refs. [96, 97, 98]. The tagging algorithm is improved in this work, to be more sensitive to boosted scenarios in which decay products from the boson or top quark are merged into a single jet. Additionally, the algorithm is expanded to allow the reconstruction of multiple top quarks in each event.
The top quark tagging algorithm takes as input all reconstructed AK4 jets that satisfy and . These jets are clustered into three categories of top quark candidates: trijet, dijet, and monojet. Trijet candidates, representing the three jets coming from the quark and the hadronic decay of the \PW boson, are subject to the following conditions: (i) All jets lie within a cone of radius , centered at the direction defined by the vector sum of the momentum of the three jets. The radius requirement implies a moderate Lorentz boost of the top quark, as is expected for the vast majority of signal parameter space targeted in this search. (ii) To reduce combinatoric backgrounds, one of the ratios of dijet to trijet masses must be consistent with the ratio [97]. The trijet system must satisfy one of the following three (overlapping) criteria:
[TABLE]
Here, , , and are the dijet masses, where the jet indices 1, 2, and 3 reflect a decreasing order in \pt. The numerical constants have values and , with and [99]. Assuming massless input jets and trijet mass , each of the three criteria can be reduced to the condition that the respective ratio of , or is within the range of .
The second category of top quark candidates is clustered from just two jets and is designed to tag top quark decays in which the boson decay products are merged into a single jet ( jet). The jet mass is used to determine if a jet represents a jet with a required mass window of –. Additionally, the dijet system is required to pass the requirement:
[TABLE]
where is the mass of the candidate jet and is the mass of the dijet system. and are the same as for the trijet requirements. The final category of candidates, monojets, are constructed from single jets which have a jet mass consistent with , \ie, in the range of 110–220\GeV.
After all possible top quark candidates are constructed, the final list of reconstructed top quark objects is determined by making requirements on the total mass of the object and the number of jets. Any top quark candidate with more than one jet is rejected because the probability of having two genuine jets, or having a second light-flavor jet tagged as a jet, in a single top quark candidate is negligible. All candidates with a mass outside the range – are rejected. The list of candidates is pruned to remove candidates that share a jet with another candidate, in favor of the candidate with the mass closer to the true top quark mass. However, if there is only one jet in the event, the top quark candidate with the best match to the true top mass may be pruned if it contains the jet to ensure that there are two objects for the calculation (described below).
By considering not only fully resolved (trijet) top quark decays, but also decays from boosted top quarks, manifesting themselves as dijet or monojet topologies, this tagger achieves a high efficiency for tagging top quarks over a wide range of top quark \ptvalues, from at to close to 85% at . The tagging efficiency is determined using the T2tt signal model with and since it has a wide top quark spectrum. The tagging efficiency was also measured using SM \ttbarbackground and other signal models, and was found to agree with the T2tt measurement within statistical uncertainties. The event sample used to measure the tagging efficiency was selected by requiring the presence of at least four jets with and . The -tagged object must be matched to a hadronically decaying generator-level top quark within a cone of radius 0.4 in () space. The tagging efficiency as a function of top quark \ptis shown in Fig. 3, which also includes the expected \ptdistributions for the hadronically decaying top quark in SM \ttbarevents, as well as in various signal models. Since the top quark \ptspectrum for signal events depends strongly on and , the good tagging efficiency across the top quark \ptspectrum ensures high acceptance for a wide range of signal models. The tagging efficiency for a previous algorithm, described in Ref. [41], as evaluated from simulation, is about 20% at top quark and drops quickly to close to 0 for higher top quark \pt. Figure 3 shows that the top quark tagger performance has substantially improved with respect to that used in Ref. [41]: the efficiency is about 55% at , and it rises with increasing \pt.
The purity of the tagger, computed as the percentage of -tagged objects that can be matched to a hadronically decaying generator-level top quark within a cone of radius 0.4 in () space, is 70–90% in \ttbarevents that satisfy and contain at least four jets, at least one of which is \PQb-tagged. The probability that an event that does not contain hadronically decaying top quarks will be found to contain one or more -tagged objects is about 30–40% for events passing the selection used for the efficiency calculation. Further details on the tagger performance are presented in App. .7. The event yields of these processes, as well as from the \ttbarprocess, are further reduced by placing requirements on the “stransverse mass” variable, , discussed below, as a complement to the top quark tagging requirements. The top quark tagging efficiency agrees well between data and the \GEANTfour-based simulation as shown in App. .7. However, a correction factor of up to 5% is needed to account for discrepancies between the fast simulation and the \GEANTfour-based simulation. It is derived using the same T2tt signal model mentioned above and is parametrized as a function of top quark candidate \pt.
The variable [66, 67] is an extension of the transverse mass variable that is sensitive to the pair production of heavy particles, \eg, gluinos or top squarks, each of which decays to an invisible particle. For direct top squark production, has a kinematic upper limit at the \sTopmass, whereas for \ttbarproduction the kinematic upper limit is the top quark mass. For gluino pair production, the interpretation of depends on the decay scenario. However, the values of are consistently larger than those for \ttbaror other SM backgrounds due to the larger values of \METand the high \ptof the top quarks produced in gluino decays. The variable is defined for two heavy particles, denoted with subscripts and , decaying to some visible particles and an invisible particle () as:
[TABLE]
where and are the transverse momentum and mass of the visible daughters of each heavy particle, and and represent the unknown transverse momentum and mass of the invisible from each heavy particle decay. The transverse mass squared, , is defined as
[TABLE]
The variable is the minimum [66] of two transverse masses with the constraint that the sum of the transverse momenta of both neutralinos is equal to the in the event, i.e., . The invisible particle is assumed to be massless, in order to be consistent with the use of the neutrino as the invisible particle in the calculation for the SM backgrounds, therefore equals zero in Eqs. (5) and (6).
We construct the visible decay products of each heavy particle ( and ) from the list of -tagged objects. The selection requirements used in the analysis ensure that every event has at least one reconstructed -tagged object. In the case where two -tagged objects are identified, each is used as one visible component in the calculation. If more than two -tagged objects are found, is calculated for all combinations and the lowest value is used. In the case where only one -tagged object is identified, the visible component of the second system is taken from the remaining jets not included in the -tagged object, using a -tagged jet as a seed to partially reconstruct a top quark. The -tagged jet is combined with the closest jet that yields an invariant mass between 50\GeVand . The combined “dijet” is used as the second visible system. In case no jet combination satisfies that invariant mass requirement, the -tagged jet is used as the only remnant of the second visible system.
0.3.3 Event selection and categorization
Events in the search regions are collected with a trigger that applies a lower threshold of on \HTin coincidence with a threshold of 100\GeVon \MET. This trigger is fully efficient at selecting events satisfying the requirements and , both at the full event reconstruction level.
All events must pass filters designed to remove detector- and beam-related noise. All jets considered in this analysis are required to have , and must pass a set of jet identification criteria as described in Ref. [100]. The minimum number of such jets with in an event must be , with the leading two jets required to have . Events must satisfy and , where the thresholds are chosen to exceed the trigger efficiency turn-on and to allow a low sideband for background studies. A requirement on the angle between \METand the first three leading jets, , 0.5, 0.3, is applied to reduce the number of events from QCD multijet processes. High-\METQCD multijet events are usually the result of an undermeasurement of the \ptof one of the leading jets, which results in \METbeing aligned with that jet and being small. The undermeasurement can occur because of detector effects or, in the case of semileptonic or quark decays, because a neutrino carries away unmeasured energy. Finally, requirements that , , and \GeVare applied, after which we observe 288 events in the data.
After this preselection, we define nonoverlapping search regions in terms of , , \MET, and . Figure 4 displays the background composition, as computed from simulation, following the preselection as a function of each of these four variables. Note that the -tagged object definition does not require the presence of -tagged jets, nor are -tagged jets inside -tagged objects rejected from the -tagged jet counting. Thus there is not a one-to-one correspondence between the numbers of -tagged objects and -tagged jets in an event. Two different analysis optimizations are used to get the best sensitivity for direct top squark production models (T2tt and T2tb) versus gluino-mediated production models (T1tttt and T5ttcc). For direct top squark production models, the multiplicities of -tagged jets and -tagged objects are binned as , and , . Due to the possibility of having more than two top quarks in the decay chain, the gluino-mediated production models are interpreted using bins with , , and , , . To improve background suppression, in particular of the \ttbarcontribution, and to improve the sensitivity to the various signal topologies, each (, ) bin is further subdivided by placing requirements on the \METand variables, as shown in Figs. 5 and 6. These figures also list the search region bin numbers used throughout the paper. The subdivision of any given (, ) bin according to the \METand variables is the same for both the direct top squark and the gluino-mediated production optimizations.
0.4 Background estimation
About 70% of the expected SM background (integrated over all search bins) comes from \ttbar, \PW+jets, and single top quark events with leptonic boson decays. If the boson decays to a lepton that decays hadronically, this lepton is reconstructed as a jet and passes the lepton vetoes. If, on the other hand, the boson decays to an electron or muon, events can survive the lepton vetoes when the electron or muon is “lost,” \ie, is not isolated, not identified/reconstructed, or out of the acceptance region. The remaining SM background contributions, in order of decreasing importance, originate from the +jets, QCD multijet, and other rare processes such as triboson and production. The \ttbar, \PW+jets, single top quark, and QCD multijet backgrounds are determined using data-driven methods and are validated with closure tests in the simulation. The +jets background is estimated using simulated events that are weighted to match the data in control regions. Small contributions from and other rare processes are estimated directly from simulated events. The background estimation methods are presented in the following subsections.
0.4.1 Estimation of the lost-lepton background
The contribution to the background from events with lost leptons (LL) is determined from a data control sample (CS) that consists mainly of events. This CS is collected using the search trigger and is defined to match the preselection, but the muon veto is replaced by the requirement that there be exactly one well-identified and isolated muon with and , and the isolated track veto is removed. To reduce possible signal contamination in this CS, only events with less than 100\GeVare considered, with reconstructed from the muon \ptand as described for tracks in Eq. (2). For \ttbar, \PW+jets, and single top quark events with one decay, originates from the produced neutrino. This means that the distribution represents the transverse mass and falls off sharply above ; however, this is not the case for signal events.
The predicted number of events with lost leptons, , originating from the \ttbar, \PW+jets, and single top quark processes contributing to each search region bin is calculated as
[TABLE]
where is the sum over the events measured directly in the corresponding bin of the single muon CS defined above. The factors , , and convert the number of events in the CS to the number of LL events due to isolation, reconstruction and identification, and acceptance criteria (typical values are, respectively, around , , and ). These scale factors are determined from isolation and reconstruction efficiencies, as well as the acceptance, which are obtained for each search region bin using simulated \ttbarevents. The contribution to the signal region from dilepton \ttbarevents where both leptons are lost is corrected with the term ( for muons and for electrons). The CS is normalized by the factor (around ) to compensate for the efficiency of the requirement. Finally, the isolated track veto efficiency factor, , is applied to get the final number of predicted LL background events. The isolated track veto efficiency, \ie, the fraction of events surviving the isolated track veto, is around 60%.
The main systematic uncertainty for the LL background prediction is derived from a closure test, which assesses whether the method can correctly predict the background yield in simulated event samples. The test is performed by comparing the LL background in the search regions, as predicted by applying the LL background determination procedure to the simulated muon CS, to the expectation obtained directly from \ttbar, single top quark, and \PW+jets simulation. The result of the closure test for the 45 search bins optimized for gluino-mediated production is shown in the top plot of Fig. 7. The closure test uncertainty (up to 26%, depending on the search bin) is dominated by statistical fluctuations and included as a systematic uncertainty in the LL background prediction. The closure uncertainties for the 37 search bins optimized for direct top squark production are of similar size. The following other sources of systematic uncertainty are also included: lepton isolation efficiency (effect on prediction is between and ), lepton reconstruction and identification efficiency ( to ), lepton acceptance from uncertainty in the PDFs (about ), control sample purity (), corrections due to the presence of dilepton events (around ), efficiency of the selection (less than ), and isolated-track veto ( to ).
0.4.2 Estimation of the hadronically decaying lepton background
Events from \ttbar, \PW+jets, and single top quark processes in which a lepton decays hadronically () are one of the largest components of the SM background contributing to the search regions. When a boson decays to a neutrino and a , the presence of neutrinos in the final state results in \ptvecmiss, and the event passes the lepton veto because the hadronically decaying lepton is reconstructed as a jet. A veto on isolated tracks is used in the preselection to reduce the background with a minimal impact on signal efficiency.
The estimate of the remaining background is based on a CS of +jets events selected from data using a trigger with requirements on both muon \ptand , and a requirement of exactly one muon with and . An upper threshold on the transverse mass reconstructed from the muon and \MET, , is required to select events containing a decay and to suppress signal events contaminating the +jets sample. Since both +jets and \tauh+jets production arise from the same underlying process, the hadronic component of the events is expected to be the same, aside from the response of the detector to a muon or . The muon \ptis smeared by response template distributions derived for a hadronically decaying lepton to correct the leptonic part of the event. The response templates are derived using \ttbar, \PW+jets, and single top quark simulated samples by comparing the true lepton \ptwith the reconstructed \tauhjet \pt. The kinematic variables of the event are recalculated with this jet, and the search selections are applied to predict the \tauhbackground.
The probability to mistag a jet as a jet is significant (about 0.1) and affects the distribution of background events. The dependence of the mistag rate on the jet \ptis larger for \ttbarevents than for \PW+jets events, because the quark from the top quark decay can overlap with the jet. This mistag rate is taken into account in the +jets CS by randomly selecting a simulated jet and counting it as a jet with the probability obtained from MC simulation in \PW+jets events for the corresponding jet \pt.
The background prediction is calculated as follows:
[TABLE]
where the first summation is over the events in the +jets CS, the second is over the bins of the response template, and is the probability of the response from each bin. The various correction factors applied to convert +jets events into +jets events to construct the final sample are:
- •
the branching fraction ratio = 0.65;
- •
the muon reconstruction and identification efficiency (0.94–0.98) and the muon isolation efficiency (0.5–0.95 depending on the muon and the of PF candidates within an annulus with outer radius of and inner radius equal to the isolation cone);
- •
the muon acceptance (typically around 0.8–0.9);
- •
the selection efficiency ();
- •
the correction to account for the contamination in the CS from muons from decays, (around 0.8 depending on and \MET);
- •
the isolated track veto efficiency for , (around 0.7), as determined from simulated \ttbar, \PW+jets and single top quark events by matching isolated tracks to jets;
- •
the contribution that overlaps with the LL background prediction due to contamination of dileptonic events in the CS, , to avoid double counting (0.98);
- •
and a correction for the trigger efficiency, (0.95).
The muon reconstruction, identification, and isolation efficiency are the same as those used for the LL background determination.
A closure test is performed comparing the \tauhbackground in the search regions as predicted by applying the \tauhbackground determination procedure to the simulated muon CS to the expectation obtained directly from simulation. The result of the closure test for the 45 search bins optimized for gluino-mediated production is shown in the lower plot of Fig. 7. The closure uncertainty for each search bin (between 2% and 28%) is dominated by statistical fluctuations and is included as a systematic uncertainty in the \tauhbackground prediction. The closure uncertainties for the 37 search bins optimized for direct top squark production are of similar size. In addition, systematic uncertainties are evaluated for each of the ingredients in the prediction, which arise from uncertainties in the following sources: the response template (2%), the muon reconstruction and isolation efficiency (1%), the acceptance due to uncertainties in the PDFs (up to 5%), the mistag rate of the jet (up to 15%), due to uncertainties in the \METscale (), the efficiency of the isolated track veto (–), contamination from lost leptons (2.4%), and the trigger efficiency (1%).
0.4.3 Estimation of the background
The background is derived using simulated events that have been corrected for observed differences between data and simulation. A control sample is used to validate the MC and residual differences in both shape of the jet multiplicity () distribution and overall normalization present therein are corrected for. The central value of the background prediction for each search bin can be written as
[TABLE]
where is the predicted number of background events in search bin . The sum runs over all simulated events that fall in search bin , and is a standard event weight including the assumed cross section, the integrated luminosity, the tagging efficiency scale factors, and the measured trigger efficiency. Each simulated event is additionally weighted using two scale factors, and , that correct the normalization of the simulation and the shape of the simulated distribution, respectively. Both scale factors are calculated in a dimuon CS that has events with two muons, with , and no muon or isolated track vetoes. In this region the two muons are treated as if they were neutrinos.
The first scale factor, , is derived using a tight dimuon CS in data. This control region has the same selection as the search region preselection, apart from the muon requirement and without any requirements on -tagged jets. This region is selected for its kinematic similarity to the signal region, but lacks the statistical precision required for shape comparison. The scale factor is computed by comparing the expected event yield in the tight region in the DY simulation with the observed event yield in data after subtraction of the other SM processes.
The second scale factor, , depends on the number of jets in the event and is designed to correct the mismodeling of the jet multiplicity distribution in simulation. The scale factor is derived in a loose dimuon control region in which the signal region requirements on \MET, , and are removed, and the \HTrequirement is relaxed to . The scale factor is derived for each bin as the ratio between the data, with non-DY backgrounds subtracted, and the DY simulation. Due to \ttbarcontributions similar to the DY processes for greater jet and -tagged jet multiplicities, the \ttbarMC events are similarly reweighted using a CS selected to have an electron and a muon with before subtraction from the dimuon data. The and \METdistributions in the loose dimuon CS after applying the scale factor are shown in Fig. 8. The distribution agrees well between data and simulation, whereas the \METdistribution has some disagreement between 300 and 600. The disagreement is taken into account with a shape uncertainty equal to the magnitude of the disagreement and has a negligible effect on the final results.
The systematic uncertainties for the background prediction are divided into two broad categories: uncertainties associated with the use of MC simulation and uncertainties specifically associated with the background prediction method. The first category includes systematic uncertainties in the PDFs and renormalization/factorization scale choices, jet and \METenergy scale uncertainties, tagging efficiency scale factor uncertainties, and trigger efficiency uncertainties. The second category includes uncertainties from the method used to determine and the scale factors, and uncertainties based on the residual shape disagreement between data and DY+jets simulation in the loose dimuon CS. The uncertainty in , derived from the statistical uncertainties on data and MC in the tight CS, results in a 19% uncertainty in the predicted event yield for each search bin. The uncertainties associated with are the dominant uncertainties and are related to residual shape uncertainties (after applying the scale factor) in the search region variables \MET, , , and . These uncertainties are evaluated in the loose CS with the additional requirement that so that is well defined. The resulting shift of the central value of the search bin predictions is used as the systematic uncertainty from the residual shape disagreements. Depending on the search bin, this uncertainty ranges between 10 and 82%. The statistical uncertainties in the ratios between data and simulation, as well as in , are also included as a 15–75% systematic uncertainty in the prediction.
0.4.4 Estimation of the QCD multijet background
The procedure to predict the QCD multijet background consists of selecting a signal-depleted data CS, rich in QCD multijet events, from which significant contributions of other SM backgrounds, such as \ttbar, \PW+jets, and \cPZ+jets, are subtracted. Following that, a translation factor, partly determined from data and partly from simulation, is used to convert the number of events measured in the data CS into a prediction for each search region bin.
The CS is defined by applying the full set of preselection requirements described in Sec. 0.3.3, except that the requirements are inverted, requiring that the \METbe aligned with one of the leading three jets. The estimated number of QCD multijet events in the inverted- CS is computed by subtracting the contributions from LL, hadronically decaying leptons, and \cPZ+jets processes from the number of data events observed in that region. The same methods as described in the previous sections are used to estimate the contributions from LL and processes, but applied to this QCD multijet-rich CS. Simulation is used to estimate the contribution from events, since it is expected to be small.
The translation factor between the QCD multijet-rich CS and the search region bins is computed in data, using a sideband of the preselection region, defined by the requirement and without an requirement, where the amount of data is sufficiently large to make an accurate measurement. The contributions from processes other than QCD multijet are subtracted from the observed number of events in this low-\METdata sideband, following the procedure outlined above. The dependence of the translation factor as a function of \METis accounted for by using a linear approximation derived from simulation. To take into account the dependence as a function of , the translation factor is computed separately for values below and above . The translation factor ranges from 0.01 to 0.14 depending on \METand .
The main systematic uncertainty in the QCD multijet prediction is obtained from a closure test in which the expectation for the signal region event yields, as obtained directly from the QCD multijet simulation, is compared to the prediction obtained by applying the QCD multijet background prediction procedure to simulated event samples. The result for the 45 search bins optimized for gluino-mediated production is shown in Fig. 9, and any observed nonclosure from the relaxed \METand requirements is taken into account as the systematic uncertainty. If there is insufficient simulation to populate a bin in the closure prediction, the uncertainty from the next lowest \METbin is used. This uncertainty ranges from 5% to 500% depending on the search bin. The closure uncertainties for the 37 search bins optimized for direct top squark production are of similar size. The high closure uncertainties for some search bins are due to statistical limitations of the simulation, but have a small effect on the final results because the QCD multijet yields are very low in these search bins compared to other backgrounds. In addition, another major source of systematic uncertainty in the QCD multijet prediction is the uncertainty in the factors.
0.4.5 Backgrounds from and other SM rare processes
Similar to the background, is an irreducible background when bosons decay to neutrinos and both top quarks decay hadronically. The cross section at 13\TeVis only 783\unitfb (computed at NLO using \MADGRAPH5_a\MCATNLO) and the predicted yield of events in the search bins is less than 10% of the total background. Given the presence of genuine \METand jets in events, and given the small cross section associated with this process, we rely on simulation to predict its contribution to each search region bin. The simulation is validated using a trilepton control sample in data, and the 30% statistical uncertainty in this data measurement is propagated to the prediction.
The contribution of the process to the signal region is covered by the LL and background estimation methods. The signal region yields for the diboson and multiboson processes are fully determined by simulation and are combined into a single rare background prediction.
0.5 Results and interpretation
The predicted number of SM background events and the number of events observed in data for each of the search regions defined in Sec. 0.3.3 are summarized in Fig. 10 and Tables 0.5 and 0.5 for the binning optimized for direct top squark production, and in Fig. 11 and Tables 0.5 and 0.5 for the binning optimized for gluino-mediated production models. Typically, the most significant background across the search regions comes from SM \ttbaror boson production, where the boson decay contains genuine \METfrom a neutrino. Generally, the next largest contribution comes from production in association with jets (including heavy-flavor jets) in which the neutrino pair gives rise to large \METand the top quark conditions are satisfied by an accidental combination of the jets. For search regions with very high \METrequirements, the background can become dominant. The QCD multijet contribution and the contribution from other rare SM processes are subdominant across all bins. The largest rare SM process contribution (though still small) comes from with the boson decaying into a pair of neutrinos. No statistically significant deviation between the observed data events and the SM background prediction is found.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] ATLAS Collaboration, “Observation of a new particle in the search for the Standard Model Higgs boson with the ATLAS detector at the LHC”, Phys. Lett. B 716 (2012) 1, 10.1016/j.physletb.2012.08.020 , ar Xiv:1207.7214 . · doi ↗
- 2[2] CMS Collaboration, “Observation of a new boson at a mass of 125 Ge V with the CMS experiment at the LHC”, Phys. Lett. B 716 (2012) 30, 10.1016/j.physletb.2012.08.021 , ar Xiv:1207.7235 . · doi ↗
- 3[3] CMS Collaboration, “Observation of a new boson with mass near 125 Ge V in pp collisions at s 𝑠 \sqrt{s} = 7 and 8 Te V”, JHEP 06 (2013) 081, 10.1007/JHEP 06(2013)081 , ar Xiv:1303.4571 . · doi ↗
- 4[4] L. Evans and P. Bryant, “LHC Machine”, JINST 3 (2008) S 08001, 10.1088/1748-0221/3/08/S 08001 . · doi ↗
- 5[5] ATLAS and CMS Collaborations, “Combined measurement of the Higgs boson mass in p p 𝑝 𝑝 pp collisions at s = 7 𝑠 7 \sqrt{s}=7 and 8 Te V with the ATLAS and CMS experiments”, Phys. Rev. Lett. 114 (2015) 191803, 10.1103/Phys Rev Lett.114.191803 , ar Xiv:1503.07589 . · doi ↗
- 6[6] S. Dimopoulos and S. Raby, “Supercolor”, Nucl. Phys. B 192 (1981) 353, 10.1016/0550-3213(81)90430-2 . · doi ↗
- 7[7] E. Witten, “Dynamical breaking of supersymmetry”, Nucl. Phys. B 188 (1981) 513, 10.1016/0550-3213(81)90006-7 . · doi ↗
- 8[8] M. Dine, W. Fischler, and M. Srednicki, “Supersymmetric technicolor”, Nucl. Phys. B 189 (1981) 575, 10.1016/0550-3213(81)90582-4 . · doi ↗
