Search for top-quark decays $t \rightarrow Hq$ with 36 fb$^{-1}$ of $pp$ collision data at $\sqrt{s}=13$ TeV with the ATLAS detector
ATLAS Collaboration

TL;DR
This paper searches for rare top-quark decays into a Higgs boson and an up-type quark using 36 fb$^{-1}$ of 13 TeV collision data, setting upper limits on the branching ratios and couplings with no significant signal found.
Contribution
First combined search for $t ightarrow Hq$ decays in multiple Higgs decay channels using ATLAS data at 13 TeV, improving constraints on these rare processes.
Findings
No significant excess observed above background.
95% CL upper limits on $t ightarrow Hc$ and $t ightarrow Hu$ branching ratios are around 1.1-1.2 x 10^{-3}.
Upper limits on the couplings $| ext{lambda}_{tcH}|$ and $| ext{lambda}_{tuH}|$ are approximately 0.055-0.066.
Abstract
A search for flavour-changing neutral current decays of a top quark into an up-type quark () and the Standard Model Higgs boson, , is presented. The search is based on a dataset of collisions at TeV recorded in 2015 and 2016 with the ATLAS detector at the CERN Large Hadron Collider and corresponds to an integrated luminosity of 36.1 fb. Two complementary analyses are performed that search for top-quark pair events in which one top quark decays into and the other top quark decays into , and target the and decay modes, respectively. The high multiplicity of -quark jets, or the presence of hadronically decaying -leptons, are exploited in the two analyses respectively. Multivariate techniques are used to separate the signal from the background, which is dominated by…
| Preselection requirements | |||
|---|---|---|---|
| Requirement | search | search | |
| channel | channel | ||
| Trigger | single-lepton trigger | single-lepton trigger | di- trigger |
| Leptons | =1 isolated or | =1 isolated or | no isolated or |
| – | 1 | 2 | |
| Electric charge () | – | ||
| Jets | 4 jets | 3 jets | 3 jets |
| -tagging | 2 -tagged jets | =1 -tagged jets | =1 -tagged jets |
| Variable | 3j | 4j | 3j | 4j |
|---|---|---|---|---|
| centrality | ||||
| 4j, 2b | 4j, 3b | 4j, 4b | |
| +light-jets | |||
| +jets | |||
| +jets | |||
| Single top | |||
| Diboson | |||
| Multijet | |||
| Total background | |||
| Data | 120572 | 11275 | 176 |
| 4j, 2b | 4j, 3b | 4j, 4b | |
|---|---|---|---|
| +light-jets | |||
| +jets | |||
| +jets | |||
| Single-top | |||
| Diboson | |||
| Multijet | |||
| Total | |||
| Data | 120572 | 11275 | 176 |
| 4j, 2b | 4j, 3b | 4j, 4b | |
|---|---|---|---|
| +light-jets | |||
| +jets | |||
| +jets | |||
| Single-top | |||
| Diboson | |||
| Multijet | |||
| Total | |||
| Data | 120572 | 11275 | 176 |
| , 3j | , 4j | , 3j | , 4j | |
| Fake | ||||
| Top (real ) | ||||
| Other | ||||
| Total background | ||||
| Data | 7259 | 6768 | 1119 | 894 |
| , 3j | , 4j | , 3j | , 4j | |
|---|---|---|---|---|
| Fake | ||||
| Top (real ) | ||||
| Other | ||||
| Total | ||||
| Data | 7259 | 6768 | 1119 | 894 |
| , 3j | , 4j | , 3j | , 4j | |
|---|---|---|---|---|
| Fake | ||||
| Top (real ) | ||||
| Other | ||||
| Total | ||||
| Data | 7259 | 6768 | 1119 | 894 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\AtlasTitle
Search for top-quark decays with 36 fb*-1* of collision data at with the ATLAS detector \AtlasAbstract A search for flavour-changing neutral current decays of a top quark into an up-type quark () and the Standard Model Higgs boson, , is presented. The search is based on a dataset of collisions at recorded in 2015 and 2016 with the ATLAS detector at the CERN Large Hadron Collider and corresponding to an integrated luminosity of 36.1 fb*-1*. Two complementary analyses are performed to search for top-quark pair events in which one top quark decays into and the other top quark decays into , and target the and decay modes, respectively. The high multiplicity of -quark jets, or the presence of hadronically decaying -leptons, is exploited in the two analyses respectively. Multivariate techniques are used to separate the signal from the background, which is dominated by top-quark pair production. No significant excess of events above the background expectation is found, and 95% CL upper limits on the branching ratios are derived. The combination of these searches with ATLAS searches in diphoton and multilepton final states yields observed (expected) 95% CL upper limits on the and branching ratios of () and (), respectively. The corresponding combined observed (expected) upper limits on the and couplings are 0.064 (0.055) and 0.066 (0.055), respectively.
\AtlasRefCodeTOPQ-2017-07 \AtlasJournalJHEP \AtlasJournalRefJHEP 05 (2019) 123 \AtlasDOI10.1007/JHEP05(2019)123 \PreprintIdNumberCERN-EP-2018-295 \AtlasCoverSupportingNoteSearch for , https://cds.cern.ch/record/2257631 \AtlasCoverSupportingNoteSearch for , https://cds.cern.ch/record/2273683 \AtlasCoverSupportingNoteCombination of searcheshttps://cds.cern.ch/record/2312520/ \AtlasCoverCommentsDeadline4 November 2018 \AtlasCoverAnalysisTeam : Trisha Farooque, Davide Gerbaudo, Aurelio Juste, Nicola Orlando,
Cheng Peng, Laura Pereira, Yulia Rodina, Yanjun Tu, Loïc Valéry,
Tal Van Daalen, Jia-Shian Wang, Daiki Yamaguchi;
: Xin Chen, Antonio De Maria, Boyang Li, Ligang Xia, Gang Zhang;
Combination: Peter Onyisi, Harish Potti \AtlasCoverEdBoardMemberKetevi Assamagan (chair), Garabed Halladjian, Fabrice Hubaut, Michele Weber (chair) \[email protected] \AtlasCoverEgroupEdBoardatlas-TOPQ-2017-07-editorial-board@cern.ch
1 Introduction
Following the observation of the Higgs boson by the ATLAS and CMS experiments [1, 2] at the Large Hadron Collider (LHC), a comprehensive programme of measurements of its properties is underway. An interesting possibility is the presence of flavour-changing neutral-current (FCNC) interactions between the Higgs boson, the top quark, and a - or -quark, (). Since the Higgs boson is lighter than the top quark [3], such interactions would manifest themselves as FCNC top-quark decays [4], . In the Standard Model (SM), such decays are suppressed relative to the dominant decay mode, since interactions are forbidden at the tree level and suppressed even at higher orders in the perturbative expansion due to the Glashow–Iliopoulos–Maiani (GIM) mechanism [5]. As a result, the SM predictions for the branching ratios () are exceedingly small, and [6, 7, 8, 9], making them undetectable in the foreseeable future. In contrast, large enhancements of these branching ratios are possible in some scenarios beyond the SM. Examples include quark-singlet models [10], two-Higgs-doublet models (2HDM) of type I, with explicit flavour conservation, and of type II, such as the minimal supersymmetric SM (MSSM) [11, 12, 13, 14], supersymmetric models with R-parity violation [15], composite Higgs models with partial compositeness [16], or warped extra dimensions models with SM fermions in the bulk [17]. In these scenarios, branching ratios can be as high as . An even larger branching ratio of can be reached in 2HDM without explicit flavour conservation (type III), since a tree-level FCNC coupling is not forbidden by any symmetry [18, 19, 20, 21, 22, 23, 24, 25]. While other FCNC top couplings (, , ) are also enhanced in these scenarios beyond the SM, the largest enhancements are typically found for the couplings, and in particular the coupling [4].
Searches for decays have been performed by the ATLAS and CMS collaborations, taking advantage of the large samples of top-quark pair () events collected in proton-proton () collisions at centre-of-mass energies of and [26, 27, 28] during Run 1 of the LHC, as well as at [29, 30, 31] using early Run 2 data. In these searches, one of the top quarks is required to decay into , while the other top quark decays into , yielding .111 In the following, is used to denote both and its charge conjugate, . Similarly, is used to denote . The Higgs boson is assumed to have a mass of and to decay as predicted by the SM. The simplifying assumption of SM-like Higgs boson branching ratios is motivated by the fact that measurements of the flavour-diagonal Higgs boson couplings by the ATLAS and CMS collaborations are in agreement with the SM prediction within about 10% [32, 33]. Furthermore, typical beyond-the-SM scenarios that predict significant enhancements to , also predict modifications to the Higgs boson branching ratios at the few percent level or below, well beyond the current experimental precision. Some of the most sensitive single-channel searches have been performed in the decay mode, which has a small branching ratio of , but benefits from having a very small background contamination and excellent diphoton mass resolution. Searches targeting signatures with two same-charge leptons or three leptons (electrons or muons), generically referred to as multileptons, are able to exploit a branching ratio that is significantly larger for the decay modes than for the decay mode, and are also characterised by relatively small backgrounds. Finally, searches have also been performed exploiting the dominant Higgs boson decay mode, , which has a branching ratio of . Compared with Run 1, the Run 2 searches benefit from the increased cross section at , as well as the larger integrated luminosity. Using 36.1 fb*-1* of data at , the ATLAS Collaboration has derived upper limits at 95% confidence level (CL) of using decays [29], and of based on multilepton signatures resulting from , in which both -leptons decay leptonically, or [30]. These upper limits are derived assuming that . Similar upper limits are obtained for if . The CMS Collaboration has performed a search using decays [31] with 35.9 fb*-1* of data at , resulting in upper limits of and , in each case neglecting the other decay mode. Compared with previous searches, the search in Ref. [31] considers in addition the contribution to the signal from production [34].
The searches presented in this paper are focussed on fermionic decay modes of the Higgs boson. Therefore, they help to complete the ATLAS experiment’s programme of searches for decays based on collision data at recorded in 2015 and 2016. The corresponding integrated luminosity is 36.1 fb*-1*. Two analyses are performed, searching for production (ignoring production) and targeting the and decay modes, which this paper refers to as “ search” and “ search”, respectively. The search selects events with one isolated electron or muon from the decay, and multiple jets, several of which are identified with high purity as originating from the hadronisation of -quarks. The search selects events with two -lepton candidates, at least one of which decays hadronically, as well as multiple jets. The latter requirement aims to select events with a hadronically decaying boson, since this allows an improved reconstruction of the event kinematics.
Both searches employ multivariate techniques to discriminate between the signal and the background on the basis of their different kinematics. These two searches are combined with previous ATLAS searches in the diphoton and multilepton final states using the same dataset [29, 30], and bounds are set on and , as well as on the corresponding non-flavour-diagonal Yukawa couplings. The combination is performed after verifying the overall consistency of the results obtained by the different searches, which exploit very different experimental signatures and thus are affected by different backgrounds and related systematic uncertainties. By combining all searches, the expected sensitivity is improved by about a factor of two relative to the most sensitive individual results.
2 ATLAS detector
The ATLAS detector [35] at the LHC covers almost the entire solid angle around the collision point,222ATLAS uses a right-handed coordinate system with its origin at the nominal interaction point (IP) in the centre of the detector. The -axis points from the IP to the centre of the LHC ring, the -axis points upward, and the -axis coincides with the axis of the beam pipe. Cylindrical coordinates (,) are used in the transverse plane, being the azimuthal angle around the beam pipe. The pseudorapidity is defined in terms of the polar angle as . Angular distance is measured in units of . and consists of an inner tracking detector surrounded by a thin superconducting solenoid producing a 2 T axial magnetic field, electromagnetic and hadronic calorimeters, and a muon spectrometer incorporating three large toroid magnet assemblies with eight coils each. The inner detector contains a high-granularity silicon pixel detector, including the insertable B-layer [36, 37, 38], installed in 2014, and a silicon microstrip tracker, together providing a precise reconstruction of tracks of charged particles in the pseudorapidity range . The inner detector also includes a transition radiation tracker that provides tracking and electron identification for . The calorimeter system covers the pseudorapidity range . Within the region , electromagnetic (EM) calorimetry is provided by barrel and endcap high-granularity lead/liquid-argon (LAr) sampling calorimeters, with an additional thin LAr presampler covering , to correct for energy loss in material upstream of the calorimeters. Hadronic calorimetry is provided by a steel/scintillator-tile calorimeter, segmented into three barrel structures within , and two copper/LAr hadronic endcap calorimeters. The solid angle coverage is completed with forward copper/LAr and tungsten/LAr calorimeter modules optimised for electromagnetic and hadronic measurements, respectively. The calorimeters are surrounded by a muon spectrometer within a magnetic field provided by air-core toroid magnets with a bending integral of about 2.5 Tm in the barrel and up to 6 Tm in the endcaps. The muon spectrometer measures the trajectories of muons with using multiple layers of high-precision tracking chambers, and is instrumented with separate trigger chambers covering . A two-level trigger system [39], consisting of a hardware-based level-1 trigger followed by a software-based high-level trigger, is used to reduce the event rate to a maximum of around 1 kHz for offline storage.
3 Event reconstruction
The event reconstruction is affected by multiple collisions in a single bunch crossing and by collisions in neighbouring bunch crossings, referred to as pile-up. Interaction vertices from the collisions are reconstructed from at least two tracks with transverse momentum () larger than that are consistent with originating from the beam collision region in the – plane. If more than one primary vertex candidate is found, the candidate whose associated tracks form the largest sum of squared [40] is selected as the hard-scatter primary vertex.
Electron candidates [41, 42] are reconstructed from energy clusters in the EM calorimeter that are matched to reconstructed tracks in the inner detector; electron candidates in the transition region between the EM barrel and endcap calorimeters () are excluded. In the () search, electron candidates are required to have and , and to satisfy tight (medium) likelihood-based identification criteria [41] based on calorimeter, tracking and combined variables that provide separation between electrons and jets.
Muon candidates [43] are reconstructed by matching track segments in different layers of the muon spectrometer to tracks found in the inner detector; the resulting muon candidates are re-fitted using the complete track information from both detector systems. In the () search, muon candidates are required to have and and to satisfy medium identification criteria [43].
Electron (muon) candidates are matched to the primary vertex by requiring that the significance of their transverse impact parameter, , satisfies , where is the measured uncertainty in , and by requiring that their longitudinal impact parameter, , satisfies mm. To further reduce the background from non-prompt leptons, photon conversions and hadrons, lepton candidates are also required to be isolated in the tracker and in the calorimeter. A track-based lepton isolation criterion is defined by calculating the quantity , where the scalar sum includes all tracks (excluding the lepton candidate itself) within the cone defined by around the direction of the lepton. The value of is the smaller of and , where is set to 0.2 (0.3) for electron (muon) candidates, and is the lepton . The search requires lepton candidates to satisfy , while the search makes -dependent requirements on . Additionally, the search requires leptons to satisfy a calorimeter-based isolation criterion: the sum of the transverse energy within a cone of size around the lepton, after subtracting the contributions from pile-up and the energy deposit of the lepton itself, is required to be less than a -dependent fraction of the lepton energy.
Candidate jets are reconstructed with the anti- algorithm [44, 45] with a radius parameter , as implemented in the FastJet package [46]. Jet reconstruction in the calorimeter starts from topological clustering [47] of individual calorimeter cells calibrated to the electromagnetic energy scale. The reconstructed jets are then calibrated to the particle level by the application of a jet energy scale derived from simulation and in situ corrections based on data [48]. The calibrated jets used in the search are required to have and , while the search uses jets with and . Jet four-momenta are corrected for pile-up effects using the jet-area method [49].
Quality criteria are imposed to reject events that contain any jets arising from non-collision sources or detector noise [50]. To reduce the contamination due to jets originating from pile-up interactions, additional requirements are imposed on the jet vertex tagger (JVT) [51] output for jets with and , or on the forward JVT [52] output for jets with and .
Jets containing -hadrons are identified (-tagged) via an algorithm [53, 54] that uses multivariate techniques to combine information about the impact parameters of displaced tracks and the topological properties of secondary and tertiary decay vertices reconstructed within the jet. For each jet, a value for the multivariate -tagging discriminant is calculated. In the search, a jet is considered -tagged if this value is above the threshold corresponding to an average 70% efficiency to tag a -quark jet, with a light-jet333Light-jet refers to a jet originating from the hadronisation of a light quark (, , ) or a gluon. rejection factor of about 380 and a charm-jet rejection factor of about 12, as determined for jets with and in simulated events. In contrast, the search employs a tighter -tagging requirement, corresponding to an average efficiency of 60% to tag a -quark jet, and light-jet and charm-jet rejection factors of about 1500 and 34, respectively.
Hadronically decaying -lepton () candidates are reconstructed from energy clusters in the calorimeters and associated inner-detector tracks [55]. Candidates are required to have either one or three associated tracks, with a total charge of . Candidates are required to have and , excluding the EM calorimeter’s transition region. A boosted decision tree (BDT) discriminant [56, 57, 58] using calorimeter- and tracking-based variables is used to identify candidates and reject jet backgrounds. Three working points labelled loose, medium and tight are defined, and correspond to different identification efficiency values, with the efficiency designed to be independent of . The search uses the medium working point for the nominal selection, while the loose working point is used for background estimation. The medium working point has a combined reconstruction and identification efficiency of 55% (40%) for one-prong (three-prong) decays [59], and an expected rejection factor against light-jets of 100 [55]. Electrons that are reconstructed as one-prong candidates are removed via a BDT trained to reject electrons. Any candidate that is also -tagged is rejected.
Overlaps between reconstructed objects are removed sequentially. In the search, firstly, electron candidates that lie within of a muon candidate are removed to suppress contributions from muon bremsstrahlung. Overlaps between electron and jet candidates are resolved next, and finally, overlaps between remaining jet candidates and muon candidates are removed. Energy clusters from identified electrons are not excluded during jet reconstruction. In order to avoid double-counting of electrons as jets, the closest jet whose axis is within of an electron is discarded. If the electron is within of the axis of any jet after this initial removal, the jet is retained and the electron is removed. The overlap removal procedure between the remaining jet candidates and muon candidates is designed to remove those muons that are likely to have arisen in the decay of hadrons and to retain the overlapping jet instead. Jets and muons may also appear in close proximity when the jet results from high- muon bremsstrahlung, and in such cases the jet should be removed and the muon retained. Such jets are characterised by having very few matching inner-detector tracks. Selected muons that satisfy are rejected if the jet has at least three tracks originating from the primary vertex; otherwise the jet is removed and the muon is kept. The overlap removal procedure in the search is similar to that of the search, except that the first step is the removal of candidates within of electrons or muons, and the last step is the removal of jets whose axis lies within of the leading (highest-) candidate or the two leading candidates (depending on the search channel). In addition, the muon–jet overlap removal is slightly different: if a muon lies within of the axis of a jet, the jet is removed if either it has fewer than three tracks originating from the primary vertex or it has a small compared with that of the muon (the of the jet is less than 50% of the of the muon, or the scalar sum of the of the tracks associated with the jet is less than 70% of the of the muon).
The missing transverse momentum (with magnitude ) is defined as the negative vector sum of the of all selected and calibrated objects in the event, including a term to account for momentum from soft particles in the event which are not associated with any of the selected objects. This soft term is calculated from inner-detector tracks matched to the selected primary vertex to make it more resilient to contamination from pile-up interactions [60].
4 Data sample and event preselection
Both searches are based on a dataset of collisions at with 25 ns bunch spacing collected in 2015 and 2016, corresponding to an integrated luminosity of 36.1\leavevmode\nobreak\ \mbox{fb{}^{-1}}. Only events recorded with a single-electron trigger, a single-muon trigger, or a di- trigger under stable beam conditions and for which all detector subsystems were operational are considered. The number of interactions per bunch crossing in this dataset ranges from about 8 to 45, with an average of 24.
Single-electron and single-muon triggers with low thresholds and lepton isolation requirements are combined in a logical OR with higher-threshold triggers but with a looser identification criterion and without any isolation requirement. The lowest threshold used for muons is 20 (26) GeV in 2015 (2016), while for electrons the threshold is 24 (26) GeV. For di- triggers, the threshold of the leading (trailing) candidate is 35 (25) GeV. In both searches, events satisfying the trigger selection are required to have at least one primary vertex candidate.
Events selected by the search are recorded with a single-electron or single-muon trigger and are required to have exactly one electron or muon that matches, with , the lepton reconstructed by the trigger. Furthermore, at least four jets are required, of which at least two must be -tagged.
In the search, events are classified into and channels depending on the multiplicity of selected leptons. Events in the channel are recorded with a single-electron or single-muon trigger and are required to have exactly one selected electron or muon and at least one candidate. The selected electron or muon is required to match, with , the lepton reconstructed by the trigger and to have a exceeding the trigger threshold by 1 GeV or 2 GeV (depending on the lepton trigger and data-taking conditions). In addition, its electric charge is required to be of opposite sign to that of the leading candidate. Events in the channel are recorded with a di- trigger, and are required to have at least two candidates and no selected electrons or muons. The two leading candidates are required to have charges of opposite sign. In addition, in both search channels, trigger matching for candidates, at least three jets and exactly one -tagged jet are required.
The above requirements apply to the reconstructed objects defined in Section 3. These requirements, which ensure a negligible overlap between the and searches, are referred to as the preselection and are summarised in Table 1.
5 Signal and background modelling
Signal and most background processes are modelled using Monte Carlo (MC) simulation. After the event preselection, the main background is production, often in association with jets, denoted by +jets in the following. Small contributions arise from single-top-quark, +jets, multijet and diboson () production, as well as from the associated production of a vector boson () or a Higgs boson and a pair ( and ). All backgrounds with prompt leptons, i.e. those originating from the decay of a boson, a boson, or a -lepton, are estimated using samples of simulated events and initially normalised to their theoretical cross sections. In the simulation, the top-quark and SM Higgs boson masses are set to and , respectively, and the Higgs boson is allowed to decay into all SM particles with branching ratios calculated using Hdecay [61]. Backgrounds with non-prompt electrons or muons, with photons or jets misidentified as electrons, or with jets misidentified as candidates, generically referred to as fake leptons, are estimated using data-driven methods. The background prediction is further improved during the statistical analysis by performing a likelihood fit to data using several signal-depleted analysis regions, as discussed in Sections 6 and 7.
5.1 Simulated signal and background processes
Samples of simulated events were generated with the next-to-leading-order (NLO) generator444In the following, the order of a generator should be understood as referring to the order in the strong coupling constant at which the matrix-element calculation is performed. Madgraph5_aMC@NLO 2.4.3 [62] (referred to in the following as MG5_aMC) with the NNPDF3.0 NLO [63] parton distribution function (PDF) set and interfaced to Pythia 8.212 [64] with the NNPDF2.3 LO [65] PDF set for the modelling of parton showering, hadronisation, and the underlying event. The A14 [66] set of tuned parameters in Pythia controlling the description of multiparton interactions and initial- and final-state radiation, referred to as the tune, was used. The signal sample is normalised to the same total cross section as used for the inclusive sample (see discussion below) and assuming an arbitrary branching ratio of . The case of both top quarks decaying into is neglected in the analysis given the existing upper limits on (Section 1).
The nominal sample used to model the background was generated with the NLO generator Powheg-Box v2 [67, 68, 69, 70] using the NNPDF3.0 NLO PDF set. The Powheg-Box model parameter , which controls matrix element to parton shower matching and effectively regulates the high- radiation, was set to 1.5 times the top-quark mass. The parton showers, hadronisation, and underlying event were modelled by Pythia 8.210 with the NNPDF2.3 LO PDF set in combination with the A14 tune. Alternative simulation samples used to derive systematic uncertainties are described in Section 8.3. The generated samples are normalised to a theoretical cross section of pb, computed using Top++ v2.0 [71] at next-to-next-to-leading order (NNLO), including resummation of next-to-next-to-leading logarithmic (NNLL) soft gluon terms [72, 73, 74, 75, 76].
The background selected by the search is enriched in +heavy-flavour production, and thus requires a more sophisticated treatment than provided by the nominal sample; this treatment is briefly outlined below. A detailed discussion can be found in Ref. [77]. The simulated events are categorised depending on the flavour content of additional particle jets not originating from the decay of the system. Events labelled as either +1 or +1 are generically referred to in the following as +HF events, where HF stands for heavy flavour. The remaining events are labelled as +light-jets events, including those with no additional jets. A finer categorisation of +1 events is considered for the purpose of applying further corrections and assigning systematic uncertainties associated with the modelling of heavy-flavour production in different event topologies [77]. In particular, the +1 events are reweighted to an NLO prediction in the four-flavour (4F) scheme of +1 production including parton showering [78], based on Sherpa+OpenLoops [79, 80] (referred to as SherpaOL in the following) using the CT10 4F PDF set. This reweighting is performed in such a way that the inter-normalisations of the +1 categories are at NLO accuracy, while preserving the +1 cross section of the nominal sample. This reweighting is also applied to the alternative samples that are used to study systematic uncertainties.
Samples of single-top-quark events corresponding to the -channel production mechanism were generated with the Powheg-Box v1 [81] generator, using the 4F scheme for the NLO matrix-element calculations and the fixed 4F CT10f4 [82] PDF set. Samples corresponding to the - and -channel production mechanisms were generated with Powheg-Box v1 using the CT10 PDF set. Overlaps between the and final states were avoided by using the diagram removal scheme [83]. The parton showers, hadronisation and the underlying event were modelled using Pythia 6.428 [84] with the CTEQ6L1 [85, 86] PDF set in combination with the Perugia 2012 tune [87]. The single-top-quark samples are normalised to the approximate NNLO theoretical cross sections [88, 89, 90].
Samples of +jets events were generated with the Sherpa 2.2.1 [79] generator. The matrix element was calculated for up to two partons at NLO and up to four partons at LO using Comix [91] and OpenLoops [80]. The matrix-element calculation is merged with the Sherpa parton shower [92] using the ME+PS@NLO prescription [93]. The PDF set used for the matrix-element calculation is NNPDF3.0 NNLO [63] with a dedicated parton shower tuning developed for Sherpa. Separate samples were generated for different +jets categories using filters for a -jet (+1+jets), a -jet and no -jet (+1+jets), and with a veto on - and -jets (+light-jets), which are combined into the inclusive +jets samples. Both the +jets and +jets samples are normalised to their respective inclusive NNLO theoretical cross sections calculated with FEWZ [94].
Samples of +jets events were generated with Sherpa 2.2.1 using the CT10 PDF set and include processes containing up to four electroweak vertices. In the case of +jets (+jets) the matrix element was calculated for zero (up to one) additional partons at NLO and up to three partons at LO using the same procedure as for the +jets samples. The final states simulated require one of the bosons to decay leptonically and the other hadronically. All diboson samples are normalised to their NLO theoretical cross sections provided by Sherpa.
Samples of and events were generated with MG5_aMC 2.2.1, using NLO matrix elements and the NNPDF3.0 NLO PDF set, and interfaced to Pythia 8.210 with the NNPDF2.3 LO PDF set and the A14 tune. Instead, the samples used in the search are based on LO matrix elements computed for up to two additional partons using the NNPDF3.0 NLO PDF set, and merged using the CKKW-L approach [95]. The samples are normalised to the NLO cross section computed with MG5_aMC, while the sample is normalised using the NLO cross section recommended in Ref. [96].
All generated samples, except those produced with the Sherpa [79] event generator, utilise EvtGen 1.2.0 [97] to model the decays of heavy-flavour hadrons. To model the effects of pile-up, events from minimum-bias interactions were generated using Pythia 8.186 [64] in combination with the A2 tune [98], and overlaid onto the simulated hard-scatter events according to the luminosity profile of the recorded data. The generated events were processed through a simulation [99] of the ATLAS detector geometry and response using Geant4 [100]. A faster simulation, where the full Geant4 simulation of the calorimeter response is replaced by a detailed parameterisation of the shower shapes [101], was adopted for some of the samples used to estimate systematic uncertainties in background modelling. Simulated events were processed through the same reconstruction software as the data, and corrections were applied so that the object identification efficiencies, energy scales and energy resolutions match those determined from data control samples.
5.2 Backgrounds with fake leptons
5.2.1 Fake electrons and muons
In the search, the background from multijet production (multijet background in the following) contributes to the selected data sample via several production and misreconstruction mechanisms. In the electron channel, it consists of non-prompt electrons (from semileptonic - or -hadron decays) as well as misidentified photons (from a conversion of a photon into an pair) or jets with a high fraction of their energy deposited in the EM calorimeter. In the muon channel, the multijet background originates mainly from non-prompt muons. The multijet background normalisation and shape are estimated directly from data by using the matrix method technique [102, 103], which exploits differences in lepton identification and isolation properties between prompt leptons and leptons that are either non-prompt or result from the misidentification of photons or jets.
5.2.2 Fake -lepton candidates
In the search, the background with one or more fake candidates mainly arises from or multijet production, depending on the search channel, with +jets production contributing to a lesser extent. Studies based on the simulation show that, for all the above processes, fake candidates primarily result from the misidentification of light-quark jets, with the contribution from -quarks and gluon jets playing a subdominant role. It is also found that the fake rate decreases for all jet flavours as the candidate increases.
This background is estimated directly from data by defining control regions (CR) enriched in fake candidates via loosened requirements or flipped charge. These CRs do not overlap with the main search regions (SRs), discussed in Section 7. The CR selection requirements are analogous to those used to define the different SRs, except that the leading (trailing) candidate in the () channel is required to fail the medium identification but pass the loose identification, or the two candidates have the same charge.
The fake background prediction in a given SR is modelled by the distribution (referred to as the fake template) derived from data in the corresponding CR. The fake template is defined as the data distribution from which the contributions from the simulated backgrounds with real candidates, originating primarily from +jets and +jets, are subtracted. In the channel, simulation studies indicate that the fake background composition is consistent between the SR and the CR, and dominated by production. In the channel, the fake background is expected to be dominated by multijet production. However, simulation studies indicate that the contribution of events to the fake background is higher in the SR than in the CR. Therefore, an appropriate number of simulated events with fake candidates in the CR is added to the fake template to match the fake background composition in the SR. In both the and channels, the fake template in each SR is initially normalised to the estimated fake background yield, defined as the data yield minus the contributions from the simulated backgrounds with real candidates (assuming no signal contribution). During the statistical analysis, the normalisation of the fake background in each SR is allowed to vary freely in the fit to data, as discussed in Section 10.2.
6 Strategy for the search
This section presents an overview of the analysis strategy adopted in the search, which closely follows that of the previous search performed on the Run 1 dataset [27].
6.1 Event categorisation
Given that the and decay modes are chosen, the signal is expected to have four jets in the final state, three of them originating from -quarks, which can be effectively exploited to suppress the background. Additional jets can also be present because of initial- or final-state radiation. However, the use of the 60% -tagging efficiency operating point, characterised by a low mistag rate for - and light-jets, results in both the and signals having a similar -tag multiplicity distribution, with a very small fraction of events having four or more -tagged jets.
In order to optimise the sensitivity of the search, the selected events are categorised into different analysis regions depending on the number of jets (4, 5 and 6) and on the number of -tagged jets (2, 3 and 4). Therefore, a total of nine analysis regions are considered: (4j, 2b), (4j, 3b), (4j, 4b), (5j, 2b), (5j, 3b), (5j, 4b), (6j, 2b), (6j, 3b), and (6j, 4b), where (j, b) indicates selected jets and -tagged jets.
The overall rate and composition of the +jets background strongly depends on the jet and -tag multiplicities, as illustrated in Figure 1. Regions with exactly two -tagged jets are dominated by +light-jets, while regions with at least four -tagged jets are dominated by +1. Intermediate compositions are found in regions with exactly three -tagged jets. Most of the +light-jets background events in these regions have a -tagged charm jet from the hadronic boson decay, in addition to the two -jets from the top-quark decays.
In the regions with four or five jets and exactly three -tagged jets, which dominate the sensitivity of this search, the selected signal events have a decay in more than 97% of the events. The other regions have significantly lower signal-to-background ratios, but they are used to improve the +jets background prediction and constraining the related systematic uncertainties through a likelihood fit to data. Because of a somewhat larger fraction of signal in the regions with exactly three -tagged jets, resulting from the higher mistag rate for -jets than for light-jets, this analysis is expected to have slightly better sensitivity to a signal than to a signal.
6.2 Likelihood discriminant
After event categorisation, the signal-to-background ratio is insufficient even in the best cases to achieve sensitivity, and a suitable discriminating variable between signal and background needs to be constructed in order to improve the sensitivity of the search. Since both signal and background result from the decay, their discrimination is a challenge and it is based on a few measured quantities. The most prominent features are the different resonances present in the decay (the Higgs boson in the case of the signal and a hadronically decaying boson in the case of the background), and the different flavours of the jets forming those resonances. However, the large number of jets in the final state causes ambiguities in the calculation of these kinematic variables to discriminate signal events from background events.
This search uses a likelihood (LH) discriminant similar to that developed in Ref. [27]. The LH variable for a given event is defined as:
[TABLE]
where and represent the probability density functions (pdf) of a given event under the signal hypothesis () and under the background hypothesis (), respectively. Both and are functions of , representing the four-momentum vectors of all final-state particles at the reconstruction level: the lepton, the missing transverse momentum, and the selected jets in a given analysis region. The value of the multivariate -tagging discriminant for each jet is also included in . As in Ref. [27], and are approximated as a product of one-dimensional pdfs over the set of two-body and three-body invariant masses that correspond to the expected resonances in the event (the leptonically decaying boson, the Higgs boson or the hadronically decaying boson, and the corresponding parent top quarks) and averaged over all possible parton–jet matching combinations. Combinations are weighted using the per-jet multivariate -tagging discriminant value to suppress the impact from parton–jet assignments that are inconsistent with the correct flavour of the parton candidates. The invariant masses are computed from the reconstructed lepton, missing transverse momentum, and jets. After a suitable transformation of the three-body invariant masses (see Ref. [27]), all considered invariant mass variables are largely uncorrelated, thus making possible the factorisation of and as discussed above.
Two background hypotheses are considered, corresponding to the dominant backgrounds in the analysis: +light-jets and +1. Thus, is computed as the average of the pdfs for the two hypotheses, weighted by their relative fractions found in simulated +jets events, which depend on the analysis region considered. Furthermore, in a significant fraction of simulated events (about 40–50% in regions with exactly three -tagged jets), the light-quark jet from the hadronic top-quark decay is not among the selected jets. Similarly, in about 30–40% (50–90%) of simulated +light-jets () background events in regions with exactly three -tagged jets, the light-quark jet originating from the boson decay is also not selected. Thus, the calculation of and also includes an additional hypothesis to account for this topology, again weighted by the corresponding fractions. In this case, the invariant masses involving the missing jet are computed using the highest- jet not matched to a decay product from the system.
Figure 2 shows a comparison between data and prediction in the most sensitive analysis region, (4j, 3b), for several kinematic variables associated with the reconstructed lepton, jets, and missing transverse momentum. The distributions shown correspond to the lepton , the , the scalar sum of the transverse momenta of the jets, and the invariant mass distribution of the two -tagged jets with lowest separation. The variables displayed do not correspond directly to those used internally in the evaluation the LH discriminant, as to build them it is necessary to select a particular signal or background hypothesis and a jet permutation. Instead, these distributions are shown to demonstrate that a good description of the data by the background prediction is observed in several kinematic variables related to the information used in the LH discriminant construction.
Figure 3 compares the shape of the LH discriminant distribution between the and signals and the background in each of the analysis regions considered. Since this analysis has higher expected sensitivity to a signal than to a signal, in order to allow probing of the versus plane, the LH discriminant optimised for is used for both decay modes. It was verified that using the discriminant for the search does not result in a significant sensitivity loss.
7 Strategy for the search
The analysis strategy adopted in the search closely follows that developed in Ref. [104] and is summarised in this section.
7.1 Event categorisation and kinematic reconstruction
In the search, the signal being probed is characterised by the presence of -leptons from the decay of the Higgs boson and at least four jets, only one of which originates from a -quark. If one of the -leptons decays leptonically, an isolated electron or muon and significant is also expected. However, in a significant fraction of the events the lowest- jet from the boson decay fails the minimum requirement of , resulting in signal events with only three jets reconstructed. In order to optimise the sensitivity of the search, the selected events are categorised into four SRs depending on the number of and candidates, and on the number of jets: (, 3j), (, 4j), (, 3j), and (, 4j).
This event categorisation is primarily motivated by the different quality of the event kinematic reconstruction, depending on the amount of in the event (larger in events compared with events), and whether a jet from the hadronic top-quark decay is missing or not (events with exactly three jets or at least four jets). The event kinematic reconstruction is based on the strategy used in Ref. [104], and is summarised below.
Events with exactly three jets that are compatible with having a fully reconstructed hadronically decaying top quark () are rejected, as the decay cannot be reconstructed due to the missing light-quark jet. This compatibility is assessed via a likelihood function that depends on the reconstructed mass of the three-jet system and the two non--tagged jets. For the remaining events, the selected jets are assigned to the different top-quark decay products via a criterion based on minimising a sum of angular distances between objects. Finally, the four-momenta of the invisible decay products for each -lepton decay are estimated by minimising a function based on the probability density functions for the angular distance of the visible and invisible products of the -lepton decay, and including Gaussian constraints on the -lepton mass, the Higgs boson mass and the measured within their expected resolutions. The resolution on the -lepton mass and the Higgs boson mass are taken to be and , respectively, while the resolution on the measured is parameterised as a linear function of , with denoting the scalar sum of the of all physics objects contributing to the reconstruction [60]. After the minimisation, the Higgs boson four-momentum, and hence its invariant mass, as well as the four-momentum of the parent top quark, are determined with better resolution. Following the event kinematic reconstruction, several kinematic variables that discriminate between signal and background are defined. These variables are used in the multivariate analysis discussed in the next section.
7.2 Multivariate discriminant
Boosted decision trees are used in each SR to improve the separation between signal and background. In the training, only signal events are used against the total SM background (including both real and fake contributions), whereas to obtain the result the contributions from signal events are also taken into account.
A large set of potential variables were investigated in each SR separately, and only those variables that led to better discrimination by the BDT were kept. The discrimination of a given variable was quantified by the “separation" and “importance" measures provided by the TMVA package [105]. The BDT input variables in each SR are listed in Table 2 and defined in the following:
- •
: the invariant mass of the two -lepton candidates after the reconstruction of the neutrinos, indicating the reconstructed Higgs boson mass.
- •
: the invariant mass of the reconstructed Higgs boson and the associated light-quark jet in the decay, corresponding to the reconstructed mass of the parent top quark.
- •
: the transverse mass calculated from the lepton and in the channel.
- •
and : the transverse momenta of the lepton and candidate (referred to as particles 1 and 2 respectively) in the channel, or the transverse momenta of the leading and trailing candidates (referred to as particles 1 and 2 respectively) in the channel.
- •
centrality: a variable that quantifies the angular position of relative to the visible -lepton decay products in the transverse plane. It is defined as:
[TABLE]
where denotes the azimuthal angle of , and and denote the azimuthal angles the two -lepton candidates (the lepton and candidate in the channel, or the leading and trailing candidates in the channel), referred to as particles 1 and 2 respectively.
- •
: the magnitude of the projection of the original vector parallel to the fitted vector, minus the magnitude of the fitted vector.
- •
: the magnitude of the projection of the original vector perpendicular to the fitted vector.
- •
: the invariant mass of the -jet and the leading jet candidate from the hadronically decaying boson.
- •
: the invariant mass of the lepton and the jet that has the smallest angular distance to the candidate.
- •
: the invariant mass of the candidate and the jet that has the smallest angular distance to the candidate.
- •
and : the momentum fractions carried by the visible decay products from the two -lepton candidates (whether or ) per event. It is based on the best-fit four-momentum of the neutrino(s) according to the event reconstruction procedure outlined in Section 7.1.
- •
: the invariant mass of the -jet and the two jets originating from the boson in the decay, corresponding to the reconstructed mass of the parent top quark. This variable is only defined for events with at least four jets.
Among these variables, the most discriminating are , , and . A comparison between data and the predicted background for some of these variables in each of the SRs considered is shown in Figures 4 and 5. A good description of the data by the background model is observed in all cases. The level of discrimination between signal and background achieved by the BDTs is illustrated in Figure 6.
8 Systematic uncertainties
Several sources of systematic uncertainty that can affect the normalisation of signal and background and/or the shape of their corresponding discriminant distributions are considered. Each source is considered to be uncorrelated with the other sources. Correlations of a given systematic uncertainty are maintained across processes and channels as appropriate. The following sections describe the systematic uncertainties considered.
8.1 Luminosity
The uncertainty in the integrated luminosity is 2.1%, affecting the overall normalisation of all processes estimated from the simulation. It is derived, following a methodology similar to that detailed in Ref. [106], and using the LUCID-2 detector for the baseline luminosity measurements [107], from a calibration of the luminosity scale using – beam-separation scans.
8.2 Reconstructed objects
Uncertainties associated with electrons, muons, and candidates arise from the trigger, reconstruction, identification and isolation (in the case of electrons and muons) efficiencies, as well as the momentum scale and resolution. These are measured using and events () [41, 43] in the case of electrons and muons, and using events in the case of candidates [59].
Uncertainties associated with jets arise from the jet energy scale and resolution, and the efficiency to pass the JVT requirements. The largest contribution results from the jet energy scale, whose uncertainty dependence on jet and , jet flavour, and pile-up treatment, is split into 21 uncorrelated components that are treated independently [48].
Uncertainties associated with energy scales and resolutions of leptons and jets are propagated to . Additional uncertainties originating from the modelling of the underlying event, in particular its impact on the scale and resolution of unclustered energy, are negligible.
Efficiencies to tag -jets and -jets in the simulation are corrected to match the efficiencies in data by -dependent factors, whereas the light-jet efficiency is scaled by - and -dependent factors. The -jet efficiency is measured in a data sample enriched in events [108], while the -jet efficiency is measured using events [109] or +-jet events [53]. The light-jet efficiency is measured in a multijet data sample enriched in light-flavour jets [110]. Since the sample used to measure the -jet tagging efficiency overlaps with the analysis sample, the search uses instead the +-jet scale factors. In the case of the () search, the uncertainties in these scale factors include a total of 6 independent sources affecting -jets, 1 (2) source(s) affecting -jets, and 17 sources affecting light-jets. These systematic uncertainties are taken as uncorrelated between -jets, -jets, and light-jets. An additional uncertainty is included due to the extrapolation of these corrections to jets with beyond the kinematic reach of the data calibration samples used ( for - and -jets, and for light-jets); it is taken to be correlated among the three jet flavours. Since the fraction of signal and background in this kinematic regime is very small, these uncertainties have a negligible impact in the analyses. Finally, an uncertainty related to the application of -jet scale factors to -jets is considered, which also has a negligible impact.
8.3 Background modelling
A number of sources of systematic uncertainty affecting the modelling of +jets are considered. An uncertainty of 6% is assigned to the inclusive production cross section [71], including contributions from varying the factorisation and renormalisation scales, as well as from the top-quark mass, the PDF and . The latter two represent the largest contribution to the overall theoretical uncertainty in the cross section and were calculated using the PDF4LHC prescription [111] with the MSTW 2008 68% CL NNLO, CT10 NNLO [82, 112] and NNPDF2.3 5F FFN [65] PDF sets. The uncertainty associated with the choice of NLO generator is derived by comparing the nominal prediction from Powheg-Box+Pythia 8 with a prediction from Sherpa 2.2.1. For the latter, the matrix-element calculation is performed for up to two partons at NLO and up to four partons at LO using Comix and OpenLoops, and merged with the Sherpa parton shower using the ME+PS@NLO prescription. The uncertainty due to the choice of parton shower and hadronisation (PS & Had) model is derived by comparing the predictions from Powheg-Box interfaced either to Pythia 8 or Herwig 7. The latter uses the MMHT2014 LO [113] PDF set in combination with the H7UE tune [114]. The uncertainty in the modelling of additional radiation is assessed with two alternative Powheg-Box+Pythia 8 samples: a sample with increased radiation (referred to as radHi) is obtained by decreasing the renormalisation and factorisation scales by a factor of two, doubling the parameter, and using the Var3c upward variation of the A14 parameter set; a sample with decreased radiation (referred to as radLow) is obtained by increasing the scales by a factor of two and using the Var3c downward variation of the A14 set [115].
In the case of the search, where the +HF background plays a prominent role (see Fig. 1), a more detailed treatment of its associated systematic uncertainties is used. In particular, since several analysis regions have a sufficiently large number of +1 background events, its normalisation is determined in the fit to data. In the case of the +1 normalisation, an uncertainty of 50% is assumed, as the fit to the data is unable to precisely determine it, and the analysis has very limited sensitivity to this uncertainty. Since the diagrams that contribute to +light-jets, +1, and +1 production are different, all above uncertainties in +jets background modelling (NLO generator, PS & Had, and radHi/radLow), except the uncertainty of the inclusive cross section, are considered to be uncorrelated among these processes. Additional uncertainties of the +1 background are considered associated with the NLO prediction from SherpaOL, which is used for reweighting the nominal Powheg-Box+Pythia 8 prediction. These include three different scale variations, a different shower-recoil model scheme, and two alternative PDF sets (MSTW 2008 NLO and NNPDF2.3 NLO). Additional uncertainties are assessed for the contributions to the +1 background originating from multiple parton interactions. Finally, an additional uncertainty is assigned to the +1 background by comparing the predictions from Powheg-Box+Pythia 8 and SherpaOL 4F (5F vs 4F). In the derivation of the above uncertainties, the overall normalisations of the +1 and +1 backgrounds at the particle level are fixed to the nominal prediction. In order to maintain the inclusive cross section, the normalisation of the +light-jets background at the particle level is adjusted accordingly.
Uncertainties affecting the normalisation of the +jets background are estimated for the sum of +jets and +jets, and separately for +light-jets, +1+jets, and +1+jets subprocesses. The total normalisation uncertainty of +jets processes is estimated by comparing the data and total background prediction in the different analysis regions considered, but requiring exactly zero -tagged jets. Agreement between data and predicted background in these modified regions, which are dominated by +light-jets, is found to be within approximately 30%. This bound is taken to be the normalisation uncertainty, correlated across all +jets subprocesses. Since Sherpa 2.2 has been found to underestimate +heavy-flavour production by about a factor of 1.3 [116], additional 30% normalisation uncertainties are assumed for +1+jets and +1+jets subprocesses, considered uncorrelated between them.
Uncertainties affecting the modelling of the single-top-quark background include a / uncertainty of the total cross section estimated as a weighted average of the theoretical uncertainties in -, - and -channel production [88, 89, 90]. Additional uncertainties associated with the modelling of additional radiation are assessed by comparing the nominal samples with alternative samples where generator parameters are varied. For the - and -channel processes, an uncertainty due to the choice of parton shower and hadronisation model is derived by comparing events produced by Powheg-Box interfaced to Pythia 6 or Herwig++. These uncertainties are treated as fully correlated among single-top-quark production processes, but uncorrelated with the corresponding uncertainty of the +jets background. An additional systematic uncertainty in -channel production concerning the separation between and at NLO is assessed by comparing the nominal sample, which uses the diagram removal scheme [117], with an alternative sample using the diagram subtraction scheme [117].
Uncertainties of the diboson background normalisation include 5% from the NLO theory cross sections [118, 119], as well as an additional 24% normalisation uncertainty added in quadrature for each additional inclusive jet-multiplicity bin, based on a comparison among different algorithms for merging LO matrix elements and parton showers [120] (it is assumed that two jets originate from the decay, as in ). Therefore, the total normalisation uncertainty is , where is the selected jet multiplicity, resulting in 34%, 42%, and 48%, for events with exactly 4 jets, exactly 5 jets, and 6 jets, respectively. Recent comparisons between data and Sherpa 2.1.1 for 4 jets show agreement within the experimental uncertainty of approximately 40% [121], which further justifies the above uncertainty. Given the very small contribution of this background to the total prediction, the final result is not affected by the assumed modelling uncertainties.
Uncertainties of the and cross sections are 15% and /, respectively, from the uncertainties of their respective NLO theoretical cross sections [122, 123, 96].
Uncertainties of the data-driven multijet background in the search include contributions from the limited size of the data sample, particularly at high jet and -tag multiplicities, as well as from the uncertainty in the rate of fake leptons, estimated in different control regions (e.g. selected with an upper requirement on either or ). A combined normalisation uncertainty of 50% due to all these effects is assigned, which is taken as correlated across jet and -tag multiplicity bins, but uncorrelated between electron and muon channels. No explicit shape uncertainty is assigned since the large statistical uncertainties associated with the multijet background prediction, which are uncorrelated between bins in the final discriminant distribution, effectively cover all possible shape uncertainties.
Uncertainties of the data-driven fake background in the search are obtained by using additional signal-depleted regions. The construction is similar to that of the SRs and corresponding CRs discussed in Section 5.2, but employing further loosened identification criteria, and thus referred to as “loose SR” and “loose CR”. In each loose SR, after subtracting the small simulation-predicted contribution from real candidates, the relative difference in the shape of the distribution between the remaining data and the fake background estimate based on its associated loose CR is assigned as an uncertainty of the prediction in the nominal SR. In addition, a 30% uncertainty is applied to the fraction of events with a fake candidate from the simulation that are added to the fake template in the channel as part of the fake background estimation procedure. This uncertainty, associated with the modelling of the fake rate by the simulation, is estimated by comparing data and simulation in a sample enriched in dilepton events plus a fake candidate. The same uncertainty is assigned to the selected signal events with fake candidates. In addition, a systematic uncertainty is assigned to account for the different fractional composition of particles (various types of leptons and partons) producing the fake candidates between each SR and its corresponding CR in the simulation. Finally, the normalisation of the fake background in each SR is determined in the fit to data.
8.4 Signal modelling
Several normalisation and shape uncertainties are taken into account for the signal. The uncertainty of the cross section also applies to the signal and is taken to be the same as, and fully correlated with, the uncertainty assigned to the background. Uncertainties of the Higgs boson branching ratios are taken into account following the recommendation in Ref. [96]. Additional uncertainties associated with the modelling of additional radiation, with the choice of NLO generator, and with the choice of parton shower and hadronisation model, are estimated from the comparison of the nominal and alternative background samples (discussed in Section 8.3) and applied to signal. These modelling uncertainties are taken to be uncorrelated with those affecting the background.
9 Statistical analysis
For each search, the final discriminant distributions across all analysis regions considered are jointly analysed to test for the presence of a signal. The statistical analysis uses a binned likelihood function constructed as a product of Poisson probability terms over all bins considered in the search. This function depends on the signal-strength parameter , defined as a factor multiplying the expected yield of signal events normalised to a reference branching ratio , and , a set of nuisance parameters that encode the effect of systematic uncertainties on the signal and background expectations. Therefore, the expected total number of events in a given bin depends on and . All nuisance parameters are subject to Gaussian or log-normal constraints in the likelihood, with the exception of a few parameters that control the normalisation of some background components (e.g. the +1 background in the case of the search), which are treated as free parameters in the fit.
For a given value of , the nuisance parameters allow variations of the expectations for signal and background according to the corresponding systematic uncertainties, and their fitted values result in the deviations from the nominal expectations that globally provide the best fit to the data. This procedure allows a reduction of the impact of systematic uncertainties on the search sensitivity by taking advantage of the highly populated background-dominated bins included in the likelihood fit. Statistical uncertainties in each bin of the predicted final discriminant distributions are taken into account by dedicated parameters in the fit. The best-fit is obtained by performing a binned likelihood fit to the data under the signal-plus-background hypothesis, maximising the likelihood function over and .
The fitting procedure was initially validated through extensive studies using mock data, defined as the sum of all predicted backgrounds plus an injected signal of variable strength, as well as by performing fits to real data where bins of the final discriminant variable with a signal contamination above 5% are excluded (referred to as blinding requirements). In both cases, the robustness of the model for systematic uncertainties is established by verifying the stability of the fitted background when varying assumptions about some of the leading sources of uncertainty. After this, the blinding requirements are removed in the data and a fit under the signal-plus-background hypothesis is performed. Further checks involve the comparison of the fitted nuisance parameters before and after removal of the blinding requirements, and their values are found to be consistent. In addition, it is verified that the fit is able to correctly determine the strength of a simulated signal injected into the real data.
The test statistic is defined as the profile likelihood ratio, , where and are the values of the parameters that maximise the likelihood function (subject to the constraint ), and are the values of the nuisance parameters that maximise the likelihood function for a given value of . The test statistic is evaluated with the RooFit package [124, 125]. A related statistic is used to determine whether the observed data is compatible with the background-only hypothesis (the so-called discovery test) by setting in the profile likelihood ratio and leaving unconstrained: . The -value (referred to as ), representing the level of agreement between the data and the background-only hypothesis, is estimated by integrating the distribution of based on the asymptotic formulae in Ref. [126], above the observed value of in the data. Upper limits on , and thus on , are derived by using in the CL method [127, 128]. For a given signal scenario, values of the yielding CL, where CL is computed using the asymptotic approximation [126], are excluded at CL.
10 Results
This section presents the results obtained from the individual searches for , as well as their combination, following the statistical analysis discussed in Section 9.
10.1 search
A binned likelihood fit under the signal-plus-background hypothesis is performed on the LH discriminant distributions in the nine analysis regions considered. In the regions with exactly three -tagged jets, which have the highest sensitivity, the full LH distribution is used with ten equal-width bins. In contrast, in the regions with at least four -tagged jets, which have a limited number of data events and a small signal fraction, only two equal-width bins are used. Finally, in the regions with exactly two -tagged jets the total event yield after requiring the LH discriminant to be above 0.6, is used. The unconstrained parameters of the fit are the signal strength and a global normalisation factor applied to the +1 background common to all analysis regions. Figures 7 and 8 show a comparison of the LH discriminant for data and prediction in the regions with exactly three and at least four -tagged jets, both before and after performing the fit to data, in the case of the search. Tables summarising the pre-fit and post-fit yields can be found in Appendix A.
The best-fit branching ratio obtained is , assuming . A similar fit is performed for the search, yielding , assuming . The total uncertainties of the measured branching ratios are dominated by systematic uncertainties.
The large number of events in the analysis regions considered, together with their different background compositions, allows the fit to place constraints on the combined effect of several sources of systematic uncertainty. As a result, an improved background prediction is obtained with a significantly reduced uncertainty, not only in the signal-depleted regions, but also in the most sensitive analysis regions for this search, (4j, 3b) and (5j, 3b). The regions with two -tagged jets are used to constrain the leading uncertainties affecting the +light-jets background prediction, while the channels with at least four -tagged jets are sensitive to the uncertainties affecting the +HF background prediction. In particular, one of the main corrections applied by the fit is an increase of the normalisation by a factor of relative to the nominal prediction by adjusting the corresponding nuisance parameter. The normalisation is also increased, by a factor of . These corrections are in agreement with those found in Ref. [77]. Additionally, a few nuisance parameters are adjusted by the fit, with the largest effects corresponding to the leading nuisance parameters related to the -tagging and -tagging calibrations (by about 0.8 standard deviations), and those related to and modelling, which are based on a comparison with alternative generators (by 0.5 standard deviations or less). The leading uncertainties affecting the signal extraction by the fit are related to the -tagging calibration ( ), followed by the +light-jets PS & Had uncertainty ( ). Smaller contributions ( 0.5– each) result from the uncertainties associated with the 5F vs 4F comparison, the dependence of jet energy scale on the jet flavour, the uncertainty of the normalisation, and the limited size of the simulated samples in some of the bins with the highest signal-to-background ratio. The uncertainty most strongly constrained by the fit is that related to the -tagging calibration. It is reduced by about a factor of two of its value as originally determined in +-jet events [53]. This is possible because the fit exploits the large number of events with two and three -tagged jets to effectively perform a -tagging calibration, whose results are found to be consistent with those of Ref. [109]. Beyond the constraints on a few individual uncertainties, the significant reduction of the total background uncertainty achieved by the fit primarily derives from the anti-correlations found among systematic uncertainties from different sources.
In the absence of a significant excess of data events above the background expectation, 95% CL limits are set on and . The observed (expected) 95% CL upper limits on the branching ratios are and .
10.2 search
A binned likelihood fit under the signal-plus-background hypothesis is performed on the BDT discriminant distributions in the four analysis regions considered. The unconstrained parameters of the fit are the signal strength, and four independent parameters associated with the normalisation of the fake background in each of the analysis regions. No significant pulls or constraints are obtained for the fitted nuisance parameters, resulting in a post-fit background prediction in each analysis region that is very close to the pre-fit prediction, albeit with reduced uncertainties due to the anti-correlations among sources of systematic uncertainty resulting from the fit. Figure 9 shows a comparison of the data and prediction for the BDT discriminant distribution in the (, 3j) and (, 4j) regions, both pre- and post-fit to data, in the case of the search. A similar comparison for the (, 3j) and (, 4j) regions is shown in Figure 10. Tables summarising the pre-fit and post-fit yields can be found in Appendix B.
The best-fit branching ratio obtained is , assuming . The best-fit normalisation factors for the fake background are: in the (, 3j) region, in the (, 4j) region, in the (, 3j) region, and in the (, 4j) region. A similar fit is performed for the search, yielding , assuming . The obtained normalisation factors for the fake background agree within 1% with those obtained by the search. In both cases, the uncertainty of the measured branching ratio is dominated by the statistical uncertainty. The main contributions to the total systematic uncertainty arise from the fake background estimation and the uncertainty associated with the different responses to quark-initiated and gluon-initiated jets. No significant excess of data events above the background expectation is found, and observed (expected) 95% CL limits are set on and : and . These results are dominated by the channel, which has a sensitivity a factor of two better than that of the channel.
10.3 Combination of ATLAS searches
The and searches are combined with the ATLAS searches in diphoton [29] and multilepton [30] final states of events in the same data set, referred to as “ search” and “ search”, respectively. Since all searches, with the exception of the search, are dominated by the data statistical uncertainty, and in each search the dominant systematic uncertainties are different, the combined result is insensitive to the assumed correlations of systematic uncertainties across searches. Therefore, the only systematic uncertainties taken to be fully correlated among the four searches are those affecting the integrated luminosity, the cross section, signal modelling, a subset of the uncertainties on the Higgs boson branching ratios (those associated with uncertainties in and ), and a subset of jet-related uncertainties (jet energy resolution and JVT requirement). The rest of the jet-related uncertainties (jet energy scale and -tagging) are taken as fully correlated among the , , and searches, but uncorrelated with the search. The rest of the uncertainties, e.g. those related to leptons and to background modelling, are taken as uncorrelated among the four searches.
The first set of combined results is obtained for each branching ratio separately, setting the other branching ratio to zero. The best-fit combined branching ratios are and . A comparison of the best-fit branching ratios for the individual searches and their combination is shown in Figure 11 for and Figure 12 for . The observed (expected) 95% CL combined upper limits on the branching ratios are and . A summary of the upper limits on the branching ratios obtained by the individual searches, as well as their combination, is given in Table 3 and in Figures 13 and 14.
Upper limits on the branching ratios () can be translated into upper limits on the non-flavour-diagonal Yukawa couplings appearing in the Lagrangian [129]:
[TABLE]
The branching ratio is estimated as the ratio of its partial width [9] to the SM partial width [130], which is assumed to be dominant. Both predicted partial widths include next-to-leading-order QCD corrections. Using the expression derived in Ref. [26], the coupling can be extracted as . The coupling corresponds to the sum in quadrature of the couplings relative to the two possible chirality combinations of the quark fields, [129]. The observed (expected) upper limits on the couplings from the combination of the searches are and .
A similar set of results can be obtained by simultaneously varying both branching ratios in the likelihood function. Figure 15(a) shows the 95% CL upper limits on the branching ratios in the versus plane. The small differences between the limiting values (on the - and -axes) of the branching ratio limits obtained in the two-dimensional scan and those reported in Table 3, result from slightly different choices in the search regarding the final discriminant, which in the two-dimensional case should be common to both signals, and its binning. The corresponding upper limits on the couplings in the versus plane are shown in Figure 15(b).
11 Conclusion
A search for flavour-changing neutral-current decays of a top quark into an up-type quark () and the Standard Model Higgs boson, , is presented. The search is based on a dataset of collisions at recorded in 2015 and 2016 with the ATLAS detector at the CERN Large Hadron Collider and corresponding to an integrated luminosity of 36.1 fb*-1*. Two complementary analyses are performed to search for top-quark pair events in which one top quark decays into and the other top quark decays into , and target the and decay modes, respectively. The search selects events with one isolated electron or muon from the decay, and multiple jets, with several of them being identified with high purity as originating from the hadronisation of -quarks. The search selects events with either one or two hadronically decaying -lepton candidates, as well as multiple jets. Both searches employ multivariate techniques to discriminate between the signal and the background on the basis of their different kinematics. No significant excess of events above the background expectation is found, and 95% CL upper limits on the branching ratios are derived. In the case of the search, the observed (expected) 95% CL upper limits on the and branching ratios are and , respectively. In the case of the search, the observed (expected) 95% CL upper limits on the and branching ratios are and , respectively. The combination of these searches with ATLAS searches in diphoton and multilepton final states yields observed (expected) 95% CL upper limits on the and branching ratios of () and (), assuming and respectively. The corresponding combined observed (expected) upper limits on the and couplings are 0.064 (0.055) and 0.066 (0.055), respectively.
Acknowledgements
We thank CERN for the very successful operation of the LHC, as well as the support staff from our institutions without whom ATLAS could not be operated efficiently.
We acknowledge the support of ANPCyT, Argentina; YerPhI, Armenia; ARC, Australia; BMWFW and FWF, Austria; ANAS, Azerbaijan; SSTC, Belarus; CNPq and FAPESP, Brazil; NSERC, NRC and CFI, Canada; CERN; CONICYT, Chile; CAS, MOST and NSFC, China; COLCIENCIAS, Colombia; MSMT CR, MPO CR and VSC CR, Czech Republic; DNRF and DNSRC, Denmark; IN2P3-CNRS, CEA-DRF/IRFU, France; SRNSFG, Georgia; BMBF, HGF, and MPG, Germany; GSRT, Greece; RGC, Hong Kong SAR, China; ISF and Benoziyo Center, Israel; INFN, Italy; MEXT and JSPS, Japan; CNRST, Morocco; NWO, Netherlands; RCN, Norway; MNiSW and NCN, Poland; FCT, Portugal; MNE/IFA, Romania; MES of Russia and NRC KI, Russian Federation; JINR; MESTD, Serbia; MSSR, Slovakia; ARRS and MIZŠ, Slovenia; DST/NRF, South Africa; MINECO, Spain; SRC and Wallenberg Foundation, Sweden; SERI, SNSF and Cantons of Bern and Geneva, Switzerland; MOST, Taiwan; TAEK, Turkey; STFC, United Kingdom; DOE and NSF, United States of America. In addition, individual groups and members have received support from BCKDF, CANARIE, CRC and Compute Canada, Canada; COST, ERC, ERDF, Horizon 2020, and Marie Skłodowska-Curie Actions, European Union; Investissements d’ Avenir Labex and Idex, ANR, France; DFG and AvH Foundation, Germany; Herakleitos, Thales and Aristeia programmes co-financed by EU-ESF and the Greek NSRF, Greece; BSF-NSF and GIF, Israel; CERCA Programme Generalitat de Catalunya, Spain; The Royal Society and Leverhulme Trust, United Kingdom.
The crucial computing support from all WLCG partners is acknowledged gratefully, in particular from CERN, the ATLAS Tier-1 facilities at TRIUMF (Canada), NDGF (Denmark, Norway, Sweden), CC-IN2P3 (France), KIT/GridKA (Germany), INFN-CNAF (Italy), NL-T1 (Netherlands), PIC (Spain), ASGC (Taiwan), RAL (UK) and BNL (USA), the Tier-2 facilities worldwide and large non-WLCG resource providers. Major contributors of computing resources are listed in Ref. [131].
Appendix
Appendix A Pre-fit and post-fit event yields in the search
Table 4 presents the observed and predicted yields in each of the analysis regions for the search before the fit to data. Tables 5 and 6 present the observed and predicted yields in each of the analysis regions after the fit to the data under the signal-plus-background hypothesis, assuming and as signal, respectively.
Appendix B Pre-fit and post-fit event yields in the search
Table 7 presents the observed and predicted yields in each of the analysis regions for the search before the fit to data. Tables 8 and 9 present the observed and predicted yields in each of the analysis regions after the fit to the data under the signal-plus-background hypothesis, assuming and as signal, respectively.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] ATLAS Collaboration “Observation of a new particle in the search for the standard model Higgs boson with the ATLAS detector at the LHC” In Phys. Lett. B 716 , 2012, pp. 1 DOI: 10.1016/j.physletb.2012.08.020 · doi ↗
- 2[2] CMS Collaboration “Observation of a new boson at a mass of 125 Ge V with the CMS experiment at the LHC” In Phys. Lett. B 716 , 2012, pp. 30 DOI: 10.1016/j.physletb.2012.08.021 · doi ↗
- 3[3] ATLAS and CMS Collaborations “Combined Measurement of the Higgs boson Mass in p p 𝑝 𝑝 pp Collisions at s = 7 𝑠 7 \sqrt{s}=7 and 8 Te V with the ATLAS and CMS Experiments” In Phys. Rev. Lett. 114 , 2015, pp. 191803 DOI: 10.1103/Phys Rev Lett.114.191803 · doi ↗
- 4[4] K. Agashe “Snowmass 2013 Top quark working group report” In Proceedings, 2013 Community Summer Study on the Future of U.S. Particle Physics: Snowmass on the Mississippi (CSS 2013): Minneapolis, MN, USA, July 29-August 6, 2013 , 2013 ar Xiv: 1311.2028 [hep-ph]
- 5[5] S.. Glashow, J. Iliopoulos and L. Maiani “Weak Interactions with Lepton-Hadron Symmetry” In Phys. Rev. D 2 , 1970, pp. 1285 DOI: 10.1103/Phys Rev D.2.1285 · doi ↗
- 6[6] G. Eilam, J.. Hewett and A. Soni “Rare decays of the top quark in the standard and two Higgs doublet models” Erratum: Phys. Rev. D 59 (1998) 039901 In Phys. Rev. D 44 , 1991, pp. 1473 DOI: 10.1103/Phys Rev D.44.1473 · doi ↗
- 7[7] B. Mele, S. Petrarca and A. Soddu “A new evaluation of the t → c H → 𝑡 𝑐 𝐻 t\to c H decay width in the standard model” In Phys. Lett. B 435 , 1998, pp. 401 DOI: 10.1016/S 0370-2693(98)00822-3 · doi ↗
- 8[8] J.. Aguilar-Saavedra “Top flavor-changing neutral interactions: theoretical expectations and experimental detection” In Acta Phys. Polon. B 35 , 2004, pp. 2695 ar Xiv: hep-ph/0409342 [hep-ph]
