Six Top Messages of New Physics at the LHC
Huayong Han, Li Huang, Teng Ma, Jing Shu, Tim M.P. Tait, and Yongcheng, Wu

TL;DR
This paper proposes that six top quark signatures at the LHC can serve as a sensitive probe for new physics, particularly top partners in a composite Higgs model, with potential discovery up to 2.5 TeV.
Contribution
It introduces a novel analysis strategy for detecting six top signatures as indicators of new physics at the LHC, focusing on boosted top tagging and related kinematic variables.
Findings
LHC with 3 ab$^{-1}$ can discover top partners up to 2.5 TeV
Six top signatures may show early discrepancies indicating new physics
Analysis based on $H_T$ and boosted object tagging is effective
Abstract
Six top signatures provide a novel probe of new physics. We discuss production of six top quarks as the decay products of a pair of top partners in the setting of a composite Higgs model, and argue that the six top signal may generically provide one of the first final states to show a discrepancy. We construct an analysis based on quantities such as and the numbers of jets which are tagged as boosted tops, s, or containing -tags, and show that the LHC with 3~ab can discover top partners with masses up to around 2.5 TeV in the six top signature.
| Channel | Branching Fraction | Event Fraction | SM Backgrounds |
| (Truth level) | Reconstructed (1.5 TeV) | ||
| 1 Lepton | 17.82% | 38.65% | -jets |
| 2 Opposite-Sign Leptons | 8.46% | 9.50% | |
| 2 Same-Sign Leptons | 5.36% | 6.51% | -jets |
| 3 Mixed-Sign Leptons | 5.64% | 3.67% | -jets |
| 3 Same-Sign Leptons | 0.60% | 0.71% |
| Channels | Process | Pre Cut [fb] | Cut I [fb] | Significance [] |
| 1- | signal | 20.47 | ||
| 2-os | signal | 17.99 | ||
| 3-ms | signal | 21.41 | ||
| 2-ss | signal | 30.65 | ||
| 3-ss | signal | / | 11.48 | |
| / | ||||
| / | ||||
| Total Significance: | 47.69 | |||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
aainstitutetext: Guizhou Key Laboratory in Physics and Related Areas, Guizhou University of Finance and Economics, Guiyang 550025, Chinabbinstitutetext: CAS Key Laboratory of Theoretical Physics, Institute of Theoretical Physics, Chinese Academy of Sciences, Beijing 100190, Chinaccinstitutetext: School of Physical Sciences, University of Chinese Academy of Sciences, Beijing 100049, P. R. Chinaddinstitutetext: Laboratory for Elementary Particle Physics, Cornell University, Ithaca, NY 14853, USAeeinstitutetext: CAS Center for Excellence in Particle Physics, Beijing 100049, Chinaffinstitutetext: Center for High Energy Physics, Peking University, Beijing 100871, Chinagginstitutetext: Department of Physics and Astronomy, University of California, Irvine, CA 92697 USAhhinstitutetext: Ottawa-Carleton Institute for Physics, Carleton University, 1125 Colonel By Drive, Ottawa, Ontario K1S 5B6, Canada
Six Top Messages of New Physics at the LHC
Huayong Han b,c,d
Li Huang b,d
Teng Ma b,c,e,f
Jing Shu g
Tim M.P. Tait h
and Yongcheng Wu
Abstract
Six top signatures provide a novel probe of new physics. We discuss production of six top quarks as the decay products of a pair of top partners in the setting of a composite Higgs model, and argue that the six top signal may generically provide one of the first final states to show a discrepancy. We construct an analysis based on quantities such as and the numbers of jets which are tagged as boosted tops, s, or containing -tags, and show that the LHC with 3 ab*-1* can discover top partners with masses up to around 2.5 TeV in the six top signature.
††preprint: UCI-HEP-TR-2018-23
1 Introduction
The Large Hadron Collider (LHC), with its unparalleled energy and high luminosity, will definitively explore the physics at the TeV scale. The discovery of Higgs boson at the LHC is a triumph of the Standard Model (SM), however, the Naturalness problem associated with the self-energy of the Higgs particle argues that it is likely that there is new physics around the TeV scale Feng:2013pwa ; Giudice:2013nak ; Altarelli:2013lla ; Farina:2013mla ; deGouvea:2014xba ; Csaki:2018hyw ; Chen:2017dwb . Various new physics models addressing this problem have been proposed, such as Supersymmetry (SUSY), little Higgs, Composite Higgs etc. Deep investigation of the naturalness problem may reveal new details underlying the physics of the electroweak symmetry breaking (EWSB) and could also provide the evidence of new physics.
Beside the Higgs, the top quark is central to arguments concerning naturalness, since it has the largest mass of the SM fermions, and hence the largest coupling to the Higgs. For this reason, partners of the top quark are ubiquitous in models of new physics at the weak scale, and their production often results in multi-top signatures at the LHC, leading to many interesting phenomena. The four top final state has been previously investigated Lillie:2007hd ; Pomarol:2008bh ; Chen:2008hh ; Kumar:2009vs ; Gregoire:2011ka and is starting to be visible in experimental analysis Sirunyan:2017roi ; Aaboud:2018jsj . However, even more tops in the final state naturally occur under simple assumptions and provides a spectacular collider signature and a complementary method to search for new physics.
In this paper, we systematically investigate the phenomenology of six-top final states in a simplified model inspired by a composite Higgs scenario. We estimate the sensitivity of the LHC to six-top final states for channels with different number of charged leptons, and the upper limit on the top partner branch ratio into are obtained in the case that no signal is observed with 3 ab*-1* of integrated luminosity. We also discuss the extraction of the top partner mass. It should be stressed that six-top final states occur in many other models of new physics, and our general analysis framework can be applied to those cases with simple adjustments.
The paper is organized as follows. In Section 2, we introduce a simplified composite Higgs model which inspires our analysis and in Section 3 discuss general features of the six top signature and current LHC constraints. The analysis strategy of LHC data are described in Section 4. We reserve Section 5 for our conclusions.
2 Six Tops from a General Composite Higgs Model
Generally, composite Higgs models with a simple UV completion (such as Csaki:2017jby ; Ryttov:2008xe ; Galloway:2010bp or the isomorphic coset space Gripaios:2009pe ; Frigerio:2012uc ; Serra:2017poj and Ma:2015gra ; Ma:2017vzm ; Cacciapaglia:2018avr ), contain a singlet scalar pseudo-Nambu-Goldstone boson (pNGB) field corresponding to a broken global symmetry. This pNGB can decay into di-bosons through Wess-Zumino-Witten (WZW) terms via fermion loops. In theories with partial compositeness, can also decay into fermion pairs through the elementary-composite mixing terms between the SM fermions and the composite top partners . Since the decay into dibosons are effectively at loop level, and the large top mass implies in such theories that the top partners predominantly mix with the SM top, generically decays into a top pair with very close to branch ratio (BR). The same large mixing generically implies that, provided the mass of the is not too large, the top partners themselves decay into and top with a significant BR. As a result, a single top partner typically undergoes the decay chain,
[TABLE]
and an event originating from pair production of the top partners results in a six top final state (see Figure 1 left panel):
[TABLE]
We work with an effective Lagrangian capturing the essential features of the interactions between top partners and . Requiring that the singlet renormalizably couples to the top and its partner, the vector-like top partners must either be electroweak singlets () or doublets () with hypercharge . In the first (singlet) case, the effective Lagrangian reads
[TABLE]
And the doublet case is described by
[TABLE]
Here is the SM Higgs doublet field, is the appropriate covariant derivative, and are the masses for top-partner and respectively, and are coupling constants. We work in the limit where the coupling is much larger than or the electroweak coupling, such that the top-partner decays are predominantly into top and with almost 100% BR, but is small enough that the width of the top partner remains relatively narrow. In this limit, the relevant parameters are the top partner and scalar masses, with mild dependence on the strength of the interactions. In the more general case where the top partners have appreciable decays into other channels, our results can be rescaled with the corresponding BR and continue to apply.
3 Top Partner Pair Production and Signatures
For modest mixing, the dominant top partner production mechanism at the LHC is production of a pair through the strong force of which the rate only depends on the partner mass and the strong coupling. The rate at the LHC operating at TeV as a function of the top partner mass is shown in the right panel of Figure 1.
As with other multi-top final states, it is convenient to classify six top final states based on the decay modes of the bosons. Leptonic decay modes allow for up to six very energetic charged leptons () in the final state. In Table. 1, we list the channels containing up to three isolated charged leptons along with their corresponding branching ratios and the primary SM backgrounds leading to topologies similar to a six top final state. Final states with four or more charged leptons are not considered, as the BR for these channels is highly suppressed. While several of these channels have previously been analyzed at the LHC Aad:2016tuk ; ATLAS-CONF-2016-013 ; Aaboud:2017dmy ; Aaboud:2018zeb ; Sirunyan:2017lae ; Khachatryan:2017qgo , the focus was on a different production mechanism, and thus not optimized to extract a six top final state. A six top final state also allows for the new, not previously analyzed, signatures such as three same-sign charged leptons.
In addition to channels with various numbers of leptons, there are several other generic features which commonly appear in the six top signature, including:
- •
Large (where the index runs over all visible final state particles), typically \mathrel{\mathchoice{\lower 2.5pt\vbox{\halign{\mathsurround 0pt\displaystyle\hfil#\hfil\cr>\crcr\sim\crcr}}}{\lower 2.5pt\vbox{\halign{\mathsurround 0pt\textstyle\hfil#\hfil\cr>\crcr\sim\crcr}}}{\lower 2.5pt\vbox{\halign{\mathsurround 0pt\scriptstyle\hfil#\hfil\cr>\crcr\sim\crcr}}}{\lower 2.5pt\vbox{\halign{\mathsurround 0pt\scriptscriptstyle\hfil#\hfil\cr>\crcr\sim\crcr}}}}2000 GeV for the range of under consideration;
- •
Boosted top jets which may appear as fat jets in the detector;
- •
High multiplicity of bottom-flavored and/or light jets.
3.1 Current Constraints
Most searches for top partners at the LHC have considered missing transverse momentum signatures (based on SUSY searches Aaboud:2017nfd ; Aaboud:2017ayj ; Aaboud:2017aeu ) which occur in theories in which the top partner is connected to a dark matter candidate. These searches exclude scalar top partners with masses up to GeV, depending on the mass of the dark matter candidate. We evaluate the constraints from visible signatures using CheckMATE Drees:2013wra , the results are shown in the - plane in the left panel of Figure 2. The most stringent constraints are coming from multi-lepton (red line) Aad:2016tuk and multi top quarks searches (green line) ATLAS-CONF-2016-013 . These constraints exclude cross section fb at the C.L. for TeV, corresponding to top partner masses up to nearly 1 TeV.
There is also the possibility to directly produce the from gluon fusion, which results in a final state whose invariant mass is resonantly enhanced at . In the right panel of Figure 2, we show the observational upper limit derived from 8 TeV LHC search for resonant top pair production Aad:2015fna on the -- coupling strength as a function of the mass. Note that, here we only present the constraints from 8 TeV analysis. New 13 TeV searches Aaboud:2018mjh will definitely improve the sensitivity. However, the detailed reanalysis of the 13 TeV result in our scheme is beyond our scope, we leave this for future works.
4 Identifying Six Top Events at the LHC
We divide our analysis into channels with 1, 2 or 3 isolated leptons (1, 2, 3-) in the final state. The 2- and 3-lepton channels are further divided according to the charges of the isolated leptons. Hence in total, we have five different channels: 1-lepton, 2 opposite sign leptons (2-os), 2 same sign leptons (2-ss), 3 mixed sign leptons (3-ms) and 3 same sign leptons (3-ss). These channels are by definition orthogonal to each other, such that a direct combination is straightforward.
4.1 Simulation and Event Reconstruction
We simulate signal and background events for the LHC running at TeV. Events are generated at the parton level via the MadGraph5 package Alwall:2014hca , using CTEQ6L parton distribution functions (PDFs) Nadolsky:2008zw . Resonances are decayed either via MadSpin Artoisenet:2012st for top quarks and bosons, or PYTHIA8 Sjostrand:2014zea for the top partners. Parton level events are then passed to PYTHIA8 for initial state radiation, showering and hadronization. The detector reconstruction is simulated by Delphes deFavereau:2013fsa using the default CMS configuration with modified lepton isolation and b-tagging efficiency (described below). Selection cuts are imposed through the ROOT framework via the PyROOT interface, with FastJet Cacciari:2011ma providing further jet reconstruction and clustering analysis.
The signal process is generated as for the set of top partner masses 1.0, 1.3, 1.5, 1.8, 2.0 and 2.5 TeV. As mentioned above, PYTHIA8 decays the top partners into top quarks via , with an assumed branching ratio. This process loses information regarding spin correlations, and thus we do not explore related observables in this analysis. For each choice of , we fix the singlet mass to be . While this choice is not general, our analysis does not rely on any selection related to this choice, and so we expect the derived efficiencies to be roughly independent of . However, the kinematic endpoints or produce unusually soft top quarks, which could impact the distribution of events containing top quarks or bosons reconstructing as fat jets. We minimize the impact by restricting ourselves to softer requirements on the corresponding variables, but it would be worthwhile to explore this region of parameter space in more detail.
The background processes are generated as:
- •
;
- •
;
- •
.
with a cut of imposed at the generator level to improve reconstruction efficiency. Even with this selection, we are computationally limited to processes with at most five final state particles, and restrict ourselves to sufficiently inclusive quantities in our analysis such that this limitation is unlikely to be important. We incorporate the possibility of “lepton charge flip” manually according to the prescription in Ref Aaboud:2017qph .
After the detector simulation, physics-level objects are reconstructed in both signal and background processes as:
- •
Leptons are required to be isolated according to the prescription in Ref. Khachatryan:2016yzq .
- •
Jets are reconstructed using the anti- algorithm Cacciari:2008gp with and GeV;
- •
Fat jets are reconstructed using anti- with and GeV;
- •
Jets are bottom-tagged according to the DeepFlavor performance shown in Ref. CMS-DP-2017-013 using the 70% tagging efficiency as the work point;
- •
Tops are tagged using a convolutional neural network (CNN) described in Appendix. A at the 50% benchmark operating point.
These reconstructed objects are fed into the selection described below to assess how well the signal may be extracted from the background. The distributions of , (number of fat jets), (number of top-tagged jets) and (number of b-tagged jets) from the SM background and the signal (with two choices of top partner mass, 1.5 TeV (red line) and 2.5 TeV (orange line)) are shown in Figure 3 for the 3 mixed sign leptons case. We can clearly see from this figure that of the signal process is usually larger than the background processes and will increase with the mass of the top partner, . The same behavior also appears in the distributions of and , as the more boosted jet is easier to be reconstructed as fat jets and further identified as top jets. The last distribution of is almost independent of , as it is almost controlled by the true number of the -jets in the events, and we model the -tagging efficiency as a constant (70% as described above) throughout the central region.
4.2 Event Selection and Sensitivity
We sort our events into five channels based on the number (and charge) of the leptons they contain as described above. The event fractions for each channel considering the detector effects are also listed in the third column of Table. 1. Note that we also include 1% lepton fake rate from jets which results in more leptons than expected just from the branch fraction due to the large multiplicity of jets in the events. For channels with two or more leptons, we eliminate GeV to reduce background from the pole. At this Pre-Cut selection level, we also require .
After the Pre-Cuts, for TeV, the signal of 1- and 2- channel is typically 10-100 times smaller than the sum of the backgrounds, while other channels have similar with or even larger signal than the backgrounds. We further optimize the significance of the top partner signal by considering following kinematic variables (Cut I):
- •
The number of fat jets ;
- •
The number of top tagged fat jets ;
- •
The number of -tagged jets .
It is likely that the number of untagged jets, is also a useful discriminant. However, the simulations are limited to five final state particles, may not be modeled well in our simulations, and we do not consider it here. Including this with sophisticated analysis will improve the sensitivity. For each channel, the cross section of the signal (for TeV) and corresponding backgrounds after each set of cuts, and the statistical significance of that channel (assuming 3 ab*-1* of integrated luminosity) are summarized in Table 2. We find that the single best channel is the one demanding two same sign charged leptons, which balances rate against standing out from the background.
For each value of , we repeat this procedure for the same set of cuts. In each case, assuming that the top partners are pair produced exclusively through the strong force, the sensitivity maps into a bound on the branching ratio for . In Figure 4, we show the limit on this branching ratio as a function of from 1000 GeV to 2500 GeV. As approaches GeV, the upper limit on the branching ratio approaches 1, implying that higher masses will only be accessible if there is an additional mechanism responsible for producing beyond the strong interaction.
4.3 Reconstructing
In the case that an excess is detected, it would be desirable to reconstruct the origin of the signal from top partner pair production, and determine the mass. Direct reconstruction as an invariant mass is challenging, since the leptonic top decays produce undetectable neutrino which results in missing momentum, and the decay products of six top quarks result in a large combinatoric confusion.
In order to improve the sensitivity to the mass, another CNN is trained to predict the probability that a set of events originate from a particular value of . This CNN has similar structure as the one explained in Appendix. A. However, instead of the data associated with one particular jet, the whole distribution in the calorimeter for the event after converting into “tensor image” is used as the input of the CNN. Using the whole distribution in one event actually captures following two features:
- •
The distribution, the sum of the of all visible particles, which increases with ;
- •
The dispersion, which describes the distribution in the whole space, which decreases with .
We show the output distribution for the 1.5 TeV classifier when fed simulated events with a variety of values of in the left panel of Figure 5. For simplicity, we neglect the background in this assessment; while this is not a good approximation for all of the channels, it well approximates the channels with the largest sensitivity (such as 2-ss). We leave a more realistic analysis for future work.
Based on the distributions shown in the left panel of Figure 5, a binned likelihood is constructed and its negative log-likelihood is shown in the middle panel of Figure 5. Also for comparison, the result corresponding to the distribution alone is also presented, illustrating the increase in sensitivity achieved by the CNN. A more detailed analysis for 1.5 TeV case is shown in the right panel of Figure 5, and an GeV determination of the top partner mass can be achieved.
5 Conclusions
Events containing six top quarks are within grasp of the LHC Run 3, and provide a fascinating laboratory to search for physics beyond the Standard Model. We have explored a simplified model which arises as the low energy limit of compelling theories of a composite Higgs, and in which top partners decay into three top quarks with a large branching ratio. We have constructed inclusive observables which are able to tease the signal out of the otherwise large Standard Model background, and find that top partner masses up to around 2.5 TeV are accessible with ab*-1* as can be seen from Figure 4.
Further, the distribution of the final state particles also provides information about the mass of the top partner. A CNN-based method is used to investigate how well one can determine the top partner mass, with the whole distribution over the calorimeter used as the input to the CNN. As shown in Figure 5, around 1.5 TeV, an GeV determination of the mass can be achieved.
Acknowledgements.
Y.W. is supported by the Natural Sciences and Engineering Research Council of Canada (NSERC). T.M.P.T. is supported in part by the US National Science Foundation through NSF Grant No. PHY-1620638. J.S. is supported by the National Natural Science Foundation of China (NSFC) under grant No.11647601, No.11690022, No.11851302, No.11675243 and No.11761141011 and also supported by the Strategic Priority Research Program of the Chinese Academy of Sciences under grant No.XDB21010200 and No.XDB23000000. H.H. is supported by NSFC under grant No. 11847151. T.M. is supported in part by project Y6Y2581B11 supported by 2016 National Postdoctoral Program for Innovative Talents. The simulations for this work were done in part at the HPC Cluster of ITP-CAS.
Appendix A Boosted Jet Tagging
Our jet classification is based on a Convolutional Neutral Network (CNN) which combines calorimeter and tracker information for each fat jet to assign the probabilities that the jet originates from a top, boson or light parton. For recent work on related strategies, see Refs. Plehn:2011tg ; Kasieczka:2017nvn ; Butter:2017cot ; Dasgupta:2018emf ; Macaluso:2018tck ; Csaki:2018hyw .
The training and testing samples are generated through the same procedure as for the signal and background events, simulating the processes with and . After reconstructing the fat jets using the anti- algorithm with and GeV, each of them is converted into a “tensor image”. A square region in the plane of size is constructed centered at the center of the jet and divided into equal-sized pixels. Each pixel records the total incident and the multiplicities of both the track and tower classes (from Delphes). This results in a four channel image with dimensions .
The tensor image serves as the input to the CNN constructed using the PyTorch framework. The CNN consists of the following elements:
- •
Four convolutional layers with a Rectified Linear Unit (ReLU) activation function;
- •
Two max-pooling layers;
- •
Classification block layers, including two linear layers with a dropout of probability and ReLU activation function;
- •
Final linear layer classifying the jet images into different categories.
In each sample, jets are divided into three bins according to their : 200 GeV 400 GeV, 400 GeV 800 GeV and 800 GeV, and the CNN is trained separately for each bin. The tagging performance is characterized by the Receiver Operating Characteristic (ROC) curve. For each pair of jet classes and (tagging against ), the ROC curve (see Figure 6) shows the “tagging efficiency” (the probability of correctly tagging the jet of class as ) on the horizontal axis, and 1-“mistagging rate” (the probability of incorrectly tagging jet of class as ) on the vertical axis.
In Figure 6, the left panels show the ROC curves for tagging a top quark against a -boson and a light jet, while the right panels are the ROC curves for tagging a boson against a top quark and a light jet. The top, middle and bottom panels correspond to the bins: [200,400] GeV, [400,800] GeV and [800,] GeV, respectively. As expected, higher tops and s are identified much more efficiently. Two benchmark working points corresponding to 50% and 80% efficiency for top tagging are marked on each curve in Figure 6, and the corresponding mistagging rates are listed in the legend of each panel. In practice, the 50% working point is used to tag the top jets.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) J. L. Feng, Naturalness and the Status of Supersymmetry , Ann. Rev. Nucl. Part. Sci. 63 (2013) 351 [ 1302.6587 ]. · doi ↗
- 2(2) G. F. Giudice, Naturalness after LHC 8 , Po S EPS-HEP 2013 (2013) 163 [ 1307.7879 ].
- 3(3) G. Altarelli, The Higgs: so simple yet so unnatural , Phys. Scripta T 158 (2013) 014011 [ 1308.0545 ]. · doi ↗
- 4(4) M. Farina, D. Pappadopulo and A. Strumia, A modified naturalness principle and its experimental tests , JHEP 08 (2013) 022 [ 1303.7244 ]. · doi ↗
- 5(5) A. de Gouvea, D. Hernandez and T. M. P. Tait, Criteria for Natural Hierarchies , Phys. Rev. D 89 (2014) 115005 [ 1402.2658 ]. · doi ↗
- 6(6) C. Csáki, F. Ferreira De Freitas, L. Huang, T. Ma, M. Perelstein and J. Shu, Naturalness Sum Rules and Their Collider Tests , 1811.01961 .
- 7(7) C.-R. Chen, J. Hajer, T. Liu, I. Low and H. Zhang, Testing naturalness at 100 Te V , JHEP 09 (2017) 129 [ 1705.07743 ]. · doi ↗
- 8(8) B. Lillie, J. Shu and T. M. P. Tait, Top Compositeness at the Tevatron and LHC , JHEP 04 (2008) 087 [ 0712.3057 ]. · doi ↗
