TL;DR
This paper demonstrates that machine learning-based anomaly detection can effectively identify Upsilon decays in CMS open data, significantly improving signal significance over traditional methods and enabling practical discovery in collider experiments.
Contribution
The study introduces a novel ML-based anomaly detection approach for isolating Upsilon signals in collider data, surpassing traditional techniques and providing a benchmark dataset for future research.
Findings
Achieved a 6.4 sigma significance for Upsilon detection
Demonstrated ML-based methods outperform cut-and-count approaches
Provided a benchmark dataset for anomaly detection in collider data
Abstract
We present the first study of anti-isolated Upsilon decays to two muons () in proton-proton collisions at the Large Hadron Collider. Using a machine learning (ML)-based anomaly detection strategy, we "rediscover" the in 13 TeV CMS Open Data from 2016, despite overwhelming anti-isolated backgrounds. We elevate the signal significance to using these methods, starting from using the dimuon mass spectrum alone. Moreover, we demonstrate improved sensitivity from using an ML-based estimate of the multi-feature likelihood compared to traditional "cut-and-count" methods. Our work demonstrates that it is possible and practical to find real signals in experimental collider data using ML-based anomaly detection, and we distill a readily-accessible benchmark dataset from the CMS Open Data to facilitate future anomaly detection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
