The Dark Machines Anomaly Score Challenge: Benchmark Data and Model Independent Event Classification for the Large Hadron Collider
T. Aarrestad, M. van Beekveld, M. Bona, A. Boveia, S. Caron, J., Davies, A. De Simone, C. Doglioni, J.M. Duarte, A. Farbin, H. Gupta, L., Hendriks, L. Heinrich, J. Howarth, P. Jawahar, A. Jueid, J. Lastow, A., Leinweber, J. Mamuzic, E. Mer\'enyi, A. Morandini, P. Moskvitina

TL;DR
This paper presents a large benchmark dataset and evaluates various unsupervised machine learning algorithms for anomaly detection in LHC data, aiming to improve model-independent new physics searches.
Contribution
It introduces a comprehensive benchmark dataset of simulated LHC events and assesses multiple anomaly detection methods for the first time in this context.
Findings
Certain algorithms outperform others in anomaly detection accuracy.
The benchmark dataset enables standardized evaluation of unsupervised models.
Insights will guide future unsupervised searches for new physics at the LHC.
Abstract
We describe the outcome of a data challenge conducted as part of the Dark Machines Initiative and the Les Houches 2019 workshop on Physics at TeV colliders. The challenged aims at detecting signals of new physics at the LHC using unsupervised machine learning algorithms. First, we propose how an anomaly score could be implemented to define model-independent signal regions in LHC searches. We define and describe a large benchmark dataset, consisting of >1 Billion simulated LHC events corresponding to of proton-proton collisions at a center-of-mass energy of 13 TeV. We then review a wide range of anomaly detection and density estimation algorithms, developed in the context of the data challenge, and we measure their performance in a set of realistic analysis environments. We draw a number of useful conclusions that will aid the development of unsupervised new physics…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
