How to use Machine Learning to improve the discrimination between signal and background at particle colliders
Xabier Cid Vidal, Lorena Dieste Maro\~nas, \'Alvaro D\'osil, Su\'arez

TL;DR
This paper evaluates various machine learning algorithms and libraries for improving signal-background discrimination in particle collider experiments, providing practical guidelines for analysts to optimize their use cases.
Contribution
It compares the performance of multiple ML algorithms and libraries, including BDTs and Neural Networks, in simulated LHC data, offering new insights and recommendations.
Findings
Neural Networks outperform Boosted Decision Trees in certain scenarios.
Toolkit for Multivariate Data Analysis with ROOT is effective for quick assessments.
Guidelines are provided for choosing algorithms based on data features and size.
Abstract
The popularity of Machine Learning (ML) has been increasing in the last decades in almost every area, being the commercial and scientific fields the most notorious ones. Concerning particle physics, ML has been proved as a useful resource to make the most of projects such as the Large Hadron Collider (LHC). The main advantage provided by ML is reducing the time and effort put into the measurements done by experiments, while improving the performance. With this work we aim to encourage scientists at particle colliders to use ML and to try the different alternatives we have available nowadays, focusing in the separation between signal and background. We assess some of the most used libraries in the field, like Toolkit for Multivariate Data Analysis with ROOT, and also newer and more sophisticated options like PyTorch and Keras. We also check how optimal are some of the most common…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
