Optimal event selection and categorization in high energy physics, Part 1: Signal discovery
Konstantin K. Matchev, Prasanth Shyamsundar

TL;DR
This paper introduces a machine learning framework for optimizing event selection and categorization in high energy physics, enhancing signal detection while reducing systematic uncertainties.
Contribution
It presents a novel prescription for training ML-based event selectors that maximize significance and decorrelate from event variables, with an iterative learning approach linked to Lloyd's k-means.
Findings
Improves signal significance in HEP analyses.
Reduces systematic uncertainties by decorrelation.
Provides a publicly available Python implementation.
Abstract
We provide a prescription to train optimal machine-learning-based event selectors and categorizers that maximize the statistical significance of a potential signal excess in high energy physics (HEP) experiments, as quantified by any of six different performance measures. For analyses where the signal search is performed in the distribution of some event variables, our prescription ensures that only the information complementary to those event variables is used in event selection and categorization. This eliminates a major misalignment with the physics goals of the analysis (maximizing the significance of an excess) that exists in the training of typical ML-based event selectors and categorizers. In addition, this decorrelation of event selectors from the relevant event variables prevents the background distribution from becoming peaked in the signal region as a result of event…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Particle physics theoretical and experimental studies · Computational Physics and Python Applications
