Boosted decision trees in the era of new physics: a smuon analysis case study
Alan S. Cornell, Wesley Doorsamy, Benjamin Fuks, Gerhard Harmsen, Lara, Mason

TL;DR
This paper explores the application of gradient boosting machine learning techniques to particle physics, specifically a smuon analysis, demonstrating improvements over traditional methods and proposing new metrics and approaches for feature selection and model generalization.
Contribution
It introduces the use of gradient boosting in particle physics analysis, compares it with traditional methods, and proposes novel metrics and feature analysis techniques for better model performance.
Findings
Machine learning extends exclusion limits beyond traditional methods.
F-score can be a useful alternative metric in imbalanced datasets.
Feature analysis methods improve understanding of model decisions.
Abstract
Machine learning algorithms are growing increasingly popular in particle physics analyses, where they are used for their ability to solve difficult classification and regression problems. While the tools are very powerful, they may often be under- or mis-utilised. In the following, we investigate the use of gradient boosting techniques as applicable to a generic particle physics problem. We use as an example a Beyond the Standard Model smuon collider analysis which applies to both current and future hadron colliders, and we compare our results to a traditional cut-and-count approach. In particular, we interrogate the use of metrics in imbalanced datasets which are characteristic of high energy physics problems, offering an alternative to the widely used area under the curve (auc) metric through a novel use of the F-score metric. We present an in-depth comparison of feature selection and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
