Extracting more from boosted decision trees: A high energy physics case study
Vidhi Lalchand

TL;DR
This paper introduces a novel algorithm that enhances boosted decision trees for particle identification at the LHC by reducing overfitting, achieving performance comparable to deep neural networks on a benchmark dataset.
Contribution
The work presents a new meta-learning based method combining boosting and bagging to improve BDT performance in high energy physics classification tasks.
Findings
Achieves near state-of-the-art scores on the ATLAS Higgs to tau-tau dataset.
Outperforms traditional BDTs by reducing overfitting.
Comparable results to deep neural network ensembles.
Abstract
Particle identification is one of the core tasks in the data analysis pipeline at the Large Hadron Collider (LHC). Statistically, this entails the identification of rare signal events buried in immense backgrounds that mimic the properties of the former. In machine learning parlance, particle identification represents a classification problem characterized by overlapping and imbalanced classes. Boosted decision trees (BDTs) have had tremendous success in the particle identification domain but more recently have been overshadowed by deep learning (DNNs) approaches. This work proposes an algorithm to extract more out of standard boosted decision trees by targeting their main weakness, susceptibility to overfitting. This novel construction harnesses the meta-learning techniques of boosting and bagging simultaneously and performs remarkably well on the ATLAS Higgs (H) to tau-tau data set…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParticle physics theoretical and experimental studies · Particle Detector Development and Performance · Radiomics and Machine Learning in Medical Imaging
MethodsTest
