Lecture notes on Machine Learning applications for global fits
Jorge Alda

TL;DR
This paper presents a comprehensive framework using modern Machine Learning surrogates for efficient global fits in high-energy physics, including techniques like Boosted Decision Trees, active learning, and interpretability tools, applied to ALP parameter exploration.
Contribution
It introduces a robust ML workflow for global fits, combining surrogate modeling, active learning, hyperparameter tuning, interpretability, and application to a real physics anomaly.
Findings
ML surrogates significantly reduce computational costs.
The workflow enables efficient exploration of ALP parameter space.
Application to Belle II anomaly demonstrates practical effectiveness.
Abstract
These lecture notes provide a comprehensive framework for performing global statistical fits in high-energy physics using modern Machine Learning (ML) surrogates. We begin by reviewing the statistical foundations of model building, including the likelihood function, Wilks' theorem, and profile likelihoods. Recognizing that the computational cost of evaluating model predictions often renders traditional minimization prohibitive, we introduce Boosted Decision Trees to approximate the log-likelihood function. The notes detail a robust ML workflow including efficient generation of training data with active learning and Gaussian processes, hyperparameter optimization, model compilation for speed-up, and interpretability through SHAP values to decode the influence of model parameters and interactions between parameters. We further discuss posterior distribution sampling using Markov Chain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
