Bias and Priors in Machine Learning Calibrations for High Energy Physics

Rikab Gambhir; Benjamin Nachman; and Jesse Thaler

arXiv:2205.05084·hep-ph·September 2, 2022

Bias and Priors in Machine Learning Calibrations for High Energy Physics

Rikab Gambhir, Benjamin Nachman, and Jesse Thaler

PDF

1 Repo

TL;DR

This paper investigates the prior dependence in machine learning calibration methods for high-energy physics, highlighting biases introduced by training sample spectra and proposing a Gaussian Ansatz approach to mitigate these issues.

Contribution

The paper explicitly analyzes prior dependence in ML calibration strategies and introduces a Gaussian Ansatz method to reduce biases in simulation-based calibration.

Findings

01

Simulation-based calibration can inherit training sample biases.

02

Gaussian Ansatz approach reduces prior dependence.

03

Data-based calibration remains an open challenge.

Abstract

Machine learning offers an exciting opportunity to improve the calibration of nearly all reconstructed objects in high-energy physics detectors. However, machine learning approaches often depend on the spectra of examples used during training, an issue known as prior dependence. This is an undesirable property of a calibration, which needs to be applicable in a variety of environments. The purpose of this paper is to explicitly highlight the prior dependence of some machine learning-based calibration strategies. We demonstrate how some recent proposals for both simulation-based and data-based calibrations inherit properties of the sample used for training, which can result in biases for downstream analyses. In the case of simulation-based calibration, we argue that our recently proposed Gaussian Ansatz approach can avoid some of the pitfalls of prior dependence, whereas…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hep-lbdl/calibrationpriors
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.