TL;DR
This paper enhances collider physics analysis by extending Bayesian density estimation to incorporate feature correlations, improving the extraction of signal and background distributions in top jet data.
Contribution
It introduces a correlation-aware Bayesian mixture model for unsupervised signal extraction, leveraging priors from simulators and correcting biases with data.
Findings
Inclusion of correlations improves signal-background discrimination.
Correlated models outperform uncorrelated ones in both supervised and unsupervised metrics.
Simulators provide valuable priors despite biases, aiding in distribution estimation.
Abstract
Improving the understanding of signal and background distributions in signal-region is a valuable key to enhance any analysis in collider physics. This is usually a difficult task because -- among others -- signal and backgrounds are hard to discriminate in signal-region, simulations may reach a limit of reliability if they need to model non-perturbative QCD, and distributions are multi-dimensional and many times may be correlated within each class. Bayesian density estimation is a technique that leverages prior knowledge and data correlations to effectively extract information from data in signal-region. In this work we extend previous works on data-driven mixture models for meaningful unsupervised signal extraction in collider physics to incorporate correlations between features. Using a standard dataset of top and QCD jets, we show how simulators, despite having an expected bias, can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
