The Inverse Bagging Algorithm: Anomaly Detection by Inverse Bootstrap Aggregating
Pietro Vischia, Tommaso Dorigo

TL;DR
This paper introduces the Inverse Bagging Algorithm, a novel anomaly detection method that uses bootstrap aggregating to distinguish well-modeled processes from unknown ones without altering their distributions.
Contribution
It proposes a new bootstrap-based anomaly detection technique that preserves the kinematic distributions of well-understood processes and analyzes its asymptotic properties.
Findings
Effective in identifying anomalies in collider data
Comparable or superior to existing multivariate algorithms
Demonstrated on LHC-like data set
Abstract
For data sets populated by a very well modeled process and by another process of unknown probability density function (PDF), a desired feature when manipulating the fraction of the unknown process (either for enhancing it or suppressing it) consists in avoiding to modify the kinematic distributions of the well modeled one. A bootstrap technique is used to identify sub-samples rich in the well modeled process, and classify each event according to the frequency of it being part of such sub-samples. Comparisons with general MVA algorithms will be shown, as well as a study of the asymptotic properties of the method, making use of a public domain data set that models a typical search for new physics as performed at hadronic colliders such as the Large Hadron Collider (LHC).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
