The Inverse Bagging Algorithm: Anomaly Detection by Inverse Bootstrap   Aggregating

Pietro Vischia; Tommaso Dorigo

arXiv:1611.08256·stat.ML·April 5, 2017

The Inverse Bagging Algorithm: Anomaly Detection by Inverse Bootstrap Aggregating

Pietro Vischia, Tommaso Dorigo

PDF

TL;DR

This paper introduces the Inverse Bagging Algorithm, a novel anomaly detection method that uses bootstrap aggregating to distinguish well-modeled processes from unknown ones without altering their distributions.

Contribution

It proposes a new bootstrap-based anomaly detection technique that preserves the kinematic distributions of well-understood processes and analyzes its asymptotic properties.

Findings

01

Effective in identifying anomalies in collider data

02

Comparable or superior to existing multivariate algorithms

03

Demonstrated on LHC-like data set

Abstract

For data sets populated by a very well modeled process and by another process of unknown probability density function (PDF), a desired feature when manipulating the fraction of the unknown process (either for enhancing it or suppressing it) consists in avoiding to modify the kinematic distributions of the well modeled one. A bootstrap technique is used to identify sub-samples rich in the well modeled process, and classify each event according to the frequency of it being part of such sub-samples. Comparisons with general MVA algorithms will be shown, as well as a study of the asymptotic properties of the method, making use of a public domain data set that models a typical search for new physics as performed at hadronic colliders such as the Large Hadron Collider (LHC).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.