Anomalies, Representations, and Self-Supervision

Barry M. Dillon; Luigi Favaro; Friedrich Feiden; Tanmoy Modak; Tilman; Plehn

arXiv:2301.04660·hep-ph·October 29, 2024·5 cites

Anomalies, Representations, and Self-Supervision

Barry M. Dillon, Luigi Favaro, Friedrich Feiden, Tanmoy Modak, Tilman, Plehn

PDF

Open Access

TL;DR

This paper introduces AnomalyCLR, a self-supervised contrastive learning method using a Transformer Encoder for density-based anomaly detection in collider event data, showing significant performance improvements over baseline methods.

Contribution

The paper presents a novel self-supervised anomaly detection approach combining contrastive learning, data augmentation, and Transformer encoders for collider physics data.

Findings

01

Significant performance improvements over baseline methods.

02

Effective detection of non-Standard-Model events.

03

Model-agnostic and data-driven anomaly detection.

Abstract

We develop a self-supervised method for density-based anomaly detection using contrastive learning, and test it using event-level anomaly data from CMS ADC2021. The AnomalyCLR technique is data-driven and uses augmentations of the background data to mimic non-Standard-Model events in a model-agnostic way. It uses a permutation-invariant Transformer Encoder architecture to map the objects measured in a collider event to the representation space, where the data augmentations define a representation space which is sensitive to potential anomalous features. An AutoEncoder trained on background representations then computes anomaly scores for a variety of signals in the representation space. With AnomalyCLR we find significant improvements on performance metrics for all signals when compared to the raw data baseline.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Seismology and Earthquake Studies · COVID-19 diagnosis using AI

MethodsMulti-Head Attention · Attention Is All You Need · Test · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Linear Layer · Dropout · Softmax · Adam · Residual Connection