How to Allocate your Label Budget? Choosing between Active Learning and   Learning to Reject in Anomaly Detection

Lorenzo Perini; Daniele Giannuzzi; Jesse Davis

arXiv:2301.02909·cs.LG·January 10, 2023·1 cites

How to Allocate your Label Budget? Choosing between Active Learning and Learning to Reject in Anomaly Detection

Lorenzo Perini, Daniele Giannuzzi, Jesse Davis

PDF

Open Access 1 Repo

TL;DR

This paper proposes a mixed strategy for allocating a limited label budget between active learning and learning to reject in anomaly detection, optimizing the use of labels to improve detection performance and user trust.

Contribution

It introduces a novel approach that dynamically allocates label budget between AL and LR based on a reward function, enhancing anomaly detection under limited labels.

Findings

01

Outperforms baseline strategies on 18 benchmark datasets

02

Effectively balances label allocation to improve detection accuracy

03

Increases user trust by optimizing rejection thresholds

Abstract

Anomaly detection attempts at finding examples that deviate from the expected behaviour. Usually, anomaly detection is tackled from an unsupervised perspective because anomalous labels are rare and difficult to acquire. However, the lack of labels makes the anomaly detector have high uncertainty in some regions, which usually results in poor predictive performance or low user trust in the predictions. One can reduce such uncertainty by collecting specific labels using Active Learning (AL), which targets examples close to the detector's decision boundary. Alternatively, one can increase the user trust by allowing the detector to abstain from making highly uncertain predictions, which is called Learning to Reject (LR). One way to do this is by thresholding the detector's uncertainty based on where its performance is low, which requires labels to be evaluated. Although both AL and LR need…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lorenzo-perini/ballad
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Machine Learning and Data Classification · Data Stream Mining Techniques