ROAST: Risk-aware Outlier-exposure for Adversarial Selective Training of Anomaly Detectors Against Evasion Attacks

Mohammed Elnawawy; Gargi Mitra; Shahrear Iqbal; Karthik Pattabiraman

arXiv:2603.26093·cs.CR·April 20, 2026

ROAST: Risk-aware Outlier-exposure for Adversarial Selective Training of Anomaly Detectors Against Evasion Attacks

Mohammed Elnawawy, Gargi Mitra, Shahrear Iqbal, Karthik Pattabiraman

PDF

TL;DR

ROAST is a risk-aware training framework that enhances anomaly detector recall against evasion attacks by selectively focusing on less vulnerable patients and injecting adversarial samples, while maintaining high precision.

Contribution

ROAST introduces a novel risk-aware outlier exposure framework that selectively trains on less vulnerable data, significantly improving recall and reducing training time against evasion attacks.

Findings

01

ROAST increases recall by 16.2% under black-box attacks.

02

ROAST improves recall by 5.89% under white-box attacks.

03

ROAST reduces training time by 88.3% compared to indiscriminate training.

Abstract

Safety-critical domains like healthcare rely on deep neural networks (DNNs) for prediction, yet DNNs remain vulnerable to evasion attacks. Anomaly detectors (ADs) are widely used to protect DNNs, but conventional ADs are trained indiscriminately on benign data from all patients, overlooking physiological differences that introduce noise, degrade robustness, and reduce recall. In this paper, we propose ROAST, a novel risk-aware outlier exposure (OE) selective training framework that improves AD recall while largely preserving precision. ROAST identifies patients who are less vulnerable to attack and focuses training on these cleaner, more reliable data, thereby reducing false negatives and improving recall. To preserve precision, the framework applies OE by injecting adversarial samples into the training set of the less vulnerable patients, avoiding noisy data from others. Experiments…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.