ROAST: Risk-aware Outlier-exposure for Adversarial Selective Training of Anomaly Detectors Against Evasion Attacks
Mohammed Elnawawy, Gargi Mitra, Shahrear Iqbal, Karthik Pattabiraman

TL;DR
ROAST is a risk-aware training framework that enhances anomaly detector recall against evasion attacks by selectively focusing on less vulnerable patients and injecting adversarial samples, while maintaining high precision.
Contribution
ROAST introduces a novel risk-aware outlier exposure framework that selectively trains on less vulnerable data, significantly improving recall and reducing training time against evasion attacks.
Findings
ROAST increases recall by 16.2% under black-box attacks.
ROAST improves recall by 5.89% under white-box attacks.
ROAST reduces training time by 88.3% compared to indiscriminate training.
Abstract
Safety-critical domains like healthcare rely on deep neural networks (DNNs) for prediction, yet DNNs remain vulnerable to evasion attacks. Anomaly detectors (ADs) are widely used to protect DNNs, but conventional ADs are trained indiscriminately on benign data from all patients, overlooking physiological differences that introduce noise, degrade robustness, and reduce recall. In this paper, we propose ROAST, a novel risk-aware outlier exposure (OE) selective training framework that improves AD recall while largely preserving precision. ROAST identifies patients who are less vulnerable to attack and focuses training on these cleaner, more reliable data, thereby reducing false negatives and improving recall. To preserve precision, the framework applies OE by injecting adversarial samples into the training set of the less vulnerable patients, avoiding noisy data from others. Experiments…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
