HALO: Robust Out-of-Distribution Detection via Joint Optimisation
Hugo Lyons Keenan, Sarah Erfani, Christopher Leckie

TL;DR
HALO is a new robust out-of-distribution detection method that improves performance and resistance to adversarial attacks by extending the TRADES framework with a novel objective and additional loss terms.
Contribution
The paper introduces HALO, a robust OOD detection approach that surpasses existing methods by balancing clean and adversarial robustness through a novel joint optimization framework.
Findings
HALO achieves state-of-the-art AUROC scores across multiple datasets.
HALO demonstrates strong resistance to transferred adversarial attacks.
HALO offers tunable performance and compatibility with existing frameworks.
Abstract
Effective out-of-distribution (OOD) detection is crucial for the safe deployment of machine learning models in real-world scenarios. However, recent work has shown that OOD detection methods are vulnerable to adversarial attacks, potentially leading to critical failures in high-stakes applications. This discovery has motivated work on robust OOD detection methods that are capable of maintaining performance under various attack settings. Prior approaches have made progress on this problem but face a number of limitations: often only exhibiting robustness to attacks on OOD data or failing to maintain strong clean performance. In this work, we adapt an existing robust classification framework, TRADES, extending it to the problem of robust OOD detection and discovering a novel objective function. Recognising the critical importance of a strong clean/robust trade-off for OOD detection, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Smart Grid Security and Resilience · Anomaly Detection Techniques and Applications
