Unsupervised Layer-wise Score Aggregation for Textual OOD Detection
Maxime Darrin, Guillaume Staerman, Eduardo Dadalto C\^amara Gomes,, Jackie CK Cheung, Pablo Piantanida, Pierre Colombo

TL;DR
This paper introduces an unsupervised, data-driven method for aggregating layer-wise anomaly scores to improve textual out-of-distribution detection, outperforming traditional last-layer approaches and approaching oracle performance.
Contribution
It proposes a novel unsupervised score aggregation technique that automatically selects the best layer for OOD detection, eliminating manual feature selection.
Findings
The method achieves robust, consistent results across diverse tasks.
It outperforms traditional last-layer based OOD detection methods.
Performance approaches the oracle's best layer performance.
Abstract
Out-of-distribution (OOD) detection is a rapidly growing field due to new robustness and security requirements driven by an increased number of AI-based systems. Existing OOD textual detectors often rely on an anomaly score (e.g., Mahalanobis distance) computed on the embedding output of the last layer of the encoder. In this work, we observe that OOD detection performance varies greatly depending on the task and layer output. More importantly, we show that the usual choice (the last layer) is rarely the best one for OOD detection and that far better results could be achieved if the best layer were picked. To leverage this observation, we propose a data-driven, unsupervised method to combine layer-wise anomaly scores. In addition, we extend classical textual OOD benchmarks by including classification tasks with a greater number of classes (up to 77), which reflects more realistic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Digital Media Forensic Detection · Advanced Malware Detection Techniques
MethodsFeature Selection
