Out-Of-Distribution Detection with Diversification (Provably)
Haiyun Yao, Zongbo Han, Huazhu Fu, Xi Peng, Qinghua Hu, Changqing, Zhang

TL;DR
This paper introduces diverseMix, a method that enhances out-of-distribution detection by increasing auxiliary outlier diversity, backed by theoretical guarantees and extensive experimental validation on large-scale benchmarks.
Contribution
The paper proposes diverseMix, a practical approach with theoretical guarantees, to improve OOD detection by diversifying auxiliary outliers efficiently.
Findings
diverseMix outperforms existing methods on large-scale benchmarks
Increasing auxiliary outlier diversity improves OOD detection accuracy
Theoretical analysis confirms the effectiveness of diversity in outlier sets
Abstract
Out-of-distribution (OOD) detection is crucial for ensuring reliable deployment of machine learning models. Recent advancements focus on utilizing easily accessible auxiliary outliers (e.g., data from the web or other datasets) in training. However, we experimentally reveal that these methods still struggle to generalize their detection capabilities to unknown OOD data, due to the limited diversity of the auxiliary outliers collected. Therefore, we thoroughly examine this problem from the generalization perspective and demonstrate that a more diverse set of auxiliary outliers is essential for enhancing the detection capabilities. However, in practice, it is difficult and costly to collect sufficiently diverse auxiliary outlier data. Therefore, we propose a simple yet practical approach with a theoretical guarantee, termed Diversity-induced Mixup for OOD detection (diverseMix), which…
Peer Reviews
Decision·NeurIPS 2024 poster
Strengths: - The paper is well-composed and presents an extensive array of experiments across multiple OOD detection benchmarks. - This study offers important insights into the significance of auxiliary datasets in OOD regularization, addressing a frequently neglected aspect of OOD regularization techniques. - The authors provide a robust theoretical foundation for diverseMix. Additionally, they show strong empirical evaluations which further highlight the effectiveness of diverseMix across a ra
Weakness: - The reviewer has some concerns regarding the empirical evaluations of diverseMix. In particular, the choice of how the ImageNet-1k is split into ImageNet-200 as ID while the remaining data is leveraged as OOD seems arbitrary.
Code & Models
Videos
Taxonomy
TopicsFault Detection and Control Systems · Anomaly Detection Techniques and Applications · Advanced Statistical Process Monitoring
MethodsMixup · Sparse Evolutionary Training · Focus
