How Does Unlabeled Data Provably Help Out-of-Distribution Detection?

Xuefeng Du; Zhen Fang; Ilias Diakonikolas; Yixuan Li

arXiv:2402.03502·cs.LG·February 7, 2024·1 cites

How Does Unlabeled Data Provably Help Out-of-Distribution Detection?

Xuefeng Du, Zhen Fang, Ilias Diakonikolas, Yixuan Li

PDF

Open Access 1 Repo 1 Video 3 Reviews

TL;DR

This paper introduces SAL, a new framework for out-of-distribution detection that separates outliers from unlabeled data and provides strong theoretical guarantees, demonstrating state-of-the-art empirical performance.

Contribution

The paper proposes SAL, a novel learning framework with theoretical error bounds for OOD detection using unlabeled data, bridging the gap between theory and practice.

Findings

01

SAL achieves state-of-the-art results on benchmarks.

02

Theoretical guarantees justify the separation approach.

03

Empirical results validate the effectiveness of SAL.

Abstract

Using unlabeled data to regularize the machine learning models has demonstrated promise for improving safety and reliability in detecting out-of-distribution (OOD) data. Harnessing the power of unlabeled in-the-wild data is non-trivial due to the heterogeneity of both in-distribution (ID) and OOD data. This lack of a clean set of OOD samples poses significant challenges in learning an optimal OOD classifier. Currently, there is a lack of research on formally understanding how unlabeled data helps OOD detection. This paper bridges the gap by introducing a new learning framework SAL (Separate And Learn) that offers both strong theoretical guarantees and empirical effectiveness. The framework separates candidate outliers from the unlabeled data and then trains an OOD classifier using the candidate outliers and the labeled ID data. Theoretically, we provide rigorous error bounds from the…

Peer Reviews

Decision·ICLR 2024 poster

Reviewer 01Rating 6· marginally above the acceptance thresholdConfidence 2

Strengths

1 SAL’s methodology is structured around two distinct phases—screening and classification—which can be independently optimized, offering enhanced flexibility. 2 Utilizing a Large Volume of Unlabeled Data: SAL effectively leverages substantial amounts of unlabeled data to extract valuable information, thereby bolstering its detection capabilities. 3 Theoretical Support: Beyond its impressive empirical performance, SAL is underpinned by robust theoretical foundations.

Weaknesses

1 In scenarios where the actual OOD data markedly diverges from the outliers present in the unlabeled dataset, there arises a question regarding the preservation of SAL’s performance. 2 The efficacy of SAL is significantly influenced by the quality of the unlabeled data employed, indicating a substantial dependence on data integrity.

Reviewer 02Rating 8· accept, good paperConfidence 3

Strengths

1. A theory has been established to investigate aspects of separability and learnability. This contribution is both novel and significant. 2. Experimental evaluations conducted on standard benchmarks demonstrate that SAL achieves SOTA performance. 3. A novel method grounded in theory has been developed to advance safe machine learning practices. Theory serves as a crucial driver in this endeavour. I am very happy to see the novel work on provable OOD detection.

Weaknesses

1. Could you provide explanations or conduct experiments to elucidate the factors contributing to the decreased ID accuracy depicted in Table 1? 2. Why is pi set to 0.1 in most experiments? Could you conduct additional experiments to investigate whether pi remains robust across a range of values? Furthermore, does the performance of pi align with theoretical predictions? 3. It appears that the top singular vector is crucial for SAL. Have you conducted any experiments to demonstrate the perfo

Reviewer 03Rating 6· marginally above the acceptance thresholdConfidence 5

Strengths

1. The manuscript presents some theoretical analyses as well as a number of intriguing illustrations. 2. The experimental results look promising, with comparisons made against numerous baseline methods.

Weaknesses

1. The manuscript lacks crucial baselines, such as [1] and [2], which are essential for a comprehensive evaluation, and fails to provide an analysis or comparison with them. 2. The essential topic of the article is weakly supervised out-of-distribution detection, although it is described from different perspectives. [1] Zhou, Zhi, et al. "Step: Out-of-distribution detection in the presence of limited in-distribution labeled data." Advances in Neural Information Processing Systems 34 (2021): 291

Code & Models

Repositories

deeplearning-wisc/sal
pytorchOfficial

Videos

How Does Unlabeled Data Provably Help Out-of-Distribution Detection?· slideslive

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Data-Driven Disease Surveillance

MethodsSparse Evolutionary Training