AHA: Human-Assisted Out-of-Distribution Generalization and Detection
Haoyue Bai, Jifan Zhang, Robert Nowak

TL;DR
AHA is a human-assisted framework that improves out-of-distribution generalization and detection by strategically labeling data within a maximally disambiguating region, significantly enhancing performance with minimal annotations.
Contribution
The paper introduces a novel integrated approach, AHA, combining human assistance and a maximum disambiguation region to enhance OOD generalization and detection.
Findings
Significantly outperforms existing methods with few annotations
Effective identification of the maximal disambiguation region
Improves both OOD detection and generalization performance
Abstract
Modern machine learning models deployed often encounter distribution shifts in real-world applications, manifesting as covariate or semantic out-of-distribution (OOD) shifts. These shifts give rise to challenges in OOD generalization and OOD detection. This paper introduces a novel, integrated approach AHA (Adaptive Human-Assisted OOD learning) to simultaneously address both OOD generalization and detection through a human-assisted framework by labeling data in the wild. Our approach strategically labels examples within a novel maximum disambiguation region, where the number of semantic and covariate OOD data roughly equalizes. By labeling within this region, we can maximally disambiguate the two types of OOD data, thereby maximizing the utility of the fixed labeling budget. Our algorithm first utilizes a noisy binary search algorithm that identifies the maximal disambiguation region…
Peer Reviews
Decision·NeurIPS 2024 poster
1. this paper is well written and easy to follow 2. good visualization and extensive experiments 3. Maximum Disambiguation Region and reduction to noisy binary search are new to me
1. there exsits a strong assumption that the weighted densities of semantic and covariance ood should equalize 2. what is difference between active learning and the proposed Human-Assisted Learning 3. what is the time used for noisy binary search 4. whether the lamda searched on the one dataset can be transfer to another dataset
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications
