Adaptive Scoring and Thresholding with Human Feedback for Robust   Out-of-Distribution Detection

Daisuke Yamada; Harit Vishwakarma; Ramya Korlakai Vinayak

arXiv:2505.02299·cs.LG·May 6, 2025

Adaptive Scoring and Thresholding with Human Feedback for Robust Out-of-Distribution Detection

Daisuke Yamada, Harit Vishwakarma, Ramya Korlakai Vinayak

PDF

Open Access

TL;DR

This paper introduces a human-in-the-loop framework that adaptively updates scoring functions and thresholds for out-of-distribution detection, ensuring high true positive rates while controlling false positive rates in real-time.

Contribution

It presents a novel adaptive method with theoretical FPR guarantees that improves OOD detection performance over static approaches.

Findings

01

Outperforms existing methods on OpenOOD benchmarks.

02

Achieves higher TPRs with FPR control in dynamic OOD scenarios.

03

Provides theoretical FPR control guarantees under stationary conditions.

Abstract

Machine Learning (ML) models are trained on in-distribution (ID) data but often encounter out-of-distribution (OOD) inputs during deployment -- posing serious risks in safety-critical domains. Recent works have focused on designing scoring functions to quantify OOD uncertainty, with score thresholds typically set based solely on ID data to achieve a target true positive rate (TPR), since OOD data is limited before deployment. However, these TPR-based thresholds leave false positive rates (FPR) uncontrolled, often resulting in high FPRs where OOD points are misclassified as ID. Moreover, fixed scoring functions and thresholds lack the adaptivity needed to handle newly observed, evolving OOD inputs, leading to sub-optimal performance. To address these challenges, we propose a human-in-the-loop framework that \emph{safely updates both scoring functions and thresholds on the fly} based on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed Sensor Networks and Detection Algorithms · Fault Detection and Control Systems

MethodsSparse Evolutionary Training