Quantifying Statistical Significance of Deep Nearest Neighbor Anomaly Detection via Selective Inference

Mizuki Niihori; Shuichi Nishino; Teruyuki Katsuoka; Tomohiro Shiraishi; Kouichi Taji; Ichiro Takeuchi

arXiv:2502.12978·stat.ML·December 1, 2025

Quantifying Statistical Significance of Deep Nearest Neighbor Anomaly Detection via Selective Inference

Mizuki Niihori, Shuichi Nishino, Teruyuki Katsuoka, Tomohiro Shiraishi, Kouichi Taji, Ichiro Takeuchi

PDF

Open Access 1 Video

TL;DR

This paper introduces a statistical framework for deep k-nearest neighbor anomaly detection that quantifies the significance of anomalies using p-values, addressing uncertainty estimation and false positive control in industrial applications.

Contribution

It develops a novel selective inference approach to quantify anomaly significance in deep kNN AD, managing selection bias and enabling reliable uncertainty estimation.

Findings

01

Provides a method to compute p-values for anomalies

02

Demonstrates reliable false positive control in industrial datasets

03

Enhances interpretability and trustworthiness of deep kNN AD

Abstract

In real-world applications, anomaly detection (AD) often operates without access to anomalous data, necessitating semi-supervised methods that rely solely on normal data. Among these methods, deep k-nearest neighbor (deep kNN) AD stands out for its interpretability and flexibility, leveraging distance-based scoring in deep latent spaces.Despite its strong performance, deep kNN lacks a mechanism to quantify uncertainty-an essential feature for critical applications such as industrial inspection. To address this limitation, we propose a statistical framework that quantifies the significance of detected anomalies in the form of p-values, thereby enabling control over false positive rates at a user-specified significance level (e.g.,0.05). A central challenge lies in managing selection bias, which we tackle using Selective Inference-a principled method for conducting inference conditioned…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Quantifying Statistical Significance of Deep Nearest Neighbor Anomaly Detection via Selective Inference· slideslive

Taxonomy

TopicsAdvanced Statistical Process Monitoring