Can Pre-trained Networks Detect Familiar Out-of-Distribution Data?
Atsuyuki Miyai, Qing Yu, Go Irie, Kiyoharu Aizawa

TL;DR
This paper investigates how pre-trained models perform in detecting out-of-distribution data that the models have seen during pre-training, revealing vulnerabilities especially in self-supervised models, and proposes a feature-space detection method.
Contribution
It introduces the concept of PT-OOD, analyzes its impact on pre-trained models, and proposes a feature-space detection approach to improve PT-OOD detection performance.
Findings
Self-supervised models are more vulnerable to PT-OOD than supervised models.
Low linear separability of PT-OOD features degrades detection performance.
Feature-space detection can mitigate PT-OOD vulnerabilities.
Abstract
Out-of-distribution (OOD) detection is critical for safety-sensitive machine learning applications and has been extensively studied, yielding a plethora of methods developed in the literature. However, most studies for OOD detection did not use pre-trained models and trained a backbone from scratch. In recent years, transferring knowledge from large pre-trained models to downstream tasks by lightweight tuning has become mainstream for training in-distribution (ID) classifiers. To bridge the gap between the practice of OOD detection and current classifiers, the unique and crucial problem is that the samples whose information networks know often come as OOD input. We consider that such data may significantly affect the performance of large pre-trained networks because the discriminability of these OOD data depends on the pre-training algorithm. Here, we define such OOD data as PT-OOD…
Peer Reviews
Decision·ICLR 2024 Conference Withdrawn Submission
1. The PT-OOD detection problem induced by the use of pre-trained models is interesting. 2. Extensive experimental analyses were conducted. 3. The influence of self-supervised and supervised pre-training are investigated.
1. The definition of PT-OOD is very ambiguous. 2. The article gives a weak motivation for the study and fails to see why targeted testing for PT-OOD is important. 3. In the experiments, the pre-training is not large-scale.
1. On CIFAR and ImageNet-1k datasets, the experiment scope is extensive. 2. The paper is clearly written.
1. The impact of post-processors (the scoring function) is huge, but the authors did not explore the state-of-the-art scoring functions such as ViM [1] and NNGuide [2] 2. It is not clear what exactly is meant by 'instance-by-instance' discriminative representation. 3. The experiments with models pretrained on ImageNet-21/22K are not available. 4. One of the main observations (i.e., supervised > self-supervised) is too similar to the observation noted in [3] [1] Wang, Haoqi, et al. "Vim: Out-of-
The problem setting of detecting PT-OOD samples is interesting and might be valuable for future application of large pre-trained models.
1). The analysis lacks support. The paper report that OOD detection methods perform better on supervised pretrained model than on self-supervised pretrained model. The analysis in this paper says that it is because the features of models under supervised pretraining are linear seperable while the features of self-supervised trained models are not. Except the illustration in Fig.2, I have not found any theoretical or empirical evidence to support this analysis. 2). The proposed method lacks nove
They tackle an original question of whether the source data used for pre-training a model that does not overlap with the target data the model was adapted to can be easily detected by showcasing empirical results with a couple of backbone architectures and a few learning algorithms. To the best of my knowledge, they are the first to uncover that using simply the features of a model with a simple method, such as kNN density estimation, one can almost perfectly distinguish PT-OOD from ID data. Th
This paper tries to motivate the problem with web-scale models but conducts supervised pre-training on ImageNet-1K only. When actually using web-scale datasets, supervised pre-training becomes unfeasible. This does not invalidate the results found but hinders the clarity and true contribution of the manuscript. Experiments on Dino v2 trained on a web-scale dataset would be appreciated, for instance. The model weights are easily available online. Figure 2 is a conceptual drawing. It would be ni
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification
