Open Set Recognition using Vision Transformer with an Additional   Detection Head

Feiyang Cai; Zhenkai Zhang; Jie Liu; Xenofon Koutsoukos

arXiv:2203.08441·cs.CV·March 17, 2022

Open Set Recognition using Vision Transformer with an Additional Detection Head

Feiyang Cai, Zhenkai Zhang, Jie Liu, Xenofon Koutsoukos

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel open set recognition method using Vision Transformer with an added detection head, achieving state-of-the-art results by effectively distinguishing known from unknown classes.

Contribution

First to leverage Vision Transformer for open set recognition, employing a two-stage training process with a detection head for improved unknown class detection.

Findings

01

Outperforms baseline methods on multiple OSR benchmarks

02

Achieves new state-of-the-art performance in open set recognition

03

Effectively distinguishes known and unknown classes using cluster-based detection

Abstract

Deep neural networks have demonstrated prominent capacities for image classification tasks in a closed set setting, where the test data come from the same distribution as the training data. However, in a more realistic open set scenario, traditional classifiers with incomplete knowledge cannot tackle test data that are not from the training classes. Open set recognition (OSR) aims to address this problem by both identifying unknown classes and distinguishing known classes simultaneously. In this paper, we propose a novel approach to OSR that is based on the vision transformer (ViT) technique. Specifically, our approach employs two separate training stages. First, a ViT model is trained to perform closed set classification. Then, an additional detection head is attached to the embedded features extracted by the ViT, trained to force the representations of known data to class-specific…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

feiyang-cai/osr_vit
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Softmax · Dense Connections · Residual Connection · Vision Transformer