Visual Transformer for Task-aware Active Learning
Razvan Caramalau, Binod Bhattarai, Tae-Kyun Kim

TL;DR
This paper introduces a novel active learning pipeline that uses a Visual Transformer to model non-local dependencies between labeled and unlabeled data, enabling more effective sample selection for classification and detection tasks.
Contribution
It proposes a task-aware joint training approach for the learner and sampler, utilizing a Visual Transformer to improve active learning performance on multiple benchmarks.
Findings
Outperforms existing active learning methods on several benchmarks
Effectively models non-local visual dependencies for better sample selection
Demonstrates the benefit of joint training of learner and sampler
Abstract
Pool-based sampling in active learning (AL) represents a key framework for an-notating informative data when dealing with deep learning models. In this paper, we present a novel pipeline for pool-based Active Learning. Unlike most previous works, our method exploits accessible unlabelled examples during training to estimate their co-relation with the labelled examples. Another contribution of this paper is to adapt Visual Transformer as a sampler in the AL pipeline. Visual Transformer models non-local visual concept dependency between labelled and unlabelled examples, which is crucial to identifying the influencing unlabelled examples. Also, compared to existing methods where the learner and the sampler are trained in a multi-stage manner, we propose to train them in a task-aware jointly manner which enables transforming the latent space into two separate tasks: one that classifies the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Adam · Label Smoothing · Residual Connection · Dense Connections
