GCI-ViTAL: Gradual Confidence Improvement with Vision Transformers for Active Learning on Label Noise
Moseli Mots'oehli, kyungim Baek

TL;DR
This paper introduces GCI-ViTAL, a robust active learning algorithm using Vision Transformers that effectively handles label noise by selecting uncertain and semantically divergent samples, outperforming CNN-based methods across multiple datasets.
Contribution
The paper proposes GCI-ViTAL, a novel active learning method leveraging Vision Transformers and semantic information to improve robustness against label noise in image classification.
Findings
ViTs outperform CNNs in noisy label settings.
GCI-ViTAL effectively identifies uncertain and divergent samples.
Semantic information enhances model robustness under label noise.
Abstract
Active learning aims to train accurate classifiers while minimizing labeling costs by strategically selecting informative samples for annotation. This study focuses on image classification tasks, comparing AL methods on CIFAR10, CIFAR100, Food101, and the Chest X-ray datasets under varying label noise rates. We investigate the impact of model architecture by comparing Convolutional Neural Networks (CNNs) and Vision Transformer (ViT)-based models. Additionally, we propose a novel deep active learning algorithm, GCI-ViTAL, designed to be robust to label noise. GCI-ViTAL utilizes prediction entropy and the Frobenius norm of last-layer attention vectors compared to class-centric clean set attention vectors. Our method identifies samples that are both uncertain and semantically divergent from typical images in their assigned class. This allows GCI-ViTAL to select informative data points even…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Infrastructure Maintenance and Monitoring
MethodsAttention Is All You Need · Linear Layer · Dense Connections · Absolute Position Encodings · Layer Normalization · Position-Wise Feed-Forward Layer · Adam · Multi-Head Attention · Residual Connection · Softmax
