GCI-ViTAL: Gradual Confidence Improvement with Vision Transformers for   Active Learning on Label Noise

Moseli Mots'oehli; kyungim Baek

arXiv:2411.05939·cs.CV·November 12, 2024

GCI-ViTAL: Gradual Confidence Improvement with Vision Transformers for Active Learning on Label Noise

Moseli Mots'oehli, kyungim Baek

PDF

Open Access

TL;DR

This paper introduces GCI-ViTAL, a robust active learning algorithm using Vision Transformers that effectively handles label noise by selecting uncertain and semantically divergent samples, outperforming CNN-based methods across multiple datasets.

Contribution

The paper proposes GCI-ViTAL, a novel active learning method leveraging Vision Transformers and semantic information to improve robustness against label noise in image classification.

Findings

01

ViTs outperform CNNs in noisy label settings.

02

GCI-ViTAL effectively identifies uncertain and divergent samples.

03

Semantic information enhances model robustness under label noise.

Abstract

Active learning aims to train accurate classifiers while minimizing labeling costs by strategically selecting informative samples for annotation. This study focuses on image classification tasks, comparing AL methods on CIFAR10, CIFAR100, Food101, and the Chest X-ray datasets under varying label noise rates. We investigate the impact of model architecture by comparing Convolutional Neural Networks (CNNs) and Vision Transformer (ViT)-based models. Additionally, we propose a novel deep active learning algorithm, GCI-ViTAL, designed to be robust to label noise. GCI-ViTAL utilizes prediction entropy and the Frobenius norm of last-layer attention vectors compared to class-centric clean set attention vectors. Our method identifies samples that are both uncertain and semantically divergent from typical images in their assigned class. This allows GCI-ViTAL to select informative data points even…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Infrastructure Maintenance and Monitoring

MethodsAttention Is All You Need · Linear Layer · Dense Connections · Absolute Position Encodings · Layer Normalization · Position-Wise Feed-Forward Layer · Adam · Multi-Head Attention · Residual Connection · Softmax