Implicit to Explicit Entropy Regularization: Benchmarking ViT   Fine-tuning under Noisy Labels

Maria Marrium; Arif Mahmood; Mohammed Bennamoun

arXiv:2410.04256·cs.CV·October 8, 2024

Implicit to Explicit Entropy Regularization: Benchmarking ViT Fine-tuning under Noisy Labels

Maria Marrium, Arif Mahmood, Mohammed Bennamoun

PDF

Open Access

TL;DR

This paper benchmarks the robustness of Vision Transformers (ViTs) against noisy labels and explores how explicit entropy regularization can improve their performance, comparing with CNNs and analyzing various NLL methods across multiple datasets.

Contribution

It provides a comprehensive benchmark of ViT fine-tuning under noisy labels and demonstrates that explicit entropy regularization enhances ViT robustness and performance.

Findings

01

Entropy regularization improves ViT robustness to noisy labels.

02

ViTs are more vulnerable to noisy labels than CNNs.

03

Explicit entropy minimization enhances NLL methods for ViTs.

Abstract

Automatic annotation of large-scale datasets can introduce noisy training data labels, which adversely affect the learning process of deep neural networks (DNNs). Consequently, Noisy Labels Learning (NLL) has become a critical research field for Convolutional Neural Networks (CNNs), though it remains less explored for Vision Transformers (ViTs). In this study, we evaluate the vulnerability of ViT fine-tuning to noisy labels and compare its robustness with CNNs. We also investigate whether NLL methods developed for CNNs are equally effective for ViTs. Using linear probing and MLP-K fine-tuning, we benchmark two ViT backbones (ViT-B/16 and ViT-L/16) using three commonly used classification losses: Cross Entropy (CE), Focal Loss (FL), and Mean Absolute Error (MAE), alongside six robust NLL methods: GCE, SCE, NLNL, APL, NCE+AGCE, and ANL-CE. The evaluation is conducted across six datasets…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Neural Networks and Applications

MethodsEntropy Regularization · Focal Loss