Implicit to Explicit Entropy Regularization: Benchmarking ViT Fine-tuning under Noisy Labels
Maria Marrium, Arif Mahmood, Mohammed Bennamoun

TL;DR
This paper benchmarks the robustness of Vision Transformers (ViTs) against noisy labels and explores how explicit entropy regularization can improve their performance, comparing with CNNs and analyzing various NLL methods across multiple datasets.
Contribution
It provides a comprehensive benchmark of ViT fine-tuning under noisy labels and demonstrates that explicit entropy regularization enhances ViT robustness and performance.
Findings
Entropy regularization improves ViT robustness to noisy labels.
ViTs are more vulnerable to noisy labels than CNNs.
Explicit entropy minimization enhances NLL methods for ViTs.
Abstract
Automatic annotation of large-scale datasets can introduce noisy training data labels, which adversely affect the learning process of deep neural networks (DNNs). Consequently, Noisy Labels Learning (NLL) has become a critical research field for Convolutional Neural Networks (CNNs), though it remains less explored for Vision Transformers (ViTs). In this study, we evaluate the vulnerability of ViT fine-tuning to noisy labels and compare its robustness with CNNs. We also investigate whether NLL methods developed for CNNs are equally effective for ViTs. Using linear probing and MLP-K fine-tuning, we benchmark two ViT backbones (ViT-B/16 and ViT-L/16) using three commonly used classification losses: Cross Entropy (CE), Focal Loss (FL), and Mean Absolute Error (MAE), alongside six robust NLL methods: GCE, SCE, NLNL, APL, NCE+AGCE, and ANL-CE. The evaluation is conducted across six datasets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Neural Networks and Applications
MethodsEntropy Regularization · Focal Loss
