ACD-U: Asymmetric co-teaching with machine unlearning for robust learning with noisy labels
Reo Fukunaga, Soh Yoshida, Mitsuji Muneyasu

TL;DR
ACD-U introduces an asymmetric co-teaching framework with machine unlearning that leverages different model architectures and post-hoc error correction to improve robustness against noisy labels in deep learning.
Contribution
It proposes a novel asymmetric co-teaching method combining CLIP-pretrained vision Transformers and CNNs with machine unlearning for effective noisy label correction.
Findings
Achieves state-of-the-art results on noisy datasets.
Effectively mitigates confirmation bias in noisy label learning.
Performs well in high-noise and instance-dependent noise scenarios.
Abstract
Deep neural networks are prone to memorizing incorrect labels during training, which degrades their generalizability. Although recent methods have combined sample selection with semi-supervised learning (SSL) to exploit the memorization effect -- where networks learn from clean data before noisy data -- they cannot correct selection errors once a sample is misclassified. To overcome this, we propose asymmetric co-teaching with different architectures (ACD)-U, an asymmetric co-teaching framework that uses different model architectures and incorporates machine unlearning. ACD-U addresses this limitation through two core mechanisms. First, its asymmetric co-teaching pairs a contrastive language-image pretraining (CLIP)-pretrained vision Transformer with a convolutional neural network (CNN), leveraging their complementary learning behaviors: the pretrained model provides stable predictions,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
