Suppressing Mislabeled Data via Grouping and Self-Attention

Xiaojiang Peng; Kai Wang; Zhaoyang Zeng; Qing Li; Jianfei Yang; Yu; Qiao

arXiv:2010.15603·cs.CV·October 30, 2020

Suppressing Mislabeled Data via Grouping and Self-Attention

Xiaojiang Peng, Kai Wang, Zhaoyang Zeng, Qing Li, Jianfei Yang, Yu, Qiao

PDF

Open Access 1 Repo

TL;DR

This paper introduces Attentive Feature Mixup (AFM), a simple and efficient training method that improves noise robustness in deep learning by focusing on clean samples and suppressing mislabeled data through group-based attention and mixup interpolation.

Contribution

The paper proposes AFM, a novel plug-and-play module that constructs groups, assigns attention weights, and performs mixup to reduce the impact of noisy labels without extra clean data or assumptions.

Findings

01

Achieves state-of-the-art results on Food101N and Clothing1M datasets.

02

Effectively suppresses influence of mislabeled data via attention-weighted mixup.

03

Reduces the ratio of useless samples compared to original noisy datasets.

Abstract

Deep networks achieve excellent results on large-scale clean data but degrade significantly when learning from noisy labels. To suppressing the impact of mislabeled data, this paper proposes a conceptually simple yet efficient training block, termed as Attentive Feature Mixup (AFM), which allows paying more attention to clean samples and less to mislabeled ones via sample interactions in small groups. Specifically, this plug-and-play AFM first leverages a \textit{group-to-attend} module to construct groups and assign attention weights for group-wise samples, and then uses a \textit{mixup} module with the attention weights to interpolate massive noisy-suppressed samples. The AFM has several appealing benefits for noise-robust deep learning. (i) It does not rely on any assumptions and extra clean subset. (ii) With massive interpolations, the ratio of useless samples is reduced…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kaiwang960112/AFM
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning

MethodsMixup