Attend in groups: a weakly-supervised deep learning framework for learning from web data
Bohan Zhuang, Lingqiao Liu, Yao Li, Chunhua Shen, Ian Reid

TL;DR
This paper introduces a weakly-supervised deep learning framework that effectively learns from noisy web data by using random grouping and attention mechanisms, improving visual recognition performance.
Contribution
The paper presents a novel end-to-end framework combining random grouping and attention to mitigate label noise in web images for deep learning.
Findings
Outperforms baseline methods on challenging datasets
Effective noise reduction through random grouping and attention
Demonstrates robustness on a new fine-grained car dataset
Abstract
Large-scale datasets have driven the rapid development of deep neural networks for visual recognition. However, annotating a massive dataset is expensive and time-consuming. Web images and their labels are, in comparison, much easier to obtain, but direct training on such automatically harvested images can lead to unsatisfactory performance, because the noisy labels of Web images adversely affect the learned recognition models. To address this drawback we propose an end-to-end weakly-supervised deep learning framework which is robust to the label noise in Web images. The proposed framework relies on two unified strategies -- random grouping and attention -- to effectively reduce the negative impact of noisy web image annotations. Specifically, random grouping stacks multiple images into a single training instance and thus increases the labeling accuracy at the instance level. Attention,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
