Hide-and-Seek: A Data Augmentation Technique for Weakly-Supervised   Localization and Beyond

Krishna Kumar Singh; Hao Yu; Aron Sarmasi; Gautam Pradeep; Yong Jae; Lee

arXiv:1811.02545·cs.CV·November 7, 2018·68 cites

Hide-and-Seek: A Data Augmentation Technique for Weakly-Supervised Localization and Beyond

Krishna Kumar Singh, Hao Yu, Aron Sarmasi, Gautam Pradeep, Yong Jae, Lee

PDF

Open Access 2 Repos

TL;DR

Hide-and-Seek is a versatile data augmentation method that improves weakly-supervised object localization and other visual recognition tasks by randomly hiding patches in training images to encourage models to focus on diverse relevant features.

Contribution

The paper introduces a novel data augmentation technique called Hide-and-Seek that enhances model performance across multiple visual recognition tasks, especially weakly-supervised localization.

Findings

01

Improves object localization accuracy in weakly-supervised settings

02

Enhances performance in various recognition tasks like classification and segmentation

03

Works with any network without requiring changes during testing

Abstract

We propose 'Hide-and-Seek' a general purpose data augmentation technique, which is complementary to existing data augmentation techniques and is beneficial for various visual recognition tasks. The key idea is to hide patches in a training image randomly, in order to force the network to seek other relevant content when the most discriminative content is hidden. Our approach only needs to modify the input image and can work with any network to improve its performance. During testing, it does not need to hide any patches. The main advantage of Hide-and-Seek over existing data augmentation techniques is its ability to improve object localization accuracy in the weakly-supervised setting, and we therefore use this task to motivate the approach. However, Hide-and-Seek is not tied only to the image localization task, and can generalize to other forms of visual input like videos, as well as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Domain Adaptation and Few-Shot Learning