Instance-aware, Context-focused, and Memory-efficient Weakly Supervised Object Detection
Zhongzheng Ren, Zhiding Yu, Xiaodong Yang, Ming-Yu Liu, Yong Jae Lee,, Alexander G. Schwing, Jan Kautz

TL;DR
This paper introduces a novel framework for weakly supervised object detection that enhances instance differentiation, focuses on entire objects, and reduces memory usage, achieving state-of-the-art results on multiple benchmarks.
Contribution
It proposes an instance-aware, context-focused, and memory-efficient framework with self-training and a learnable DropBlock, advancing weakly supervised object detection methods.
Findings
Achieved state-of-the-art AP on COCO, VOC 2007, and VOC 2012 datasets.
First to benchmark ResNet models in weakly supervised detection.
Introduced a memory-efficient sequential back-propagation technique.
Abstract
Weakly supervised learning has emerged as a compelling tool for object detection by reducing the need for strong supervision during training. However, major challenges remain: (1) differentiation of object instances can be ambiguous; (2) detectors tend to focus on discriminative parts rather than entire objects; (3) without ground truth, object proposals have to be redundant for high recalls, causing significant memory consumption. Addressing these challenges is difficult, as it often requires to eliminate uncertainties and trivial solutions. To target these issues we develop an instance-aware and context-focused unified framework. It employs an instance-aware self-training algorithm and a learnable Concrete DropBlock while devising a memory-efficient sequential batch back-propagation. Our proposed method achieves state-of-the-art results on COCO (, ), VOC…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Instance-Aware, Context-Focused, and Memory-Efficient Weakly Supervised Object Detection· youtube
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
MethodsAverage Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling · DropBlock
