Beyond Skip Connections: Top-Down Modulation for Object Detection
Abhinav Shrivastava, Rahul Sukthankar, Jitendra Malik, Abhinav Gupta

TL;DR
This paper introduces a top-down modulation approach inspired by human vision to enhance object detection by better integrating fine details from lower convolutional layers, leading to improved accuracy on COCO benchmarks.
Contribution
The paper proposes a novel top-down modulation network with lateral connections that effectively incorporate low-level details into object detection models, surpassing traditional skip connection methods.
Findings
Significant AP improvements on COCO testdev benchmark
Effective integration of low-level details enhances detection of hard categories
Outperforms baseline models without additional complex techniques
Abstract
In recent years, we have seen tremendous progress in the field of object detection. Most of the recent improvements have been achieved by targeting deeper feedforward networks. However, many hard object categories such as bottle, remote, etc. require representation of fine details and not just coarse, semantic representations. But most of these fine details are lost in the early convolutional layers. What we need is a way to incorporate finer details from lower layers into the detection architecture. Skip connections have been proposed to combine high-level and low-level features, but we argue that selecting the right features from low-level requires top-down contextual information. Inspired by the human visual pathway, in this paper we propose top-down modulations as a way to incorporate fine details into the detection framework. Our approach supplements the standard bottom-up,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Infrared Target Detection Methodologies · CCD and CMOS Imaging Sensors
