Global Context Aware RCNN for Object Detection
Wenchao Zhang, Chong Fu, Haoyu Xie, Mai Zhu, Ming Tie, Junxin Chen

TL;DR
This paper introduces GCA RCNN, a novel object detection framework that enhances global context understanding by integrating attention mechanisms and dense connections, leading to improved detection accuracy.
Contribution
The paper proposes a global context aware mechanism with attention strategies and dense connections to improve feature extraction and refinement in object detection.
Findings
Significant accuracy improvements on COCO dataset
Effective global context integration enhances detection performance
Lightweight version maintains performance with minimal complexity increase
Abstract
RoIPool/RoIAlign is an indispensable process for the typical two-stage object detection algorithm, it is used to rescale the object proposal cropped from the feature pyramid to generate a fixed size feature map. However, these cropped feature maps of local receptive fields will heavily lose global context information. To tackle this problem, we propose a novel end-to-end trainable framework, called Global Context Aware (GCA) RCNN, aiming at assisting the neural network in strengthening the spatial correlation between the background and the foreground by fusing global context information. The core component of our GCA framework is a context aware mechanism, in which both global feature pyramid and attention strategies are used for feature extraction and feature refinement, respectively. Specifically, we leverage the dense connection to improve the information flow of the global context…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods
MethodsGraph Contrastive learning with Adaptive augmentation · Attentive Walk-Aggregating Graph Neural Network · Convolution · 1x1 Convolution · Feature Pyramid Network
