Adaptive Sparse Convolutional Networks with Global Context Enhancement for Faster Object Detection on Drone Images
Bowei Du, Yecheng Huang, Jiaxin Chen, Di Huang

TL;DR
This paper introduces CEASC, a novel network that enhances sparse convolution with global context and adaptive masking, significantly improving drone image object detection speed and efficiency while maintaining accuracy.
Contribution
It proposes a global context-enhanced group normalization and adaptive multi-layer masking strategies for sparse convolutional networks in drone object detection.
Findings
Reduces GFLOPs and accelerates inference on benchmarks
Maintains competitive detection accuracy
Compatible with state-of-the-art frameworks like RetinaNet and GFL
Abstract
Object detection on drone images with low-latency is an important but challenging task on the resource-constrained unmanned aerial vehicle (UAV) platform. This paper investigates optimizing the detection head based on the sparse convolution, which proves effective in balancing the accuracy and efficiency. Nevertheless, it suffers from inadequate integration of contextual information of tiny objects as well as clumsy control of the mask ratio in the presence of foreground with varying scales. To address the issues above, we propose a novel global context-enhanced adaptive sparse convolutional network (CEASC). It first develops a context-enhanced group normalization (CE-GN) layer, by replacing the statistics based on sparsely sampled features with the global contextual ones, and then designs an adaptive multi-layer masking strategy to generate optimal mask ratios at distinct scales for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Image Enhancement Techniques
MethodsConvolution · 1x1 Convolution · Feature Pyramid Network · Group Normalization · Focal Loss · RetinaNet
