BABO: Background Activation Black-Out for Efficient Object Detection
Byungseok Roh, Han-Cheol Cho, Myung-Ho Ju, Soon Hyung Pyo

TL;DR
This paper introduces BABO, a method that sparsifies background activations in object detection models to improve inference efficiency on resource-limited devices, maintaining accuracy while reducing computation.
Contribution
BABO incorporates a lightweight objectness mask network to zero out background regions, enabling faster inference through sparse convolution techniques without sacrificing detection accuracy.
Findings
Background activation zeros increased from 36% to 68% on MobileNetV2-SSDLite.
Total MAC operations reduced to 62% of original with background sparsification.
Effective across different networks and datasets, including VGG, RetinaNet, MS-COCO, and PASCAL VOC.
Abstract
Recent advances in deep learning have enabled complex real-world use cases comprised of multiple vision tasks and detection tasks are being shifted to the edge side as a pre-processing step of the entire workload. Since running a deep model on resource-constraint devices is challenging, techniques for efficient inference methods are demanded. In this paper, we present an objectness-aware object detection method to reduce computational cost by sparsifying activation values on background regions where target objects don't exist. Sparsified activation can be exploited to increase inference speed by software or hardware accelerated sparse convolution techniques. To accomplish this goal, we incorporate a light-weight objectness mask generation (OMG) network in front of an object detection (OD) network so that it can zero out unnecessary background areas of an input image before being fed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Visual Attention and Saliency Detection · Industrial Vision Systems and Defect Detection
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · *Communicated@Fast*How Do I Communicate to Expedia? · Convolution
