Scale-Invariant Object Detection by Adaptive Convolution with Unified Global-Local Context
Amrita Singh, and Snehasis Mukherjee

TL;DR
This paper introduces SAC-Net, a novel adaptive convolutional network that dynamically adjusts atrous rates and incorporates global context, significantly improving multi-scale object detection, especially for small objects, over existing models.
Contribution
The paper proposes a switchable, adaptive atrous convolution mechanism within a unified framework, enhancing dense feature retention and scale-invariance in object detection.
Findings
Outperforms state-of-the-art models on benchmark datasets
Improves detection accuracy for small and multi-scale objects
Demonstrates robustness with dynamic atrous rate adjustment
Abstract
Dense features are important for detecting minute objects in images. Unfortunately, despite the remarkable efficacy of the CNN models in multi-scale object detection, CNN models often fail to detect smaller objects in images due to the loss of dense features during the pooling process. Atrous convolution addresses this issue by applying sparse kernels. However, sparse kernels often can lose the multi-scale detection efficacy of the CNN model. In this paper, we propose an object detection model using a Switchable (adaptive) Atrous Convolutional Network (SAC-Net) based on the efficientDet model. A fixed atrous rate limits the performance of the CNN models in the convolutional layers. To overcome this limitation, we introduce a switchable mechanism that allows for dynamically adjusting the atrous rate during the forward pass. The proposed SAC-Net encapsulates the benefits of both low-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Depthwise Convolution · Batch Normalization · Pointwise Convolution · Depthwise Separable Convolution · BiFPN · Convolution · EfficientDet
