TL;DR
PSConv introduces a novel convolutional layer that integrates multi-scale features through cyclically varying dilation rates within each filter, enhancing CNN robustness to scale variations without extra parameters.
Contribution
This paper proposes PSConv, a new convolution operation that captures multi-scale features at a finer granularity by cyclically varying dilation rates within filters, serving as a drop-in replacement for standard convolutions.
Findings
PSConv improves accuracy on ImageNet and MS COCO benchmarks.
PSConv achieves better scale robustness without additional parameters.
PSConv can be integrated into existing CNN architectures easily.
Abstract
Despite their strong modeling capacities, Convolutional Neural Networks (CNNs) are often scale-sensitive. For enhancing the robustness of CNNs to scale variance, multi-scale feature fusion from different layers or filters attracts great attention among existing solutions, while the more granular kernel space is overlooked. We bridge this regret by exploiting multi-scale features in a finer granularity. The proposed convolution operation, named Poly-Scale Convolution (PSConv), mixes up a spectrum of dilation rates and tactfully allocate them in the individual convolutional kernels of each filter regarding a single convolutional layer. Specifically, dilation rates vary cyclically along the axes of input and output channels of the filters, aggregating features over a wide range of scales in a neat style. PSConv could be a drop-in replacement of the vanilla convolution in many prevailing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConvolution
