Dilated Convolution with Learnable Spacings: beyond bilinear interpolation
Ismail Khalfaoui-Hassani, Thomas Pellegrini, Timoth\'ee Masquelier

TL;DR
This paper enhances dilated convolution with learnable spacings by exploring advanced interpolation methods like Gaussian, improving image classification performance without increasing model complexity.
Contribution
It introduces longer-range interpolation techniques, including Gaussian interpolation, into DCLS, surpassing bilinear interpolation for better accuracy on ImageNet classification.
Findings
Gaussian interpolation improves accuracy over bilinear.
Longer-range interpolations enhance performance.
No increase in model parameters needed.
Abstract
Dilated Convolution with Learnable Spacings (DCLS) is a recently proposed variation of the dilated convolution in which the spacings between the non-zero elements in the kernel, or equivalently their positions, are learnable. Non-integer positions are handled via interpolation. Thanks to this trick, positions have well-defined gradients. The original DCLS used bilinear interpolation, and thus only considered the four nearest pixels. Yet here we show that longer range interpolations, and in particular a Gaussian interpolation, allow improving performance on ImageNet1k classification on two state-of-the-art convolutional architectures (ConvNeXt and Conv\-Former), without increasing the number of parameters. The method code is based on PyTorch and is available at https://github.com/K-H-Ismail/Dilated-Convolution-with-Learnable-Spacings-PyTorch
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
MethodsDilated Convolution · Convolution · Dilated convolution with learnable spacings
