Attention-based Image Upsampling
Souvik Kundu, Hesham Mostafa, Sharath Nittur Sridhar, Sairam, Sundaresan

TL;DR
This paper introduces an attention-based upsampling method that replaces traditional transposed convolution, improving image super-resolution and joint upsampling tasks by using fewer parameters and better performance.
Contribution
It presents a novel attention-based upsampling operation that outperforms traditional methods and is particularly effective for fusing multi-modal image information.
Findings
Attention-based upsampling outperforms traditional methods.
Fewer parameters are needed for comparable or better results.
Effective in multi-modal image fusion tasks.
Abstract
Convolutional layers are an integral part of many deep neural network solutions in computer vision. Recent work shows that replacing the standard convolution operation with mechanisms based on self-attention leads to improved performance on image classification and object detection tasks. In this work, we show how attention mechanisms can be used to replace another canonical operation: strided transposed convolution. We term our novel attention-based operation attention-based upsampling since it increases/upsamples the spatial dimensions of the feature maps. Through experiments on single image super-resolution and joint-image upsampling tasks, we show that attention-based upsampling consistently outperforms traditional upsampling methods based on strided transposed convolution or based on adaptive filters while using fewer parameters. We show that the inherent flexibility of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image Enhancement Techniques · Advanced Image Fusion Techniques
MethodsTransposed convolution · Convolution
