Accelerated Rotation-Invariant Convolution for UAV Image Segmentation
Manduhu Manduhu, Alexander Dow, Gerard Dooly, James Riordan

TL;DR
This paper presents a GPU-optimized rotation-invariant convolution method that significantly accelerates UAV image segmentation tasks while maintaining high accuracy, reducing computational costs and energy consumption compared to existing approaches.
Contribution
The paper introduces a novel, efficient rotation-invariant convolution framework that eliminates the need for traditional data-lowering steps, enabling faster and more energy-efficient UAV image segmentation.
Findings
Achieves 20-55% faster training than CUDNN
Reduces energy consumption by 15-45%
Improves segmentation accuracy by up to 6%
Abstract
Rotation invariance is essential for precise, object-level segmentation in UAV aerial imagery, where targets can have arbitrary orientations and exhibit fine-scale details. Conventional segmentation architectures like U-Net rely on convolution operators that are not rotation-invariant, leading to degraded segmentation accuracy across varying viewpoints. Rotation invariance can be achieved by expanding the filter bank across multiple orientations; however, this will significantly increase computational cost and memory traffic. In this paper, we introduce a GPU-optimized rotation-invariant convolution framework that eliminates the traditional data-lowering (im2col) step required for matrix-multiplication-based convolution. By exploiting structured data sharing among symmetrically rotated filters, our method achieves multi-orientation convolution with greatly reduced memory traffic and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · UAV Applications and Optimization · Advanced Image and Video Retrieval Techniques
