Computing a Characteristic Orientation for Rotation-Independent Image Analysis
Cristian Valero-Abundio, Emilio Sansano-Sansano, Ra\'ul Montoliu, Marina Mart\'inez Garc\'ia

TL;DR
This paper presents GID, a preprocessing technique that estimates and aligns images to a canonical orientation, enhancing rotation robustness in deep learning models without altering their architecture.
Contribution
Introduction of GID, a novel preprocessing method that improves rotation invariance by aligning images to a canonical frame, compatible with standard neural networks.
Findings
GID outperforms state-of-the-art rotation-invariant models on rotated MNIST.
The method maintains effectiveness on complex datasets like CIFAR-10.
GID does not require architectural modifications, reducing computational overhead.
Abstract
Handling geometric transformations, particularly rotations, remains a challenge in deep learning for computer vision. Standard neural networks lack inherent rotation invariance and typically rely on data augmentation or architectural modifications to improve robustness. Although effective, these approaches increase computational demands, require specialised implementations, or alter network structures, limiting their applicability. This paper introduces General Intensity Direction (GID), a preprocessing method that improves rotation robustness without modifying the network architecture. The method estimates a global orientation for each image and aligns it to a canonical reference frame, allowing standard models to process inputs more consistently across different rotations. Unlike moment-based approaches that extract invariant descriptors, this method directly transforms the image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image and Object Detection Techniques · Image Retrieval and Classification Techniques
