Why my photos look sideways or upside down? Detecting Canonical Orientation of Images using Convolutional Neural Networks
Kunal Swami, Pranav P. Deshpande, Gaurav Khandelwal, Ajay, Vijayvargiya

TL;DR
This paper introduces a deep learning approach using pre-trained CNNs to accurately detect the canonical orientation of images, outperforming previous methods and approaching human-level accuracy.
Contribution
It is the first to leverage large-scale pre-trained CNNs and extensive datasets for image orientation detection, significantly improving accuracy over existing low-level feature methods.
Findings
Outperforms state-of-the-art methods in orientation detection accuracy.
Generalizes well across various public datasets.
Achieves accuracy close to human performance.
Abstract
Image orientation detection requires high-level scene understanding. Humans use object recognition and contextual scene information to correctly orient images. In literature, the problem of image orientation detection is mostly confronted by using low-level vision features, while some approaches incorporate few easily detectable semantic cues to gain minor improvements. The vast amount of semantic content in images makes orientation detection challenging, and therefore there is a large semantic gap between existing methods and human behavior. Also, existing methods in literature report highly discrepant detection rates, which is mainly due to large differences in datasets and limited variety of test images used for evaluation. In this work, for the first time, we leverage the power of deep learning and adapt pre-trained convolutional neural networks using largest training dataset…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
