A Survey of Representation Learning, Optimization Strategies, and Applications for Omnidirectional Vision
Hao Ai, Zidong Cao, Lin Wang

TL;DR
This paper systematically reviews recent deep learning methods for omnidirectional vision, covering principles, representations, optimization, tasks, applications, challenges, and future directions in this rapidly evolving field.
Contribution
It provides the first comprehensive survey of DL approaches for ODI, including taxonomy, challenges, and emerging research directions, advancing understanding in this niche area.
Findings
Deep learning has significantly advanced ODI analysis.
A hierarchical taxonomy of DL methods for ODI tasks.
Identification of key challenges and open research questions.
Abstract
Omnidirectional image (ODI) data is captured with a field-of-view of 360x180, which is much wider than the pinhole cameras and captures richer surrounding environment details than the conventional perspective images. In recent years, the availability of customer-level 360 cameras has made omnidirectional vision more popular, and the advance of deep learning (DL) has significantly sparked its research and applications. This paper presents a systematic and comprehensive review and analysis of the recent progress of DL for omnidirectional vision. It delineates the distinct challenges and complexities encountered in applying DL to omnidirectional images as opposed to traditional perspective imagery. Our work covers four main contents: (i) A thorough introduction to the principles of omnidirectional imaging and commonly explored projections of ODI; (ii) A methodical review of varied…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification · Video Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques
