Optical Flow Estimation in 360$^\circ$ Videos: Dataset, Model and Application
Bin Duan, Keshav Bhandari, Gaowen Liu, Yan Yan

TL;DR
This paper introduces a new benchmark dataset FLOW360 for 360-degree videos, proposes a Siamese representation learning framework SLOF for omnidirectional optical flow estimation, and demonstrates its effectiveness in reducing prediction errors and improving activity recognition accuracy.
Contribution
The paper presents the first perceptually realistic 360-degree video dataset and a novel contrastive learning framework tailored for omnidirectional optical flow estimation.
Findings
FLOW360 dataset exhibits high perceptual realism and diversity.
SLOF significantly reduces optical flow prediction errors in 360-degree videos.
The approach boosts egocentric activity recognition accuracy by approximately 26%.
Abstract
Optical flow estimation has been a long-lasting and fundamental problem in the computer vision community. However, despite the advances of optical flow estimation in perspective videos, the 360 videos counterpart remains in its infancy, primarily due to the shortage of benchmark datasets and the failure to accommodate the omnidirectional nature of 360 videos. We propose the first perceptually realistic 360 filed-of-view video benchmark dataset, namely FLOW360, with 40 different videos and 4,000 video frames. We then conduct comprehensive characteristic analysis and extensive comparisons with existing datasets, manifesting FLOW360's perceptual realism, uniqueness, and diversity. Moreover, we present a novel Siamese representation Learning framework for Omnidirectional Flow (SLOF) estimation, which is trained in a contrastive manner via a hybrid loss that combines…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Retinal Imaging and Analysis · Advanced Image Processing Techniques
