TL;DR
This paper introduces SceneEDNet, a fully convolutional neural network designed for direct scene flow estimation from RGB-D videos, leveraging deep learning to improve accuracy and efficiency in robotics applications.
Contribution
It presents the first deep learning approach with an encoder-decoder architecture for direct scene flow estimation from stereo image sequences.
Findings
Effective estimation of 3D motion vectors from stereo images.
Application on large datasets yields meaningful scene flow results.
Advances in deep learning improve scene flow estimation accuracy.
Abstract
Estimating scene flow in RGB-D videos is attracting much interest of the computer vision researchers, due to its potential applications in robotics. The state-of-the-art techniques for scene flow estimation, typically rely on the knowledge of scene structure of the frame and the correspondence between frames. However, with the increasing amount of RGB-D data captured from sophisticated sensors like Microsoft Kinect, and the recent advances in the area of sophisticated deep learning techniques, introduction of an efficient deep learning technique for scene flow estimation, is becoming important. This paper introduces a first effort to apply a deep learning method for direct estimation of scene flow by presenting a fully convolutional neural network with an encoder-decoder (ED) architecture. The proposed network SceneEDNet involves estimation of three dimensional motion vectors of all the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
