A Machine Learning Approach to Recovery of Scene Geometry from Images
Hoang Trinh

TL;DR
This paper presents an unsupervised machine learning method for recovering 3D scene geometry from images, using conditional random fields and energy minimization, achieving state-of-the-art results without labeled data.
Contribution
Introduces an unsupervised CRF learning framework for 3D scene reconstruction from images, extending to structure and motion estimation with novel evaluation metrics.
Findings
Unsupervised training outperforms related work on stereo datasets.
Effective 3D shape recovery from single images, stereo pairs, and videos.
Successful estimation of surface velocity in dynamic scenes.
Abstract
Recovering the 3D structure of the scene from images yields useful information for tasks such as shape and scene recognition, object detection, or motion planning and object grasping in robotics. In this thesis, we introduce a general machine learning approach called unsupervised CRF learning based on maximizing the conditional likelihood. We apply our approach to computer vision systems that recover the 3-D scene geometry from images. We focus on recovering 3D geometry from single images, stereo pairs and video sequences. Building these systems requires algorithms for doing inference as well as learning the parameters of conditional Markov random fields (MRF). Our system is trained unsupervisedly without using ground-truth labeled data. We employ a slanted-plane stereo vision model in which we use a fixed over-segmentation to segment the left image into coherent regions called…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image and Video Retrieval Techniques · Computer Graphics and Visualization Techniques
