A Machine Learning Approach to Recovery of Scene Geometry from Images

Hoang Trinh

arXiv:1007.2958·cs.CV·July 20, 2010·2 cites

A Machine Learning Approach to Recovery of Scene Geometry from Images

Hoang Trinh

PDF

Open Access

TL;DR

This paper presents an unsupervised machine learning method for recovering 3D scene geometry from images, using conditional random fields and energy minimization, achieving state-of-the-art results without labeled data.

Contribution

Introduces an unsupervised CRF learning framework for 3D scene reconstruction from images, extending to structure and motion estimation with novel evaluation metrics.

Findings

01

Unsupervised training outperforms related work on stereo datasets.

02

Effective 3D shape recovery from single images, stereo pairs, and videos.

03

Successful estimation of surface velocity in dynamic scenes.

Abstract

Recovering the 3D structure of the scene from images yields useful information for tasks such as shape and scene recognition, object detection, or motion planning and object grasping in robotics. In this thesis, we introduce a general machine learning approach called unsupervised CRF learning based on maximizing the conditional likelihood. We apply our approach to computer vision systems that recover the 3-D scene geometry from images. We focus on recovering 3D geometry from single images, stereo pairs and video sequences. Building these systems requires algorithms for doing inference as well as learning the parameters of conditional Markov random fields (MRF). Our system is trained unsupervisedly without using ground-truth labeled data. We employ a slanted-plane stereo vision model in which we use a fixed over-segmentation to segment the left image into coherent regions called…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Advanced Image and Video Retrieval Techniques · Computer Graphics and Visualization Techniques