Playing for Depth
Mohammad Mahdi Haji-Esmaeili, Gholamali Montazer

TL;DR
This paper introduces a new synthetic depth dataset from video games that improves monocular depth estimation accuracy in real-world scenarios, addressing data scarcity and generalization challenges.
Contribution
The authors present a novel, easily reproducible method for capturing high-quality depth data from video games to enhance depth estimation models.
Findings
Synthetic video game data improves depth estimation accuracy.
Models trained on this data generalize better to real-world scenes.
The dataset enables training without stereo camera constraints.
Abstract
Estimating the relative depth of a scene is a significant step towards understanding the general structure of the depicted scenery, the relations of entities in the scene and their interactions. When faced with the task of estimating depth without the use of Stereo images, we are dependent on the availability of large-scale depth datasets and high-capacity models to capture the intrinsic nature of depth. Unfortunately, creating datasets of depth images is not a trivial task as the requirements for the camera mainly limits us to areas where we can provide the necessities for the camera to work. In this work, we present a new depth dataset captured from Video Games in an easy and reproducible way. The nature of open-world video games gives us the ability to capture high-quality depth maps in the wild without the constrictions of Stereo cameras. Experiments on this dataset shows that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Video Analysis and Summarization
