DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision

Lu Ling; Yichen Sheng; Zhi Tu; Wentian Zhao; Cheng Xin; Kun Wan,; Lantao Yu; Qianyu Guo; Zixun Yu; Yawen Lu; Xuanmao Li; Xingpeng Sun; Rohan; Ashok; Aniruddha Mukherjee; Hao Kang; Xiangrui Kong; Gang Hua; Tianyi Zhang,; Bedrich Benes; Aniket Bera

arXiv:2312.16256·cs.CV·January 2, 2024·1 cites

DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision

Lu Ling, Yichen Sheng, Zhi Tu, Wentian Zhao, Cheng Xin, Kun Wan,, Lantao Yu, Qianyu Guo, Zixun Yu, Yawen Lu, Xuanmao Li, Xingpeng Sun, Rohan, Ashok, Aniruddha Mukherjee, Hao Kang, Xiangrui Kong, Gang Hua, Tianyi Zhang,, Bedrich Benes, Aniket Bera

PDF

Open Access 1 Repo 1 Datasets

TL;DR

DL3DV-10K is a comprehensive large-scale dataset for deep learning-based 3D vision, enabling better benchmarking and advancing research in novel view synthesis and 3D representation learning.

Contribution

The paper introduces DL3DV-10K, a large-scale, diverse scene dataset, and provides benchmark results and initial models demonstrating its utility for 3D vision research.

Findings

01

Benchmarking reveals strengths and weaknesses of recent NVS methods.

02

Large-scale data improves generalization in NeRF learning.

03

Dataset and models are publicly available for research use.

Abstract

We have witnessed significant progress in deep learning-based 3D vision, ranging from neural radiance field (NeRF) based 3D representation learning to applications in novel view synthesis (NVS). However, existing scene-level datasets for deep learning-based 3D vision, limited to either synthetic environments or a narrow selection of real-world scenes, are quite insufficient. This insufficiency not only hinders a comprehensive benchmark of existing methods but also caps what could be explored in deep learning-based 3D analysis. To address this critical gap, we present DL3DV-10K, a large-scale scene dataset, featuring 51.2 million frames from 10,510 videos captured from 65 types of point-of-interest (POI) locations, covering both bounded and unbounded scenes, with different levels of reflection, transparency, and lighting. We conducted a comprehensive benchmark of recent NVS methods on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

DL3DV-10K/Dataset
none

Datasets

Passwerob/CoherentGS
dataset· 31 dl
31 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · 3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques