DiffPoint: Single and Multi-view Point Cloud Reconstruction with ViT Based Diffusion Model
Yu Feng, Xing Shi, Mengli Cheng, Yun Xiong

TL;DR
DiffPoint introduces a novel architecture combining ViT and diffusion models to enhance high-quality 3D point cloud reconstruction from images, achieving state-of-the-art results in both single-view and multi-view scenarios.
Contribution
The paper presents a new DiffPoint architecture that effectively integrates ViT and diffusion models for improved point cloud reconstruction from images.
Findings
Achieves state-of-the-art results in single-view reconstruction
Demonstrates effective multi-view reconstruction with feature fusion
Validates the feasibility of unified architectures for 3D reconstruction
Abstract
As the task of 2D-to-3D reconstruction has gained significant attention in various real-world scenarios, it becomes crucial to be able to generate high-quality point clouds. Despite the recent success of deep learning models in generating point clouds, there are still challenges in producing high-fidelity results due to the disparities between images and point clouds. While vision transformers (ViT) and diffusion models have shown promise in various vision tasks, their benefits for reconstructing point clouds from images have not been demonstrated yet. In this paper, we first propose a neat and powerful architecture called DiffPoint that combines ViT and diffusion models for the task of point cloud reconstruction. At each diffusion step, we divide the noisy point clouds into irregular patches. Then, using a standard ViT backbone that treats all inputs as tokens (including time…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage · Remote Sensing and LiDAR Applications
MethodsDiffusion
