TL;DR
Points-to-3D introduces a diffusion-based framework that effectively uses point cloud priors to improve geometry control and quality in 3D asset and scene generation, outperforming existing methods.
Contribution
The paper presents a novel diffusion model that incorporates point cloud priors for structure-aware 3D generation, enabling better geometric fidelity and control.
Findings
Outperforms state-of-the-art methods in rendering quality and geometric fidelity.
Effectively utilizes point cloud priors from sensors or estimators like VGGT.
Demonstrates superior results on both object and scene scenarios.
Abstract
Recent progress in 3D generation has been driven largely by models conditioned on images or text, while readily available 3D priors are still underused. In many real-world scenarios, the visible-region point cloud are easy to obtain from active sensors such as LiDAR or from feed-forward predictors like VGGT, offering explicit geometric constraints that current methods fail to exploit. In this work, we introduce Points-to-3D, a diffusion-based framework that leverages point cloud priors for geometry-controllable 3D asset and scene generation. Built on a latent 3D diffusion model TRELLIS, Points-to-3D first replaces pure-noise sparse structure latent initialization with a point cloud priors tailored input formulation.A structure inpainting network, trained within the TRELLIS framework on task-specific data designed to learn global structural inpainting, is then used for inference with a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
