Ultra3D: Efficient and High-Fidelity 3D Generation with Part Attention

Yiwen Chen; Zhihao Li; Yikai Wang; Hu Zhang; Qin Li; Chi Zhang; Guosheng Lin

arXiv:2507.17745·cs.CV·August 1, 2025

Ultra3D: Efficient and High-Fidelity 3D Generation with Part Attention

Yiwen Chen, Zhihao Li, Yikai Wang, Hu Zhang, Qin Li, Chi Zhang, Guosheng Lin

PDF

Open Access

TL;DR

Ultra3D introduces a fast, high-fidelity 3D generation framework that uses part-aware attention and efficient representations to significantly reduce computation while maintaining quality.

Contribution

It proposes Part Attention and VecSet for efficient, high-resolution 3D generation, reducing complexity and accelerating voxel modeling without quality loss.

Findings

01

Achieves up to 6.7x speed-up in latent generation.

02

Supports high-resolution 1024 3D generation.

03

Outperforms previous methods in visual fidelity and user preference.

Abstract

Recent advances in sparse voxel representations have significantly improved the quality of 3D content generation, enabling high-resolution modeling with fine-grained geometry. However, existing frameworks suffer from severe computational inefficiencies due to the quadratic complexity of attention mechanisms in their two-stage diffusion pipelines. In this work, we propose Ultra3D, an efficient 3D generation framework that significantly accelerates sparse voxel modeling without compromising quality. Our method leverages the compact VecSet representation to efficiently generate a coarse object layout in the first stage, reducing token count and accelerating voxel coordinate prediction. To refine per-voxel latent features in the second stage, we introduce Part Attention, a geometry-aware localized attention mechanism that restricts attention computation within semantically consistent part…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputer Graphics and Visualization Techniques · Advanced Vision and Imaging · Augmented Reality Applications