TopoDiT-3D: Topology-Aware Diffusion Transformer with Bottleneck Structure for 3D Point Cloud Generation

Zechao Guan; Feng Yan; Shuai Du; Lin Ma; Qingshan Liu

arXiv:2505.09140·cs.CV·May 15, 2025

TopoDiT-3D: Topology-Aware Diffusion Transformer with Bottleneck Structure for 3D Point Cloud Generation

Zechao Guan, Feng Yan, Shuai Du, Lin Ma, Qingshan Liu

PDF

Open Access 1 Repo

TL;DR

TopoDiT-3D introduces a topology-aware diffusion transformer with a bottleneck structure that effectively incorporates global topological information into 3D point cloud generation, enhancing quality and diversity.

Contribution

The paper presents a novel topology-aware diffusion transformer that integrates persistent homology into feature learning using a Perceiver Resampler-based bottleneck, improving 3D point cloud generation.

Findings

01

Outperforms state-of-the-art models in visual quality and diversity.

02

Enhances training efficiency through adaptive filtering of local features.

03

Highlights the importance of topological information in 3D shape generation.

Abstract

Recent advancements in Diffusion Transformer (DiT) models have significantly improved 3D point cloud generation. However, existing methods primarily focus on local feature extraction while overlooking global topological information, such as voids, which are crucial for maintaining shape consistency and capturing complex geometries. To address this limitation, we propose TopoDiT-3D, a Topology-Aware Diffusion Transformer with a bottleneck structure for 3D point cloud generation. Specifically, we design the bottleneck structure utilizing Perceiver Resampler, which not only offers a mode to integrate topological information extracted through persistent homology into feature learning, but also adaptively filters out redundant local features to improve training efficiency. Experimental results demonstrate that TopoDiT-3D outperforms state-of-the-art models in visual quality, diversity, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zechao-guan/topodit-3d
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage · Computer Graphics and Visualization Techniques

MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Dense Connections · Dropout · Layer Normalization · Diffusion · Focus · Byte Pair Encoding · Softmax