Distill, Diffuse, and Semanticize (DDS): Annotation-Free 3D Scene Understanding Based on Multi-Granularity Distillation and Graph-Diffusion-Based Segmentation
Yijing Wang, Ruonan Li, Qilin Wang, Rongqiang Zhao, Jie Liu

TL;DR
DDS is a resource-efficient framework for annotation-free 3D scene understanding that combines multi-granularity distillation and graph diffusion to produce coherent, semanticized region representations without dense annotations.
Contribution
It introduces a novel structure-oriented approach that integrates semantic cues and graph diffusion, achieving state-of-the-art performance in annotation-free 3D scene understanding.
Findings
DDS outperforms existing structure-oriented baselines in accuracy metrics.
It improves region consistency and semantic recognition efficiency.
The method is scalable and interpretable for real-world 3D scene analysis.
Abstract
3D semantic scene understanding is essential for digital twins, autonomous driving, smart agriculture, and embodied perception, yet dense point-wise annotation for point clouds remains expensive and difficult to scale. Existing annotation-free methods often face a trade-off between semantic recognition and structural efficiency: open-vocabulary and foundation-model-driven methods provide strong semantic priors, but often come with substantial computational costs, while structure-oriented methods based on superpoints, clustering, and graph reasoning are lightweight but often produce category-agnostic regions. We propose DDS, a resource-efficient structure-oriented framework for region-consistent and semanticized annotation-free 3D scene understanding. DDS preserves the lightweight superpoint-based organization paradigm while incorporating visual semantic cues from projected features and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
