Diff3DETR:Agent-based Diffusion Model for Semi-supervised 3D Object Detection
Jiacheng Deng, Jiahao Lu, Tianzhu Zhang

TL;DR
Diff3DETR introduces an agent-based diffusion model that enhances semi-supervised 3D object detection by generating adaptive object queries and refining bounding boxes, leading to superior performance on benchmark datasets.
Contribution
The paper presents a novel agent-based diffusion approach with a box-aware denoising module for improved semi-supervised 3D detection.
Findings
Outperforms state-of-the-art methods on ScanNet and SUN RGB-D datasets.
Effectively generates diverse and high-quality pseudo-labels.
Refines bounding boxes incrementally using a diffusion process.
Abstract
3D object detection is essential for understanding 3D scenes. Contemporary techniques often require extensive annotated training data, yet obtaining point-wise annotations for point clouds is time-consuming and laborious. Recent developments in semi-supervised methods seek to mitigate this problem by employing a teacher-student framework to generate pseudo-labels for unlabeled point clouds. However, these pseudo-labels frequently suffer from insufficient diversity and inferior quality. To overcome these hurdles, we introduce an Agent-based Diffusion Model for Semi-supervised 3D Object Detection (Diff3DETR). Specifically, an agent-based object query generator is designed to produce object queries that effectively adapt to dynamic scenes while striking a balance between sampling locations and content embedding. Additionally, a box-aware denoising module utilizes the DDIM denoising process…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIndustrial Vision Systems and Defect Detection
MethodsSoftmax · Attention Is All You Need · Diffusion
