Diff3DETR:Agent-based Diffusion Model for Semi-supervised 3D Object   Detection

Jiacheng Deng; Jiahao Lu; Tianzhu Zhang

arXiv:2408.00286·cs.CV·August 2, 2024

Diff3DETR:Agent-based Diffusion Model for Semi-supervised 3D Object Detection

Jiacheng Deng, Jiahao Lu, Tianzhu Zhang

PDF

Open Access

TL;DR

Diff3DETR introduces an agent-based diffusion model that enhances semi-supervised 3D object detection by generating adaptive object queries and refining bounding boxes, leading to superior performance on benchmark datasets.

Contribution

The paper presents a novel agent-based diffusion approach with a box-aware denoising module for improved semi-supervised 3D detection.

Findings

01

Outperforms state-of-the-art methods on ScanNet and SUN RGB-D datasets.

02

Effectively generates diverse and high-quality pseudo-labels.

03

Refines bounding boxes incrementally using a diffusion process.

Abstract

3D object detection is essential for understanding 3D scenes. Contemporary techniques often require extensive annotated training data, yet obtaining point-wise annotations for point clouds is time-consuming and laborious. Recent developments in semi-supervised methods seek to mitigate this problem by employing a teacher-student framework to generate pseudo-labels for unlabeled point clouds. However, these pseudo-labels frequently suffer from insufficient diversity and inferior quality. To overcome these hurdles, we introduce an Agent-based Diffusion Model for Semi-supervised 3D Object Detection (Diff3DETR). Specifically, an agent-based object query generator is designed to produce object queries that effectively adapt to dynamic scenes while striking a balance between sampling locations and content embedding. Additionally, a box-aware denoising module utilizes the DDIM denoising process…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIndustrial Vision Systems and Defect Detection

MethodsSoftmax · Attention Is All You Need · Diffusion