DiffRef3D: A Diffusion-based Proposal Refinement Framework for 3D Object   Detection

Se-Ho Kim; Inyong Koo; Inyoung Lee; Byeongjun Park; Changick Kim

arXiv:2310.16349·cs.CV·October 26, 2023·2 cites

DiffRef3D: A Diffusion-based Proposal Refinement Framework for 3D Object Detection

Se-Ho Kim, Inyong Koo, Inyoung Lee, Byeongjun Park, Changick Kim

PDF

Open Access

TL;DR

DiffRef3D introduces a diffusion-based framework for refining 3D object proposals in point cloud data, significantly enhancing detection accuracy by iteratively denoising residuals during training and inference.

Contribution

It is the first to apply diffusion processes to 3D object proposal refinement, offering a versatile method that improves existing detection models.

Findings

01

Consistently improves 3D detection performance on KITTI benchmark

02

Effective in refining proposals through iterative denoising

03

Versatile framework applicable to various 3D detectors

Abstract

Denoising diffusion models show remarkable performances in generative tasks, and their potential applications in perception tasks are gaining interest. In this paper, we introduce a novel framework named DiffRef3D which adopts the diffusion process on 3D object detection with point clouds for the first time. Specifically, we formulate the proposal refinement stage of two-stage 3D object detectors as a conditional diffusion process. During training, DiffRef3D gradually adds noise to the residuals between proposals and target objects, then applies the noisy residuals to proposals to generate hypotheses. The refinement module utilizes these hypotheses to denoise the noisy residuals and generate accurate box predictions. In the inference phase, DiffRef3D generates initial hypotheses by sampling noise from a Gaussian distribution as residuals and refines the hypotheses through iterative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning

MethodsDiffusion