SGCCNet: Single-Stage 3D Object Detector With Saliency-Guided Data   Augmentation and Confidence Correction Mechanism

Ao Liang; Wenyu Chen; Jian Fang; Huaici Zhao

arXiv:2407.01239·cs.CV·July 2, 2024

SGCCNet: Single-Stage 3D Object Detector With Saliency-Guided Data Augmentation and Confidence Correction Mechanism

Ao Liang, Wenyu Chen, Jian Fang, Huaici Zhao

PDF

Open Access

TL;DR

SGCCNet introduces saliency-guided data augmentation and confidence correction to improve single-stage 3D object detection accuracy and robustness, especially for low-quality objects, achieving state-of-the-art results on KITTI.

Contribution

The paper proposes SGCCNet with novel saliency-guided augmentation and confidence correction mechanisms, enhancing detection performance and addressing key challenges in point-based 3D object detection.

Findings

01

Achieves 80.82% AP3D on KITTI Moderate level, outperforming previous methods.

02

Effectively improves detection robustness for low-quality objects.

03

Demonstrates portability to other point-based detectors.

Abstract

The single-stage point-based 3D object detectors have attracted widespread research interest due to their advantages of lightweight and fast inference speed. However, they still face challenges such as inadequate learning of low-quality objects (ILQ) and misalignment between localization accuracy and classification confidence (MLC). In this paper, we propose SGCCNet to alleviate these two issues. For ILQ, SGCCNet adopts a Saliency-Guided Data Augmentation (SGDA) strategy to enhance the robustness of the model on low-quality objects by reducing its reliance on salient features. Specifically, We construct a classification task and then approximate the saliency scores of points by moving points towards the point cloud centroid in a differentiable process. During the training process, SGCCNet will be forced to learn from low saliency features through dropping points. Meanwhile, to avoid…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Visual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques