SpotNet: An Image Centric, Lidar Anchored Approach To Long Range   Perception

Louis Foucard; Samar Khanna; Yi Shi; Chi-Kuei Liu; Quinn Z Shen,; Thuyen Ngo; Zi-Xiang Xia

arXiv:2405.15843·cs.CV·May 28, 2024

SpotNet: An Image Centric, Lidar Anchored Approach To Long Range Perception

Louis Foucard, Samar Khanna, Yi Shi, Chi-Kuei Liu, Quinn Z Shen,, Thuyen Ngo, Zi-Xiang Xia

PDF

Open Access

TL;DR

SpotNet introduces a fast, single-stage, image-centric approach for long-range 3D object detection that efficiently fuses LiDAR and camera data, scaling independently of range and enabling transfer across image resolutions.

Contribution

The paper presents a novel LiDAR-anchored, image-centric detection architecture that scales with range as O(1) and transfers across image resolutions without retraining.

Findings

01

Achieves accurate 3D detection with sparse LiDAR data.

02

Scales independently of range, unlike BEV methods.

03

Transfers across image resolutions without retraining.

Abstract

In this paper, we propose SpotNet: a fast, single stage, image-centric but LiDAR anchored approach for long range 3D object detection. We demonstrate that our approach to LiDAR/image sensor fusion, combined with the joint learning of 2D and 3D detection tasks, can lead to accurate 3D object detection with very sparse LiDAR support. Unlike more recent bird's-eye-view (BEV) sensor-fusion methods which scale with range $r$ as $O (r^{2})$ , SpotNet scales as $O (1)$ with range. We argue that such an architecture is ideally suited to leverage each sensor's strength, i.e. semantic understanding from images and accurate range finding from LiDAR data. Finally we show that anchoring detections on LiDAR points removes the need to regress distances, and so the architecture is able to transfer from 2MP to 8MP resolution images without re-training.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Remote Sensing and LiDAR Applications · Industrial Vision Systems and Defect Detection