Talk to Parallel LiDARs: A Human-LiDAR Interaction Method Based on 3D   Visual Grounding

Yuhang Liu; Boyi Sun; Guixu Zheng; Yishuo Wang; Jing Wang; Fei-Yue; Wang

arXiv:2405.15274·cs.CV·May 27, 2024

Talk to Parallel LiDARs: A Human-LiDAR Interaction Method Based on 3D Visual Grounding

Yuhang Liu, Boyi Sun, Guixu Zheng, Yishuo Wang, Jing Wang, Fei-Yue, Wang

PDF

Open Access

TL;DR

This paper introduces a novel human-LiDAR interaction method using 3D visual grounding, proposing new datasets and models to enhance cognitive capabilities of LiDAR systems in autonomous driving.

Contribution

It presents the Talk2LiDAR benchmark dataset and two grounding methods, advancing the integration of cognitive intelligence into parallel LiDAR systems.

Findings

01

BEVGrounding significantly improves grounding accuracy

02

Experiments demonstrate superior performance over existing methods

03

The approach enables more natural human-LiDAR interactions

Abstract

LiDAR sensors play a crucial role in various applications, especially in autonomous driving. Current research primarily focuses on optimizing perceptual models with point cloud data as input, while the exploration of deeper cognitive intelligence remains relatively limited. To address this challenge, parallel LiDARs have emerged as a novel theoretical framework for the next-generation intelligent LiDAR systems, which tightly integrate physical, digital, and social systems. To endow LiDAR systems with cognitive capabilities, we introduce the 3D visual grounding task into parallel LiDARs and present a novel human-computer interaction paradigm for LiDAR systems. We propose Talk2LiDAR, a large-scale benchmark dataset tailored for 3D visual grounding in autonomous driving. Additionally, we present a two-stage baseline approach and an efficient one-stage method named BEVGrounding, which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Human Pose and Action Recognition · Virtual Reality Applications and Impacts