ShaSTA-Fuse: Camera-LiDAR Sensor Fusion to Model Shape and Spatio-Temporal Affinities for 3D Multi-Object Tracking
Tara Sadjadpour, Rares Ambrus, Jeannette Bohg

TL;DR
ShaSTA-Fuse introduces a novel camera-LiDAR fusion method for 3D multi-object tracking, enhancing affinity estimation and track confidence refinement, especially for distant objects, achieving state-of-the-art results on nuScenes.
Contribution
It presents a new multimodal fusion approach combining camera and LiDAR data for improved 3D MOT, including a unique track confidence refinement technique.
Findings
Achieves state-of-the-art performance on nuScenes benchmark.
Enhances tracking of small, distant objects.
Demonstrates benefits of multimodal fusion through ablative analysis.
Abstract
3D multi-object tracking (MOT) is essential for an autonomous mobile agent to safely navigate a scene. In order to maximize the perception capabilities of the autonomous agent, we aim to develop a 3D MOT framework that fuses camera and LiDAR sensor information. Building on our prior LiDAR-only work, ShaSTA, which models shape and spatio-temporal affinities for 3D MOT, we propose a novel camera-LiDAR fusion approach for learning affinities. At its core, this work proposes a fusion technique that generates a rich sensory signal incorporating information about depth and distant objects to enhance affinity estimation for improved data association, track lifecycle management, false-positive elimination, false-negative propagation, and track confidence score refinement. Our main contributions include a novel fusion approach for combining camera and LiDAR sensory signals to learn affinities,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Air Quality Monitoring and Forecasting · Advanced Optical Sensing Technologies
