ShaSTA: Modeling Shape and Spatio-Temporal Affinities for 3D Multi-Object Tracking
Tara Sadjadpour, Jie Li, Rares Ambrus, and Jeannette Bohg

TL;DR
ShaSTA introduces a unified 3D multi-object tracking framework that models shape and spatio-temporal affinities, improving robustness and accuracy in challenging scenarios like occlusions and false detections.
Contribution
It is the first comprehensive framework to jointly address data association, false-positive elimination, false-negative propagation, and track confidence refinement in 3D MOT.
Findings
Achieves 1st place on nuScenes LiDAR-only benchmark
Reduces false-positive and false-negative tracks
Increases true-positive track count
Abstract
Multi-object tracking is a cornerstone capability of any robotic system. The quality of tracking is largely dependent on the quality of the detector used. In many applications, such as autonomous vehicles, it is preferable to over-detect objects to avoid catastrophic outcomes due to missed detections. As a result, current state-of-the-art 3D detectors produce high rates of false-positives to ensure a low number of false-negatives. This can negatively affect tracking by making data association and track lifecycle management more challenging. Additionally, occasional false-negative detections due to difficult scenarios like occlusions can harm tracking performance. To address these issues in a unified framework, we propose to learn shape and spatio-temporal affinities between tracks and detections in consecutive frames. Our affinity provides a probabilistic matching that leads to robust…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Autonomous Vehicle Technology and Safety · Impact of Light on Environment and Health
