Relation3DMOT: Exploiting Deep Affinity for 3D Multi-Object Tracking from View Aggregation
Can Chen, Luca Zanotti Fragonara, Antonios Tsourdos

TL;DR
This paper introduces Relation3DMOT, a novel 3D multi-object tracking method that fuses 2D and 3D features and employs a new convolutional operation to improve data association, achieving state-of-the-art results.
Contribution
The paper proposes a joint feature extractor and a new RelationConv operation to better exploit object correlations for improved 3D MOT.
Findings
Achieves state-of-the-art performance on KITTI benchmark.
Effectively fuses 2D and 3D appearance features.
Improves data association accuracy with RelationConv.
Abstract
Autonomous systems need to localize and track surrounding objects in 3D space for safe motion planning. As a result, 3D multi-object tracking (MOT) plays a vital role in autonomous navigation. Most MOT methods use a tracking-by-detection pipeline, which includes object detection and data association processing. However, many approaches detect objects in 2D RGB sequences for tracking, which is lack of reliability when localizing objects in 3D space. Furthermore, it is still challenging to learn discriminative features for temporally-consistent detection in different frames, and the affinity matrix is normally learned from independent object features without considering the feature interaction between detected objects in the different frames. To settle these problems, We firstly employ a joint feature extractor to fuse the 2D and 3D appearance features captured from both 2D RGB images and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Neural Network Applications · Autonomous Vehicle Technology and Safety
