DeepFusionMOT: A 3D Multi-Object Tracking Framework Based on Camera-LiDAR Fusion with Deep Association

Xiyang Wang; Chunyun Fu; Zhankun Li; Ying Lai; Jiawei He

arXiv:2202.12100 · cs.CV · August 29, 2022

TL;DR

DeepFusionMOT introduces a camera-LiDAR fusion approach for 3D multi-object tracking that balances high accuracy with computational efficiency by leveraging a deep association mechanism for effective data integration.

Contribution

It presents a novel fusion-based MOT framework with a deep association mechanism that improves tracking accuracy and speed over existing methods.

Findings

01. Outperforms state-of-the-art methods in accuracy and speed

02. Effective in tracking objects with limited LiDAR data

03. Achieves smooth fusion of 2D and 3D trajectories

Abstract

In the recent literature, on the one hand, many 3D multi-object tracking (MOT) works have focused on tracking accuracy and neglected computation speed, commonly by designing rather complex cost functions and feature extractors. On the other hand, some methods have focused too much on computation speed at the expense of tracking accuracy. In view of these issues, this paper proposes a robust and fast camera-LiDAR fusion-based MOT method that achieves a good trade-off between accuracy and speed. Relying on the characteristics of camera and LiDAR sensors, an effective deep association mechanism is designed and embedded in the proposed MOT method. This association mechanism realizes tracking of an object in a 2D domain when the object is far away and only detected by the camera, and updating of the 2D trajectory with 3D information obtained when the object appears in the LiDAR field of view…
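The 2D-to-3D hand-over described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the `Track` class, the `update_track` helper, the plain IoU gate, and the 0.3 threshold are all hypothetical choices made for illustration, and the paper's actual association costs and track management are not specified in the text above. The sketch only shows the idea of a track that lives in the image plane while the object is seen by the camera alone, then keeps its identity when a 3D box becomes available.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

# Hypothetical box layouts, chosen only for this sketch.
Box2D = Tuple[float, float, float, float]  # x1, y1, x2, y2 in image pixels
Box3D = Tuple[float, float, float, float, float, float, float]  # x, y, z, l, w, h, yaw


@dataclass
class Track:
    """A trajectory that starts in 2D and may later gain a 3D state."""
    track_id: int
    box2d: Box2D
    box3d: Optional[Box3D] = None

    @property
    def mode(self) -> str:
        # A track counts as 3D once it has received any 3D box.
        return "3D" if self.box3d is not None else "2D"


def iou_2d(a: Box2D, b: Box2D) -> float:
    """Intersection over union of two axis-aligned image boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def update_track(
    track: Track,
    det2d: Box2D,
    det3d: Optional[Box3D] = None,
    iou_threshold: float = 0.3,  # assumed value, not taken from the paper
) -> bool:
    """Update the track if the 2D detection overlaps it enough.

    When a 3D box accompanies the detection (object now inside the LiDAR
    field of view), it is attached to the same track, so the identity
    carries over from the 2D phase. Returns True if the track was updated.
    """
    if iou_2d(track.box2d, det2d) < iou_threshold:
        return False
    track.box2d = det2d
    if det3d is not None:
        track.box3d = det3d
    return True


# Usage: a distant object seen only by the camera, later picked up by LiDAR.
track = Track(track_id=1, box2d=(100.0, 100.0, 150.0, 200.0))
update_track(track, (102.0, 101.0, 153.0, 202.0))  # camera only
update_track(
    track,
    (105.0, 102.0, 158.0, 205.0),
    det3d=(12.0, 1.5, 40.0, 4.2, 1.8, 1.6, 0.0),  # camera + LiDAR
)
print(track.track_id, track.mode)
```

A real tracker would add motion prediction, one-to-one assignment across many tracks and detections, and track birth and death logic; the sketch omits all of that to keep the hand-over itself visible.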

Code & Models

Repositories

wangxiyang2022/DeepFusionMOT
Official

Taxonomy

Methods: SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings