Multiple Object Tracking by Flowing and Fusing
Jimuyang Zhang, Sanping Zhou, Xin Chang, Fangbin Wan, Jinjun Wang,, Yang Wu, Dong Huang

TL;DR
The paper introduces Flow-Fuse-Tracker, an end-to-end deep neural network for multiple object tracking that effectively models target motions and fuses detection sources, achieving state-of-the-art results on key benchmarks.
Contribution
It proposes a novel end-to-end DNN framework with target flowing and fusing techniques, improving scalability and accuracy in MOT tasks.
Findings
Achieved top MOTA scores on MOT benchmarks
Outperformed existing online and offline methods
Effectively models complex target motions
Abstract
Most of Multiple Object Tracking (MOT) approaches compute individual target features for two subtasks: estimating target-wise motions and conducting pair-wise Re-Identification (Re-ID). Because of the indefinite number of targets among video frames, both subtasks are very difficult to scale up efficiently in end-to-end Deep Neural Networks (DNNs). In this paper, we design an end-to-end DNN tracking approach, Flow-Fuse-Tracker (FFT), that addresses the above issues with two efficient techniques: target flowing and target fusing. Specifically, in target flowing, a FlowTracker DNN module learns the indefinite number of target-wise motions jointly from pixel-level optical flows. In target fusing, a FuseTracker DNN module refines and fuses targets proposed by FlowTracker and frame-wise object detection, instead of trusting either of the two inaccurate sources of target proposal. Because…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Infrared Target Detection Methodologies · Advanced Neural Network Applications
