Simultaneous Detection and Tracking with Motion Modelling for Multiple Object Tracking
ShiJie Sun, Naveed Akhtar, XiangYu Song, HuanSheng Song, Ajmal Mian,, Mubarak Shah

TL;DR
This paper introduces DMM-Net, a deep learning model that jointly detects and tracks multiple objects by estimating their motion parameters, achieving high speed and accuracy without relying on detector-biased methods.
Contribution
The paper presents DMM-Net, a novel end-to-end deep network for simultaneous detection and tracking that models object motion, and introduces Omni-MOT, a large synthetic dataset for vehicle tracking evaluation.
Findings
DMM-Net achieves PR-MOTA score of 12.80 at 120+ fps on UA-DETRAC.
Omni-MOT dataset contains over 14 million frames with precise ground-truth annotations.
DMM-Net outperforms existing methods in speed and accuracy, reducing detector bias.
Abstract
Deep learning-based Multiple Object Tracking (MOT) currently relies on off-the-shelf detectors for tracking-by-detection.This results in deep models that are detector biased and evaluations that are detector influenced. To resolve this issue, we introduce Deep Motion Modeling Network (DMM-Net) that can estimate multiple objects' motion parameters to perform joint detection and association in an end-to-end manner. DMM-Net models object features over multiple frames and simultaneously infers object classes, visibility, and their motion parameters. These outputs are readily used to update the tracklets for efficient MOT. DMM-Net achieves PR-MOTA score of 12.80 @ 120+ fps for the popular UA-DETRAC challenge, which is better performance and orders of magnitude faster. We also contribute a synthetic large-scale public dataset Omni-MOT for vehicle tracking that provides precise ground-truth…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Neural Network Applications · Air Quality Monitoring and Forecasting
