Spatio-Temporal Point Process for Multiple Object Tracking
Tao Wang, Kean Chen, Weiyao Lin, John See, Zenghui Zhang, Qian Xu, and, Xia Jia

TL;DR
This paper introduces a novel spatio-temporal point process framework using conv-RNNs to predict and mask noisy detections in multiple object tracking, significantly improving tracking accuracy.
Contribution
It proposes a data-driven, neural network-based point process model to effectively handle noisy detections in MOT, enhancing existing tracking algorithms.
Findings
Improved tracking accuracy on MOT datasets
Effective masking of noisy detections
State-of-the-art performance achieved
Abstract
Multiple Object Tracking (MOT) focuses on modeling the relationship of detected objects among consecutive frames and merge them into different trajectories. MOT remains a challenging task as noisy and confusing detection results often hinder the final performance. Furthermore, most existing research are focusing on improving detection algorithms and association strategies. As such, we propose a novel framework that can effectively predict and mask-out the noisy and confusing detection results before associating the objects into trajectories. In particular, we formulate such "bad" detection results as a sequence of events and adopt the spatio-temporal point process}to model such events. Traditionally, the occurrence rate in a point process is characterized by an explicitly defined intensity function, which depends on the prior knowledge of some specific tasks. Thus, designing a proper…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
