ModTrack: Sensor-Agnostic Multi-View Tracking via Identity-Informed PHD Filtering with Covariance Propagation
Aditya Iyer, Jack Roberts, Nora Ayanian

TL;DR
ModTrack is a modular multi-view multi-object tracking system that achieves high accuracy, generalizes across sensors and datasets, and provides traceable uncertainty by combining analytical fusion with a GM-PHD filter.
Contribution
It introduces a sensor-agnostic, modular MV-MOT framework that confines learning to detection and feature extraction, enabling flexible fusion, association, and tracking with principled uncertainty modeling.
Findings
Achieves 95.5 IDF1 and 91.4 MOTA on WildTrack
Surpasses prior modular methods by over 21 points
Transfers unchanged to MultiviewX and RadarScenes datasets
Abstract
Multi-View Multi-Object Tracking (MV-MOT) aims to localize and maintain consistent identities of objects observed by multiple sensors. This task is challenging, as viewpoint changes and occlusion disrupt identity consistency across views and time. Recent end-to-end approaches address this by jointly learning 2D Bird's Eye View (BEV) representations and identity associations, achieving high tracking accuracy. However, these methods offer no principled uncertainty accounting and remain tightly coupled to their training configuration, limiting generalization across sensor layouts, modalities, or datasets without retraining. We propose ModTrack, a modular MV-MOT system that matches end-to-end performance while providing cross-modal, sensor-agnostic generalization and traceable uncertainty. ModTrack confines learning methods to just the \textit{Detection and Feature Extraction} stage of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
