One Homography is All You Need: IMM-based Joint Homography and Multiple Object State Estimation
Paul Johannes Claasen, Johan Pieter de Villiers

TL;DR
This paper introduces IMM-JHSE, a novel online multi-object tracking algorithm that uses a single homography estimate and IMM filtering to jointly model camera motion and object states, improving tracking accuracy across multiple datasets.
Contribution
The paper presents a new online MOT method that jointly models homography and object states using IMM filtering, reducing reliance on explicit camera motion compensation and enhancing robustness.
Findings
Outperforms existing methods on DanceTrack and KITTI-car datasets.
Achieves higher HOTA scores, indicating better tracking accuracy.
Competitive performance on MOT17, MOT20, and KITTI-pedestrian datasets.
Abstract
A novel online MOT algorithm, IMM Joint Homography State Estimation (IMM-JHSE), is proposed. IMM-JHSE uses an initial homography estimate as the only additional 3D information, whereas other 3D MOT methods use regular 3D measurements. By jointly modelling the homography matrix and its dynamics as part of track state vectors, IMM-JHSE removes the explicit influence of camera motion compensation techniques on predicted track position states, which was prevalent in previous approaches. Expanding upon this, static and dynamic camera motion models are combined using an IMM filter. A simple bounding box motion model is used to predict bounding box positions to incorporate image plane information. In addition to applying an IMM to camera motion, a non-standard IMM approach is applied where bounding-box-based BIoU scores are mixed with ground-plane-based Mahalanobis distances in an IMM-like…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrared Target Detection Methodologies · Video Surveillance and Tracking Methods · Target Tracking and Data Fusion in Sensor Networks
