HM-Net: A Regression Network for Object Center Detection and Tracking on Wide Area Motion Imagery
Hakki Motorcu, Hasan F. Ates, H. Fatih Ugurdag, and Bahadir Gunturk

TL;DR
HM-Net is a fast, deep learning-based model for object detection and tracking in high-resolution Wide Area Motion Imagery, outperforming existing methods in accuracy and speed by leveraging heat map predictions and feedback filters.
Contribution
The paper introduces HM-Net, a novel heat map-based neural network that jointly detects and tracks objects in WAMI data, improving speed and accuracy over prior methods.
Findings
HM-Net achieves 96.2% F1 and 94.4% mAP detection scores.
HM-Net outperforms state-of-the-art in detection and tracking accuracy.
HM-Net is significantly faster than existing frame differencing methods.
Abstract
Wide Area Motion Imagery (WAMI) yields high-resolution images with a large number of extremely small objects. Target objects have large spatial displacements throughout consecutive frames. This nature of WAMI images makes object tracking and detection challenging. In this paper, we present our deep neural network-based combined object detection and tracking model, namely, Heat Map Network (HM-Net). HM-Net is significantly faster than state-of-the-art frame differencing and background subtraction-based methods, without compromising detection and tracking performances. HM-Net follows the object center-based joint detection and tracking paradigm. Simple heat map-based predictions support an unlimited number of simultaneous detections. The proposed method uses two consecutive frames and the object detection heat map obtained from the previous frame as input, which helps HM-Net monitor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Retinal Imaging and Analysis · Advanced Image and Video Retrieval Techniques
