SMTrack: End-to-End Trained Spiking Neural Networks for Multi-Object Tracking in RGB Videos
Pengzhi Zhong, Xinzhe Wang, Dan Zeng, Qihua Zhou, Feixiang He, and Shuiwang Li

TL;DR
SMTrack is a pioneering end-to-end trained spiking neural network framework for multi-object tracking in RGB videos, introducing novel loss and association modules to enhance accuracy and robustness in complex scenarios.
Contribution
This work is the first to develop a directly trained deep SNN framework for end-to-end multi-object tracking on standard RGB videos, incorporating novel loss and association techniques.
Findings
Achieves performance comparable to leading ANN-based methods.
Introduces Asa-NWDLoss for scale-aware detection.
Demonstrates robustness in complex tracking scenarios.
Abstract
Brain-inspired Spiking Neural Networks (SNNs) exhibit significant potential for low-power computation, yet their application in visual tasks remains largely confined to image classification, object detection, and event-based tracking. In contrast, real-world vision systems still widely use conventional RGB video streams, where the potential of directly-trained SNNs for complex temporal tasks such as multi-object tracking (MOT) remains underexplored. To address this challenge, we propose SMTrack-the first directly trained deep SNN framework for end-to-end multi-object tracking on standard RGB videos. SMTrack introduces an adaptive and scale-aware Normalized Wasserstein Distance loss (Asa-NWDLoss) to improve detection and localization performance under varying object scales and densities. Specifically, the method computes the average object size within each training batch and dynamically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Video Surveillance and Tracking Methods · CCD and CMOS Imaging Sensors
