TF-SASM: Training-free Spatial-aware Sparse Memory for Multi-object Tracking
Thuc Nguyen-Quang, Minh-Triet Tran

TL;DR
This paper introduces TF-SASM, a training-free, spatial-aware sparse memory method for multi-object tracking that improves efficiency and accuracy by selectively storing critical features based on object motion and overlap.
Contribution
It proposes a novel memory-based approach that reduces redundancy and enhances tracking performance without additional training, outperforming previous methods on DanceTrack.
Findings
Achieves 2.0% higher AssA score on DanceTrack
Improves IDF1 score by 2.1%
Stores longer temporal information with fewer features
Abstract
Multi-object tracking (MOT) in computer vision remains a significant challenge, requiring precise localization and continuous tracking of multiple objects in video sequences. The emergence of data sets that emphasize robust reidentification, such as DanceTrack, has highlighted the need for effective solutions. While memory-based approaches have shown promise, they often suffer from high computational complexity and memory usage due to storing feature at every single frame. In this paper, we propose a novel memory-based approach that selectively stores critical features based on object motion and overlapping awareness, aiming to enhance efficiency while minimizing redundancy. As a result, our method not only store longer temporal information with limited number of stored features in the memory, but also diversify states of a particular object to enhance the association performance. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Energy Efficient Wireless Sensor Networks · Air Quality Monitoring and Forecasting
