Towards Low-Latency Event Stream-based Visual Object Tracking: A Slow-Fast Approach
Shiao Wang, Xiao Wang, Liye Jin, Bo Jiang, Lin Zhu, Lan Chen, Yonghong Tian, Bin Luo

TL;DR
This paper introduces SFTrack, a dual-mode visual object tracking framework using event cameras that balances high-precision slow tracking with low-latency fast tracking, suitable for resource-constrained environments.
Contribution
The paper proposes a novel Slow-Fast Tracking paradigm with graph-based representation learning and FlashAttention backbones, enabling adaptable, efficient, and accurate event-based object tracking.
Findings
Fast tracker achieves low latency with multiple bounding boxes in one pass.
The framework outperforms existing methods on FE240, COESOT, and EventVOT benchmarks.
Seamless integration of slow and fast trackers improves overall tracking performance.
Abstract
Existing tracking algorithms typically rely on low-frame-rate RGB cameras coupled with computationally intensive deep neural network architectures to achieve effective tracking. However, such frame-based methods inherently face challenges in achieving low-latency performance and often fail in resource-constrained environments. Visual object tracking using bio-inspired event cameras has emerged as a promising research direction in recent years, offering distinct advantages for low-latency applications. In this paper, we propose a novel Slow-Fast Tracking paradigm that flexibly adapts to different operational requirements, termed SFTrack. The proposed framework supports two complementary modes, i.e., a high-precision slow tracker for scenarios with sufficient computational resources, and an efficient fast tracker tailored for latency-aware, resource-constrained environments. Specifically,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Human Pose and Action Recognition · Advanced Memory and Neural Computing
MethodsKnowledge Distillation
