vSTMD: Visual Motion Detection for Extremely Tiny Target at Various Velocities
Mingshuo Xu, Hao Luan, Zhou Daniel Hao, Jigen Peng, and Shigang Yue

TL;DR
This paper introduces vSTMD, a novel, learning-free visual motion detection model for tiny targets across various velocities, outperforming existing methods in accuracy and speed, suitable for real-time applications.
Contribution
The paper presents vSTMD, a new natural architecture with a self-adaptive mechanism and efficient gradient calculation, enabling robust detection of tiny targets at diverse velocities without training.
Findings
vSTMD achieves 30% higher F1 score than SOTA models.
vSTMD-F improves F1 score by 58% over previous approaches.
The model is 60 times faster than deep learning methods.
Abstract
Visual motion detection for extremely tiny (ET-) targets is challenging, due to their category-independent nature and the scarcity of visual cues, which often incapacitate mainstream feature-based models. Natural architectures with rich interpretability offer a promising alternative, where STMD architectures derived from insect visual STMD (Small Target Motion Detector) pathways have demonstrated their effectiveness. However, previous STMD models are constrained to a narrow velocity range, hindering their efficacy in real-world scenarios where targets exhibit diverse and unstable dynamics. To address this limitation, we present vSTMD, a learning-free model for motion detection of ET-targets at various velocities. Our key innovations include: (1) a cross-Inhibition Dynamic Potential (cIDP) that serves as a self-adaptive mechanism efficiently capturing motion cues across a wide velocity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition
