vSTMD: Visual Motion Detection for Extremely Tiny Target at Various Velocities

Mingshuo Xu; Hao Luan; Zhou Daniel Hao; Jigen Peng; and Shigang Yue

arXiv:2501.13054·cs.CV·January 21, 2026

vSTMD: Visual Motion Detection for Extremely Tiny Target at Various Velocities

Mingshuo Xu, Hao Luan, Zhou Daniel Hao, Jigen Peng, and Shigang Yue

PDF

Open Access 1 Repo

TL;DR

This paper introduces vSTMD, a novel, learning-free visual motion detection model for tiny targets across various velocities, outperforming existing methods in accuracy and speed, suitable for real-time applications.

Contribution

The paper presents vSTMD, a new natural architecture with a self-adaptive mechanism and efficient gradient calculation, enabling robust detection of tiny targets at diverse velocities without training.

Findings

01

vSTMD achieves 30% higher F1 score than SOTA models.

02

vSTMD-F improves F1 score by 58% over previous approaches.

03

The model is 60 times faster than deep learning methods.

Abstract

Visual motion detection for extremely tiny (ET-) targets is challenging, due to their category-independent nature and the scarcity of visual cues, which often incapacitate mainstream feature-based models. Natural architectures with rich interpretability offer a promising alternative, where STMD architectures derived from insect visual STMD (Small Target Motion Detector) pathways have demonstrated their effectiveness. However, previous STMD models are constrained to a narrow velocity range, hindering their efficacy in real-world scenarios where targets exhibit diverse and unstable dynamics. To address this limitation, we present vSTMD, a learning-free model for motion detection of ET-targets at various velocities. Our key innovations include: (1) a cross-Inhibition Dynamic Potential (cIDP) that serves as a self-adaptive mechanism efficiently capturing motion cues across a wide velocity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mingshuoxu/stmdnet
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition