Robust Visual Tracking by Motion Analyzing
Mohammed Leo, Kurban Ubul, ShengJie Cheng, Michael Ma

TL;DR
This paper introduces a motion analysis-based algorithm for visual tracking that leverages tensor decomposition to improve accuracy and enable real-time performance across multiple benchmarks.
Contribution
It presents a novel motion pattern analysis method using Tucker2 tensor decomposition for robust, real-time visual tracking, outperforming existing methods on key benchmarks.
Findings
Achieved state-of-the-art results on LaSOT, AVisT, OTB100, and GOT-10k.
Capable of real-time operation in practical scenarios.
Effectively describes target motion using tensor-based analysis.
Abstract
In recent years, Video Object Segmentation (VOS) has emerged as a complementary method to Video Object Tracking (VOT). VOS focuses on classifying all the pixels around the target, allowing for precise shape labeling, while VOT primarily focuses on the approximate region where the target might be. However, traditional segmentation modules usually classify pixels frame by frame, disregarding information between adjacent frames. In this paper, we propose a new algorithm that addresses this limitation by analyzing the motion pattern using the inherent tensor structure. The tensor structure, obtained through Tucker2 tensor decomposition, proves to be effective in describing the target's motion. By incorporating this information, we achieved competitive results on Four benchmarks LaSOT\cite{fan2019lasot}, AVisT\cite{noman2022avist}, OTB100\cite{7001050}, and GOT-10k\cite{huang2019got}…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
MethodsVOS
