Large Margin Object Tracking with Circulant Feature Maps
Mengmeng Wang, Yong Liu, Zeyi Huang

TL;DR
This paper introduces a fast, large margin object tracking method combining structured output SVM advantages with correlation filters, incorporating multimodal detection and feedback mechanisms for improved accuracy and real-time performance.
Contribution
It presents a novel large margin tracking algorithm that integrates correlation filters and multimodal detection, achieving high accuracy and speed with robustness against model drift.
Findings
Achieves over 80 fps in tracking speed.
Outperforms state-of-the-art algorithms on benchmark sequences.
Demonstrates robustness with both handcrafted and deep CNN features.
Abstract
Structured output support vector machine (SVM) based tracking algorithms have shown favorable performance recently. Nonetheless, the time-consuming candidate sampling and complex optimization limit their real-time applications. In this paper, we propose a novel large margin object tracking method which absorbs the strong discriminative ability from structured output SVM and speeds up by the correlation filter algorithm significantly. Secondly, a multimodal target detection technique is proposed to improve the target localization precision and prevent model drift introduced by similar objects or background noise. Thirdly, we exploit the feedback from high-confidence tracking results to avoid the model corruption problem. We implement two versions of the proposed tracker with the representations from both conventional hand-crafted and deep convolution neural networks (CNNs) based features…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Fire Detection and Safety Systems · IoT-based Smart Home Systems
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Support Vector Machine
