High Speed Tracking With A Fourier Domain Kernelized Correlation Filter
Mingyang Guan, Zhengguo Li, Renjie He, and Changyun Wen

TL;DR
This paper introduces a high-speed, kernelized correlation filter tracker that combines l1 and l2 regularizations in the Fourier domain, improving accuracy and robustness while maintaining efficiency.
Contribution
A novel Fourier domain optimization framework that efficiently integrates l1 and l2 regularizations for robust, high-speed visual tracking.
Findings
Significantly improves tracking accuracy over original KCF.
Outperforms state-of-the-art methods in accuracy, efficiency, and robustness.
Maintains high tracking speed with enhanced robustness.
Abstract
It is challenging to design a high speed tracking approach using l1-norm due to its non-differentiability. In this paper, a new kernelized correlation filter is introduced by leveraging the sparsity attribute of l1-norm based regularization to design a high speed tracker. We combine the l1-norm and l2-norm based regularizations in one Huber-type loss function, and then formulate an optimization problem in the Fourier Domain for fast computation, which enables the tracker to adaptively ignore the noisy features produced from occlusion and illumination variation, while keep the advantages of l2-norm based regression. This is achieved due to the attribute of Convolution Theorem that the correlation in spatial domain corresponds to an element-wise product in the Fourier domain, resulting in that the l1-norm optimization problem could be decomposed into multiple sub-optimization spaces in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Image Enhancement Techniques · Urban Heat Island Mitigation
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Convolution
