FEAR: Fast, Efficient, Accurate and Robust Visual Tracker
Vasyl Borsuk, Roman Vei, Orest Kupyn, Tetiana Martyniuk, Igor, Krashenyi, Ji\v{r}i Matas

TL;DR
FEAR introduces a family of Siamese visual trackers that are fast, efficient, and accurate, leveraging dual-template representation and novel modules to outperform existing methods on benchmarks while also considering energy consumption.
Contribution
The paper proposes a new dual-template representation and pixel-wise fusion block, significantly improving the speed and accuracy of Siamese trackers, and introduces the FEAR benchmark for energy efficiency evaluation.
Findings
FEAR trackers outperform most Siamese trackers in accuracy and efficiency.
FEAR-XS is over 10 times faster than current Siamese trackers with near state-of-the-art accuracy.
FEAR-XS is 2.4x smaller and 4.3x faster than LightTrack, with superior accuracy.
Abstract
We present FEAR, a family of fast, efficient, accurate, and robust Siamese visual trackers. We present a novel and efficient way to benefit from dual-template representation for object model adaption, which incorporates temporal information with only a single learnable parameter. We further improve the tracker architecture with a pixel-wise fusion block. By plugging-in sophisticated backbones with the abovementioned modules, FEAR-M and FEAR-L trackers surpass most Siamese trackers on several academic benchmarks in both accuracy and efficiency. Employed with the lightweight backbone, the optimized version FEAR-XS offers more than 10 times faster tracking than current Siamese trackers while maintaining near state-of-the-art results. FEAR-XS tracker is 2.4x smaller and 4.3x faster than LightTrack with superior accuracy. In addition, we expand the definition of the model efficiency by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Visual Attention and Saliency Detection · Human Pose and Action Recognition
