LoReTrack: Efficient and Accurate Low-Resolution Transformer Tracking
Shaohua Dong, Yunhe Feng, Qing Yang, Yuewei Lin, Heng Fan

TL;DR
LoReTrack introduces a dual knowledge distillation approach to enhance low-resolution Transformer tracking, achieving high accuracy and efficiency, and demonstrating superior performance on multiple benchmarks.
Contribution
The paper proposes a novel dual knowledge distillation method to improve low-resolution Transformer trackers, balancing efficiency and accuracy without increasing model size.
Findings
LoReTrack outperforms baseline low-resolution trackers in accuracy.
It runs 52% faster and uses 56% fewer MACs than high-resolution counterparts.
Achieves real-time CPU tracking with competitive accuracy.
Abstract
High-performance Transformer trackers have shown excellent results, yet they often bear a heavy computational load. Observing that a smaller input can immediately and conveniently reduce computations without changing the model, an easy solution is to adopt the low-resolution input for efficient Transformer tracking. Albeit faster, this hurts tracking accuracy much due to information loss in low resolution tracking. In this paper, we aim to mitigate such information loss to boost the performance of the low-resolution Transformer tracking via dual knowledge distillation from a frozen high-resolution (but not a larger) Transformer tracker. The core lies in two simple yet effective distillation modules, comprising query-key-value knowledge distillation (QKV-KD) and discrimination knowledge distillation (Disc-KD), across resolutions. The former, from the global view, allows the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPower Line Inspection Robots · Smart Parking Systems Research · IoT Networks and Protocols
MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Position-Wise Feed-Forward Layer · Multi-Head Attention · Dropout · Dense Connections
