# Real-Time Correlation Tracking via Joint Model Compression and Transfer

**Authors:** Ning Wang, Wengang Zhou, Yibing Song, Chao Ma, Houqiang Li

arXiv: 1907.09831 · 2020-06-24

## TL;DR

This paper introduces a method to compress and transfer CNN models into lightweight networks for real-time visual tracking on low-resource devices, maintaining accuracy while significantly improving speed.

## Contribution

It presents a joint model compression and transfer framework using knowledge distillation, with novel fidelity and tracking losses, enabling real-time tracking on single-core CPUs.

## Key findings

- Achieves real-time tracking speed on a single-core CPU.
- Maintains comparable accuracy to state-of-the-art deep CF trackers.
- Demonstrates effectiveness across five challenging datasets.

## Abstract

Correlation filters (CF) have received considerable attention in visual tracking because of their computational efficiency. Leveraging deep features via off-the-shelf CNN models (e.g., VGG), CF trackers achieve state-of-the-art performance while consuming a large number of computing resources. This limits deep CF trackers to be deployed to many mobile platforms on which only a single-core CPU is available. In this paper, we propose to jointly compress and transfer off-the-shelf CNN models within a knowledge distillation framework. We formulate a CNN model pretrained from the image classification task as a teacher network, and distill this teacher network into a lightweight student network as the feature extractor to speed up CF trackers. In the distillation process, we propose a fidelity loss to enable the student network to maintain the representation capability of the teacher network. Meanwhile, we design a tracking loss to adapt the objective of the student network from object recognition to visual tracking. The distillation process is performed offline on multiple layers and adaptively updates the student network using a background-aware online learning scheme. Extensive experiments on five challenging datasets demonstrate that the lightweight student network accelerates the speed of state-of-the-art deep CF trackers to real-time on a single-core CPU while maintaining almost the same tracking accuracy.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1907.09831/full.md

## Figures

18 figures with captions in the complete paper: https://tomesphere.com/paper/1907.09831/full.md

## References

68 references — full list in the complete paper: https://tomesphere.com/paper/1907.09831/full.md

---
Source: https://tomesphere.com/paper/1907.09831