Context-aware Deep Feature Compression for High-speed Visual Tracking

Jongwon Choi; Hyung Jin Chang; Tobias Fischer; Sangdoo Yun; Kyuewang; Lee; Jiyeoup Jeong; Yiannis Demiris; Jin Young Choi

arXiv:1803.10537·cs.CV·October 21, 2020

Context-aware Deep Feature Compression for High-speed Visual Tracking

Jongwon Choi, Hyung Jin Chang, Tobias Fischer, Sangdoo Yun, Kyuewang, Lee, Jiyeoup Jeong, Yiannis Demiris, Jin Young Choi

PDF

1 Repo

TL;DR

This paper introduces a context-aware deep feature compression method using multiple expert auto-encoders for high-speed visual tracking, achieving real-time performance with state-of-the-art accuracy.

Contribution

It presents a novel framework that dynamically selects the best auto-encoder based on target appearance, enabling fast and accurate tracking.

Findings

01

Achieves over 100 fps tracking speed.

02

Maintains competitive accuracy with state-of-the-art methods.

03

Effective feature compression with improved denoising and orthogonality loss.

Abstract

We propose a new context-aware correlation filter based tracking framework to achieve both high computational speed and state-of-the-art performance among real-time trackers. The major contribution to the high computational speed lies in the proposed deep feature compression that is achieved by a context-aware scheme utilizing multiple expert auto-encoders; a context in our framework refers to the coarse category of the tracking target according to appearance patterns. In the pre-training phase, one expert auto-encoder is trained per category. In the tracking phase, the best expert auto-encoder is selected for a given target, and only this auto-encoder is used. To achieve high tracking performance with the compressed feature map, we introduce extrinsic denoising processes and a new orthogonality loss term for pre-training and fine-tuning of the expert auto-encoders. We validate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jongwon20000/TRACA
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings