On Designing Light-Weight Object Trackers through Network Pruning: Use CNNs or Transformers?
Saksham Aggarwal, Taneesh Gupta, Pawan Kumar Sahu, Arnav Chavan,, Rishabh Tiwari, Dilip K. Prasad, Deepak K. Gupta

TL;DR
This paper explores designing ultra-lightweight object trackers by applying neural architectural pruning to large CNN and transformer models, comparing their performance and stability at various compression levels.
Contribution
It introduces a method for creating highly compressed, efficient object trackers from large models and provides a comparative analysis of CNNs, transformers, and hybrid architectures.
Findings
Highly compressed trackers can maintain performance at low ratios.
Transformers and CNNs show different stability profiles under pruning.
Extreme pruning as low as 1% reveals limits of network compression in tracking.
Abstract
Object trackers deployed on low-power devices need to be light-weight, however, most of the current state-of-the-art (SOTA) methods rely on using compute-heavy backbones built using CNNs or transformers. Large sizes of such models do not allow their deployment in low-power conditions and designing compressed variants of large tracking models is of great importance. This paper demonstrates how highly compressed light-weight object trackers can be designed using neural architectural pruning of large CNN and transformer based trackers. Further, a comparative study on architectural choices best suited to design light-weight trackers is provided. A comparison between SOTA trackers using CNNs, transformers as well as the combination of the two is presented to study their stability at various compression ratios. Finally results for extreme pruning scenarios going as low as 1% in some cases are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Brain Tumor Detection and Classification · Advanced Neural Network Applications
MethodsPruning
