Feature Selection Convolutional Neural Networks for Visual Tracking
Zhiyan Cui, Na Lu

TL;DR
This paper introduces a feature selection CNN-based visual tracking algorithm that reduces computational complexity and increases speed while maintaining high precision, achieving around 10 fps compared to 1 fps of traditional MDNet.
Contribution
It proposes a novel feature selection method and an efficient RoIAlign-based strategy to accelerate MDNet-based tracking without sacrificing accuracy.
Findings
Achieved around 10 fps speed on OTB100 benchmark.
Maintained high tracking precision comparable to existing methods.
Reduced network complexity significantly.
Abstract
Most of the existing tracking methods based on CNN(convolutional neural networks) are too slow for real-time application despite the excellent tracking precision compared with the traditional ones. Moreover, neural networks are memory intensive which will take up lots of hardware resources. In this paper, a feature selection visual tracking algorithm combining CNN based MDNet(Multi-Domain Network) and RoIAlign was developed. We find that there is a lot of redundancy in feature maps from convolutional layers. So valid feature maps are selected by mutual information and others are abandoned which can reduce the complexity and computation of the network and do not affect the precision. The major problem of MDNet also lies in the time efficiency. Considering the computational complexity of MDNet is mainly caused by the large amount of convolution operations and fine-tuning of the network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Vision and Imaging · Infrared Target Detection Methodologies
