Online Tracking by Learning Discriminative Saliency Map with   Convolutional Neural Network

Seunghoon Hong; Tackgeun You; Suha Kwak; Bohyung Han

arXiv:1502.06796·cs.CV·February 25, 2015·518 cites

Online Tracking by Learning Discriminative Saliency Map with Convolutional Neural Network

Seunghoon Hong, Tackgeun You, Suha Kwak, Bohyung Han

PDF

Open Access

TL;DR

This paper introduces an online visual tracking method that leverages CNN-derived features and saliency maps to improve target localization and segmentation, demonstrating superior performance on challenging benchmarks.

Contribution

The paper presents a novel online tracking algorithm that constructs target-specific saliency maps guided by CNN features and an online SVM, enhancing localization accuracy and segmentation.

Findings

01

Outperforms state-of-the-art tracking algorithms on benchmark datasets.

02

Enables pixel-level target segmentation through saliency map visualization.

03

Improves target localization accuracy in challenging scenarios.

Abstract

We propose an online visual tracking algorithm by learning discriminative saliency map using Convolutional Neural Network (CNN). Given a CNN pre-trained on a large-scale image repository in offline, our algorithm takes outputs from hidden layers of the network as feature descriptors since they show excellent representation performance in various general visual recognition problems. The features are used to learn discriminative target appearance models using an online Support Vector Machine (SVM). In addition, we construct target-specific saliency map by backpropagating CNN features with guidance of the SVM, and obtain the final tracking result in each frame based on the appearance model generatively constructed with the saliency map. Since the saliency map visualizes spatial configuration of target effectively, it improves target localization accuracy and enable us to achieve…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Visual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques

MethodsSupport Vector Machine