Tracking in Aerial Hyperspectral Videos using Deep Kernelized Correlation Filters
Burak Uzkent, Aneesh Rangnekar, and Matthew J. Hoffman

TL;DR
This paper introduces DeepHKCF, a novel deep kernelized correlation filter tracker for aerial hyperspectral vehicle tracking, leveraging multi-modal sensors and synthetic data for improved accuracy and efficiency.
Contribution
The paper presents DeepHKCF, combining deep CNN features with kernelized correlation filters for aerial hyperspectral tracking, and introduces a synthetic dataset for vehicle classification in WAMI.
Findings
DeepHKCF outperforms existing trackers on synthetic hyperspectral videos.
The ROI mapping strategy significantly speeds up deep feature extraction.
A large synthetic dataset for vehicle classification is created and released.
Abstract
Hyperspectral imaging holds enormous potential to improve the state-of-the-art in aerial vehicle tracking with low spatial and temporal resolutions. Recently, adaptive multi-modal hyperspectral sensors have attracted growing interest due to their ability to record extended data quickly from aerial platforms. In this study, we apply popular concepts from traditional object tracking, namely (1) Kernelized Correlation Filters (KCF) and (2) Deep Convolutional Neural Network (CNN) features to aerial tracking in hyperspectral domain. We propose the Deep Hyperspectral Kernelized Correlation Filter based tracker (DeepHKCF) to efficiently track aerial vehicles using an adaptive multi-modal hyperspectral sensor. We address low temporal resolution by designing a single KCF-in-multiple Regions-of-Interest (ROIs) approach to cover a reasonably large area. To increase the speed of deep convolutional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
