Online Hybrid Lightweight Representations Learning: Its Application to Visual Tracking
Ilchae Jung, Minji Kim, Eunhyeok Park, Bohyung Han

TL;DR
This paper introduces a hybrid deep neural network framework combining quantized and lightweight full-precision models for real-time visual tracking, achieving competitive accuracy with reduced computational resources.
Contribution
It proposes a novel parallel hybrid representation learning architecture for streaming data, specifically tailored for online visual tracking.
Findings
Achieves competitive accuracy on standard benchmarks.
Reduces computational cost and memory footprint.
Maintains high fidelity representations with a hybrid model.
Abstract
This paper presents a novel hybrid representation learning framework for streaming data, where an image frame in a video is modeled by an ensemble of two distinct deep neural networks; one is a low-bit quantized network and the other is a lightweight full-precision network. The former learns coarse primary information with low cost while the latter conveys residual information for high fidelity to original representations. The proposed parallel architecture is effective to maintain complementary information since fixed-point arithmetic can be utilized in the quantized network and the lightweight model provides precise representations given by a compact channel-pruned network. We incorporate the hybrid representation technique into an online visual tracking task, where deep neural networks need to handle temporal variations of target appearances in real-time. Compared to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Image Enhancement Techniques · Advanced Vision and Imaging
