Dynamic Fusion Network for RGBT Tracking
Jingchao Peng, Haitao Zhao, Zhengwei Hu

TL;DR
The paper introduces DFNet, a dynamic fusion network for RGBT tracking that adaptively weights features from visible and infrared images, achieving high accuracy with minimal computational overhead.
Contribution
It proposes a novel two-stream architecture with adaptive weighting of shared and non-shared convolution kernels for improved RGBT tracking.
Findings
Achieves 88.1% precision rate and 71.9% success rate.
Operates at 28.658 FPS, with minimal additional computational cost.
Effectively handles varying contributions of features across different sequences.
Abstract
For both visible and infrared images have their own advantages and disadvantages, RGBT tracking has attracted more and more attention. The key points of RGBT tracking lie in feature extraction and feature fusion of visible and infrared images. Current RGBT tracking methods mostly pay attention to both individual features (features extracted from images of a single camera) and common features (features extracted and fused from an RGB camera and a thermal camera), while pay less attention to the different and dynamic contributions of individual features and common features for different sequences of registered image pairs. This paper proposes a novel RGBT tracking method, called Dynamic Fusion Network (DFNet), which adopts a two-stream structure, in which two non-shared convolution kernels are employed in each layer to extract individual features. Besides, DFNet has shared convolution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Image Fusion Techniques · Infrared Target Detection Methodologies
MethodsConvolution
