Optimizing Multi-Modality Trackers via Significance-Regularized Tuning
Zhiwen Chen, Jinjian Wu, Zhiyu Zhu, Yifan Zhang, Guangming Shi, Junhui Hou

TL;DR
This paper introduces a significance-regularized fine-tuning framework for multi-modality trackers, improving transferability and performance by balancing stability and adaptability through intrinsic parameter significance.
Contribution
It proposes a novel regularization method that leverages parameter significance to optimize pre-trained models for multi-modality tracking tasks.
Findings
Outperforms state-of-the-art on multiple benchmarks.
Enhances transferability across different modalities.
Improves stability and adaptability in multi-modal tracking.
Abstract
This paper tackles the critical challenge of optimizing multi-modality trackers by effectively adapting pre-trained models for RGB data. Existing fine-tuning paradigms oscillate between excessive flexibility and over-restriction, both leading to suboptimal plasticity-stability trade-offs. To mitigate this dilemma, we propose a novel significance-regularized fine-tuning framework, which delicately refines the learning process by incorporating intrinsic parameter significance. Through a comprehensive investigation of the transition from pre-trained to multi-modality contexts, we identify that parameters crucial to preserving foundational patterns and managing cross-domain shifts are the primary drivers of this issue. Specifically, we first probe the tangent space of pre-trained weights to measure and orient prior significance, dedicated to preserving generalization. Subsequently, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
