SEATrack: Simple, Efficient, and Adaptive Multimodal Tracker

Junbin Su; Ziteng Xue; Shihui Zhang; Kun Chen; Weiming Hu; Zhipeng Zhang

arXiv:2604.12502·cs.CV·April 15, 2026

TL;DR

SEATrack is a two-stream multimodal tracker built on parameter-efficient fine-tuning. It improves cross-modal alignment and global relation modeling, giving a better balance of accuracy and efficiency across RGB-T, RGB-D, and RGB-E tracking.

Contribution

The paper introduces two modules: AMG-LoRA for dynamic attention alignment and HMoE for efficient global relation modeling. Together they aim to raise multimodal tracking accuracy without inflating the parameter budget.

Findings

01. Outperforms state-of-the-art methods in RGB-T, RGB-D, and RGB-E tracking.

02. Achieves a better balance of accuracy and computational efficiency.

03. Demonstrates the effectiveness of the AMG-LoRA and HMoE modules.

Abstract

Parameter-efficient fine-tuning (PEFT) in multimodal tracking reveals a concerning trend where recent performance gains are often achieved at the cost of inflated parameter budgets, which fundamentally erodes PEFT's efficiency promise. In this work, we introduce SEATrack, a Simple, Efficient, and Adaptive two-stream multimodal tracker that tackles this performance-efficiency dilemma from two complementary perspectives. We first prioritize cross-modal alignment of matching responses, an underexplored yet pivotal factor that we argue is essential for breaking the trade-off. Specifically, we observe that modality-specific biases in existing two-stream methods generate conflicting matching attention maps, thereby hindering effective joint representation learning. To mitigate this, we propose AMG-LoRA, which seamlessly integrates Low-Rank Adaptation (LoRA) for domain adaptation with Adaptive…
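For readers unfamiliar with Low-Rank Adaptation, the following is a minimal sketch of a generic LoRA linear layer: a frozen weight matrix plus a trainable low-rank update. It is not the paper's AMG-LoRA, whose details are not given on this page; the class name `LoRALinear`, the `rank` and `alpha` parameters, and the initialization scale are assumptions chosen for illustration.

```python
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


class LoRALinear:
    """Generic LoRA layer (illustrative, not the paper's AMG-LoRA).

    Effective weight = W + (alpha / rank) * B @ A, where W is frozen and
    only A (rank x d_in) and B (d_out x rank) would be trained.
    """

    def __init__(self, weight, rank, alpha=1.0, seed=0):
        rng = random.Random(seed)
        d_out, d_in = len(weight), len(weight[0])
        self.weight = weight          # frozen pretrained weight, d_out x d_in
        self.scale = alpha / rank
        # A gets a small random init; B starts at zero, so the low-rank
        # update is exactly zero before any training step.
        self.A = [[rng.gauss(0.0, 0.01) for _ in range(d_in)] for _ in range(rank)]
        self.B = [[0.0] * rank for _ in range(d_out)]

    def effective_weight(self):
        delta = matmul(self.B, self.A)  # d_out x d_in, rank at most `rank`
        return [[w + self.scale * d for w, d in zip(rw, rd)]
                for rw, rd in zip(self.weight, delta)]

    def forward(self, x):
        """x is a list of input rows (n x d_in); returns n x d_out."""
        w_t = [list(col) for col in zip(*self.effective_weight())]
        return matmul(x, w_t)           # x @ W_eff^T

    def trainable_params(self):
        return len(self.A) * len(self.A[0]) + len(self.B) * len(self.B[0])


# Hypothetical usage: a 4x4 identity weight adapted with rank 1.
identity = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
layer = LoRALinear(identity, rank=1)
output = layer.forward([[1.0, 2.0, 3.0, 4.0]])
```

With rank 1 on a 4x4 weight, the sketch trains 8 values instead of 16; the saving grows quickly for the much larger matrices in a real tracker backbone.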

Code & Models

Repositories

AutoLab-SAI-SJTU/SEATrack (GitHub)

Models

jbs99/SEATrack (Hugging Face)