Revisiting Cross-Attention Mechanisms: Leveraging Beneficial Noise for Domain-Adaptive Learning
Zelin Zang, Yehui Yang, Fei Wang, Liangyu Li, and Baigui Sun

TL;DR
This paper introduces a novel framework called DACSM that uses beneficial noise in cross-attention mechanisms to improve domain adaptation by focusing on content and ignoring style variations, achieving state-of-the-art results.
Contribution
The paper proposes the DACSM framework with beneficial noise in cross-attention to enhance domain-invariant feature learning and robustness against scale and style variations.
Findings
DACSM outperforms previous methods on multiple benchmarks.
Beneficial noise improves focus on content over style distractions.
Significant gains on the 'truck' class demonstrate robustness to scale discrepancies.
Abstract
Unsupervised Domain Adaptation (UDA) seeks to transfer knowledge from a labeled source domain to an unlabeled target domain but often suffers from severe domain and scale gaps that degrade performance. Existing cross-attention-based transformers can align features across domains, yet they struggle to preserve content semantics under large appearance and scale variations. To explicitly address these challenges, we introduce the concept of beneficial noise, which regularizes cross-attention by injecting controlled perturbations, encouraging the model to ignore style distractions and focus on content. We propose the Domain-Adaptive Cross-Scale Matching (DACSM) framework, which consists of a Domain-Adaptive Transformer (DAT) for disentangling domain-shared content from domain-specific style, and a Cross-Scale Matching (CSM) module that adaptively aligns features across multiple resolutions.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Generative Adversarial Networks and Image Synthesis
