Revisiting Cross-Attention Mechanisms: Leveraging Beneficial Noise for Domain-Adaptive Learning

Zelin Zang; Yehui Yang; Fei Wang; Liangyu Li; and Baigui Sun

arXiv:2603.17474·cs.CV·March 19, 2026

Revisiting Cross-Attention Mechanisms: Leveraging Beneficial Noise for Domain-Adaptive Learning

Zelin Zang, Yehui Yang, Fei Wang, Liangyu Li, and Baigui Sun

PDF

Open Access

TL;DR

This paper introduces a novel framework called DACSM that uses beneficial noise in cross-attention mechanisms to improve domain adaptation by focusing on content and ignoring style variations, achieving state-of-the-art results.

Contribution

The paper proposes the DACSM framework with beneficial noise in cross-attention to enhance domain-invariant feature learning and robustness against scale and style variations.

Findings

01

DACSM outperforms previous methods on multiple benchmarks.

02

Beneficial noise improves focus on content over style distractions.

03

Significant gains on the 'truck' class demonstrate robustness to scale discrepancies.

Abstract

Unsupervised Domain Adaptation (UDA) seeks to transfer knowledge from a labeled source domain to an unlabeled target domain but often suffers from severe domain and scale gaps that degrade performance. Existing cross-attention-based transformers can align features across domains, yet they struggle to preserve content semantics under large appearance and scale variations. To explicitly address these challenges, we introduce the concept of beneficial noise, which regularizes cross-attention by injecting controlled perturbations, encouraging the model to ignore style distractions and focus on content. We propose the Domain-Adaptive Cross-Scale Matching (DACSM) framework, which consists of a Domain-Adaptive Transformer (DAT) for disentangling domain-shared content from domain-specific style, and a Cross-Scale Matching (CSM) module that adaptively aligns features across multiple resolutions.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Generative Adversarial Networks and Image Synthesis