From Fake Focus to Real Precision: Confusion-Driven Adversarial Attention Learning in Transformers

Yawei Liu

arXiv:2512.20661·cs.AI·December 25, 2025

From Fake Focus to Real Precision: Confusion-Driven Adversarial Attention Learning in Transformers

Yawei Liu

PDF

Open Access

TL;DR

This paper introduces an adversarial attention learning method for Transformers that redistributes attention to task-relevant words, significantly improving sentiment analysis accuracy without manual annotations.

Contribution

The paper proposes a novel adversarial feedback mechanism that dynamically adjusts attention in Transformers, enhancing focus on important words for sentiment analysis.

Findings

01

Achieves state-of-the-art results on three datasets

02

Improves large language model performance by 12.6%

03

Demonstrates effective attention redistribution without manual labels

Abstract

Transformer-based models have been widely adopted for sentiment analysis tasks due to their exceptional ability to capture contextual information. However, these methods often exhibit suboptimal accuracy in certain scenarios. By analyzing their attention distributions, we observe that existing models tend to allocate attention primarily to common words, overlooking less popular yet highly task-relevant terms, which significantly impairs overall performance. To address this issue, we propose an Adversarial Feedback for Attention(AFA) training mechanism that enables the model to automatically redistribute attention weights to appropriate focal points without requiring manual annotations. This mechanism incorporates a dynamic masking strategy that attempts to mask various words to deceive a discriminator, while the discriminator strives to detect significant differences induced by these…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSentiment Analysis and Opinion Mining · Hate Speech and Cyberbullying Detection · Generative Adversarial Networks and Image Synthesis