Attention-MoA: Enhancing Mixture-of-Agents via Inter-Agent Semantic Attention and Deep Residual Synthesis
Jianyu Wen, Yang Wei, Xiongxi Yu, Changxuan Xiao, Ke Zeng

TL;DR
Attention-MoA introduces inter-agent semantic attention and residual synthesis to improve collaborative reasoning in mixture-of-agents models, significantly enhancing performance and efficiency over existing methods.
Contribution
The paper proposes a novel framework that enables deep semantic interaction among agents and mitigates information loss, advancing the capabilities of mixture-of-agents systems.
Findings
Outperforms state-of-the-art baselines on multiple benchmarks.
Achieves high win rates and superior scores with smaller models.
Enables small models to surpass large proprietary models in performance.
Abstract
As the development of Large Language Models (LLMs) shifts from parameter scaling to inference-time collaboration, the Mixture-of-Agents (MoA) framework has emerged as a general paradigm to harness collective intelligence by layering diverse models. While recent MoA variants have introduced dynamic routing and residual connections to improve efficiency, these methods often fail to facilitate deep semantic interaction between agents, limiting the system's ability to actively correct hallucinations and refine logic. In this paper, we introduce Attention-MoA, a novel MoA-based framework that redefines collaboration through Inter-agent Semantic Attention. Complemented by an Inter-layer Residual Module with Adaptive Early Stopping Mechanism, our architecture mitigates information degradation in deep layers while improving computational efficiency. Extensive evaluations across AlpacaEval 2.0,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning in Materials Science · Artificial Intelligence in Healthcare and Education
