Mobile-Agent-RAG: Driving Smart Multi-Agent Coordination with Contextual Knowledge Empowerment for Long-Horizon Mobile Automation

Yuxiang Zhou; Jichang Li; Yanhao Zhang; Haonan Lu; Guanbin Li

arXiv:2511.12254·cs.AI·March 13, 2026

Mobile-Agent-RAG: Driving Smart Multi-Agent Coordination with Contextual Knowledge Empowerment for Long-Horizon Mobile Automation

Yuxiang Zhou, Jichang Li, Yanhao Zhang, Haonan Lu, Guanbin Li

PDF

Open Access 1 Video

TL;DR

This paper introduces Mobile-Agent-RAG, a hierarchical multi-agent framework that enhances long-horizon mobile automation by integrating dual-level retrieval augmentation, significantly improving success rates and efficiency over existing methods.

Contribution

It proposes a novel hierarchical multi-agent system with dual retrieval augmentation for high-level planning and low-level execution, addressing knowledge reliance issues in mobile agents.

Findings

01

Improved task completion rate by 11.0%.

02

Enhanced step efficiency by 10.2%.

03

Outperforms state-of-the-art baselines.

Abstract

Mobile agents show immense potential, yet current state-of-the-art (SoTA) agents exhibit inadequate success rates on real-world, long-horizon, cross-application tasks. We attribute this bottleneck to the agents' excessive reliance on static, internal knowledge within MLLMs, which leads to two critical failure points: 1) strategic hallucinations in high-level planning and 2) operational errors during low-level execution on user interfaces (UI). The core insight of this paper is that high-level planning and low-level UI operations require fundamentally distinct types of knowledge. Planning demands high-level, strategy-oriented experiences, whereas operations necessitate low-level, precise instructions closely tied to specific app UIs. Motivated by these insights, we propose Mobile-Agent-RAG, a novel hierarchical multi-agent framework that innovatively integrates dual-level retrieval…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Mobile-Agent-RAG: Driving Smart Multi-Agent Coordination with Contextual Knowledge Empowerment for Long-Horizon Mobile Automation· underline

Taxonomy

TopicsAI-based Problem Solving and Planning · Personal Information Management and User Behavior · Multi-Agent Systems and Negotiation