Faithful Mobile GUI Agents with Guided Advantage Estimator

Haowen Hu; Pengzhou Cheng; Zheng Wu; Lingzhong Dong; Gongshen Liu; Zhuosheng Zhang

arXiv:2605.01208·cs.AI·May 5, 2026

Faithful Mobile GUI Agents with Guided Advantage Estimator

Haowen Hu, Pengzhou Cheng, Zheng Wu, Lingzhong Dong, Gongshen Liu, Zhuosheng Zhang

PDF

TL;DR

This paper introduces Faithful-Agent, a framework for GUI interaction that enhances faithfulness and evidence grounding using a two-stage training process and a novel guided advantage estimator, significantly improving task success rates.

Contribution

It presents a new faithfulness-first approach with a guided advantage estimator to improve GUI agent reliability and grounding, outperforming previous methods.

Findings

01

Trap SR increased from 13.88% to 80.21% with Faithful-Agent.

02

Faithful-Agent maintains strong instruction-following performance.

03

GuAE prevents advantage collapse in low-variance rollout groups.

Abstract

Vision-language model based graphical user interface (GUI) agents have shown strong interaction capabilities. However, they often behave unfaithfully, relying on memorized shortcuts rather than grounding actions in displayed screen evidence or user instructions. To address this, we propose Faithful-Agent, a faithfulness-first framework that reformulates GUI interaction to prioritize evidence groundedness and internal consistency. Faithful-Agent employs a two-stage pipeline: (i) a faithfulness-oriented SFT stage to instill abstainment behaviors under evidence perturbations; (ii) an RFT stage that further amplifies faithfulness by introducing the guided advantage estimator (GuAE), an anchor-based and variance-adaptive advantage tempering mechanism built upon GRPO. GuAE prevents advantage collapse in low-variance rollout groups under sparse GUI rewards, and with a thought-action…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.