Quick on the Uptake: Eliciting Implicit Intents from Human Demonstrations for Personalized Mobile-Use Agents

Zheng Wu; Heyuan Huang; Yanjia Yang; Yuanyi Song; Xingyu Lou; Weiwen Liu; Weinan Zhang; Jun Wang; Zhuosheng Zhang

arXiv:2508.08645 · cs.CL · April 6, 2026

TL;DR

This paper introduces IFRAgent, a framework that enhances personalized mobile-use agents by analyzing both explicit and implicit human intentions from demonstrations, leading to better alignment with user preferences.

Contribution

The paper presents an approach for incorporating implicit intention flows into mobile agent personalization, along with a new dataset (MobileIAR) and an evaluation metric, the intention alignment rate.

Findings

01

IFRAgent improves intention alignment rate by 6.79% on average.

02

It enhances step completion rates by 5.30% on average.

03

In intention alignment rate, the average gain over baselines corresponds to a 32.06% relative improvement.

Abstract

As multimodal large language models advance rapidly, the automation of mobile tasks has become increasingly feasible through mobile-use agents that mimic human interactions with graphical user interfaces. To further enhance mobile-use agents, previous studies employ demonstration learning, improving the agents from human demonstrations. However, these methods focus solely on the explicit intention flows of humans (e.g., step sequences) while neglecting implicit intention flows (e.g., personal preferences), which makes it difficult to construct personalized mobile-use agents. In this work, to evaluate the Intention Alignment Rate between mobile-use agents and humans, we first collect MobileIAR, a dataset containing human-intent-aligned actions and ground-truth actions. This enables a comprehensive assessment of the agents'…
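To make the two kinds of reference action concrete, the following is a minimal sketch of how an intention alignment rate and a step completion rate could be scored, assuming each step stores one ground-truth action and one human-intent-aligned action and that a prediction counts only when it matches the reference exactly. The `Step` record, its field names, the action strings, and exact-match scoring are all illustrative assumptions; they are not the paper's metric definitions or the MobileIAR schema.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    """One agent step with two reference actions (hypothetical layout)."""
    predicted: str        # action chosen by the agent
    ground_truth: str     # action that completes the task step
    intent_aligned: str   # action this particular user would have taken


def match_rate(steps, reference_field):
    """Fraction of steps whose prediction equals the chosen reference field."""
    if not steps:
        return 0.0
    hits = sum(1 for s in steps if s.predicted == getattr(s, reference_field))
    return hits / len(steps)


def step_completion_rate(steps):
    # Assumption: a step is complete when it matches the ground-truth action.
    return match_rate(steps, "ground_truth")


def intention_alignment_rate(steps):
    # Assumption: a step is aligned when it matches the user's preferred action.
    return match_rate(steps, "intent_aligned")


# Toy episode: the agent follows the generic task path but misses
# two user-specific preferences and makes one outright mistake.
episode = [
    Step("click:search", "click:search", "click:search"),
    Step("type:pizza", "type:pizza", "type:vegan pizza"),
    Step("click:first_result", "click:first_result", "click:favorite_shop"),
    Step("click:pay", "click:order", "click:order"),
]

print(step_completion_rate(episode))
print(intention_alignment_rate(episode))
```

In this toy episode the agent completes most steps yet rarely takes the action the user would have preferred, which is the gap between explicit and implicit intention flows that the abstract describes.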

Code & Models

Repositories

MadeAgents/Quick-on-the-Uptake
github

Models

wuuuuuz/IFRAgent
model· 5 dl

Datasets

wuuuuuz/MobileIAR
dataset· 15 dl
