AppAgent-Pro: A Proactive GUI Agent System for Multidomain Information Integration and User Assistance
Yuyang Zhao, Wentao Shi, Fuli Feng, and Xiangnan He

TL;DR
AppAgent-Pro introduces a proactive GUI agent system that anticipates user needs and actively integrates multi-domain information, enhancing the effectiveness of information retrieval and user assistance beyond traditional reactive agents.
Contribution
This paper presents a novel proactive GUI agent system, AppAgent-Pro, capable of anticipating user needs and conducting in-depth multi-domain information mining, which is a significant advancement over reactive agents.
Findings
Enables proactive information integration based on user instructions
Improves depth and comprehensiveness of information retrieval
Potential to transform daily information acquisition
Abstract
Large language model (LLM)-based agents have demonstrated remarkable capabilities in addressing complex tasks, thereby enabling more advanced information retrieval and supporting deeper, more sophisticated human information-seeking behaviors. However, most existing agents operate in a purely reactive manner, responding passively to user instructions, which significantly constrains their effectiveness and efficiency as general-purpose platforms for information acquisition. To overcome this limitation, this paper proposes AppAgent-Pro, a proactive GUI agent system that actively integrates multi-domain information based on user instructions. This approach enables the system to proactively anticipate users' underlying needs and conduct in-depth multi-domain information mining, thereby facilitating the acquisition of more comprehensive and intelligent information. AppAgent-Pro has the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
