AppBuddy: Learning to Accomplish Tasks in Mobile Apps via Reinforcement Learning
Maayan Shvo, Zhiming Hu, Rodrigo Toro Icarte, Iqbal Mohomed, Allan, Jepson, Sheila A. McIlraith

TL;DR
This paper presents AppBuddy, an RL framework enabling agents to learn multi-step tasks in mobile apps by interacting with screen elements, demonstrating generalization and providing a platform for mobile RL research.
Contribution
Introduction of an RL-based platform, AppBuddy, for learning to accomplish tasks in mobile apps, including a suite of benchmark tasks and compatibility with OpenAI Gym.
Findings
RL agents can learn multi-step tasks in mobile apps
Agents show modest generalization across different apps
AppBuddy platform supports diverse RL research in mobile environments
Abstract
Human beings, even small children, quickly become adept at figuring out how to use applications on their mobile devices. Learning to use a new app is often achieved via trial-and-error, accelerated by transfer of knowledge from past experiences with like apps. The prospect of building a smarter smartphone - one that can learn how to achieve tasks using mobile apps - is tantalizing. In this paper we explore the use of Reinforcement Learning (RL) with the goal of advancing this aspiration. We introduce an RL-based framework for learning to accomplish tasks in mobile apps. RL agents are provided with states derived from the underlying representation of on-screen elements, and rewards that are based on progress made in the task. Agents can interact with screen elements by tapping or typing. Our experimental results, over a number of mobile apps, show that RL agents can learn to accomplish…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
