SkillDroid: Compile Once, Reuse Forever
Qijia Chen, Andrea Bellucci, Zhida Sun, Giulio Jacucci

TL;DR
SkillDroid is a skill-based agent that compiles LLM-guided GUI trajectories into reusable templates, significantly reducing LLM calls and improving reliability over time in mobile GUI tasks.
Contribution
It introduces a three-layer system that compiles and reuses GUI skills, enhancing efficiency and success rates compared to stateless LLM approaches.
Findings
Achieves 85.3% success rate, 23 points above baseline.
Uses 49% fewer LLM calls.
1000% success rate in replay mode across 79 rounds.
Abstract
LLM-based mobile GUI agents treat every task invocation as an independent reasoning episode, requiring a full LLM inference call at each action step. This per-step dependence makes them stateless: a task completed successfully yesterday is re-derived from scratch today, with no improvement in reliability or speed. We present SkillDroid, a three-layer skill agent that compiles successful LLM-guided GUI trajectories into parameterized skill templates (sequences of UI actions with weighted element locators and typed parameter slots) and replays them on future invocations without any LLM calls. A matching cascade (regex patterns, embedding similarity, and app filtering) routes incoming instructions to stored skills, while a failure-learning layer triggers recompilation when skill reliability degrades. Over a 150-round longitudinal evaluation with systematic instruction variation and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
