DroidCall: A Dataset for LLM-powered Android Intent Invocation

Weikai Xie; Li Zhang; Shihe Wang; Rongjie Yi; Mengwei Xu

arXiv:2412.00402 · cs.AI · December 3, 2024

TL;DR

DroidCall introduces a new dataset and fine-tuning approach for small language models to accurately invoke Android intents, enabling efficient on-device mobile agents with enhanced privacy.

Contribution

We created DroidCall, the first dataset for Android intent invocation, and demonstrated that small language models fine-tuned on it can approach or even surpass GPT-4o on this task.

Findings

01. Fine-tuned small models approach GPT-4o performance.

02. DroidCall contains 10,000 samples for training and testing.

03. An Android app demonstrates practical deployment.

Abstract

The growing capabilities of large language models in natural language understanding significantly strengthen existing agentic systems. To power performant on-device mobile agents for better data privacy, we introduce DroidCall, the first training and testing dataset for accurate Android intent invocation. With a highly flexible and reusable data generation pipeline, we constructed 10k samples in DroidCall. Given a task instruction in natural language, small language models such as Qwen2.5-3B and Gemma2-2B fine-tuned with DroidCall can approach or even surpass the capabilities of GPT-4o for accurate Android intent invocation. We also provide an end-to-end Android app equipped with these fine-tuned models to demonstrate the Android intent invocation process. The code and dataset are available at https://github.com/UbiquitousLearning/DroidCall.
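To make the task concrete, here is a minimal sketch of what turning a model's output into an Android intent could involve. It assumes the model emits a JSON function call, which is then validated and mapped to an intent action plus extras. The `FUNCTIONS` registry, the `set_alarm` function name, the JSON layout, and `call_to_intent` are all hypothetical illustrations and are not taken from the DroidCall dataset or code; only the action and extra strings are the standard Android `AlarmClock` constants.

```python
import json

# Hypothetical registry (not from DroidCall): each callable function maps
# its argument names to (expected type, Android intent extra key).
FUNCTIONS = {
    "set_alarm": {
        "action": "android.intent.action.SET_ALARM",
        "params": {
            "hour": (int, "android.intent.extra.alarm.HOUR"),
            "minutes": (int, "android.intent.extra.alarm.MINUTES"),
            "message": (str, "android.intent.extra.alarm.MESSAGE"),
        },
        "required": {"hour", "minutes"},
    }
}


def call_to_intent(model_output: str) -> dict:
    """Validate a JSON function call and describe the intent it maps to."""
    call = json.loads(model_output)
    spec = FUNCTIONS.get(call.get("name"))
    if spec is None:
        raise ValueError(f"unknown function: {call.get('name')!r}")

    args = call.get("arguments", {})
    missing = spec["required"] - set(args)
    if missing:
        raise ValueError(f"missing required arguments: {sorted(missing)}")

    extras = {}
    for name, value in args.items():
        if name not in spec["params"]:
            raise ValueError(f"unexpected argument: {name!r}")
        expected_type, extra_key = spec["params"][name]
        # Strict type check so that, for example, True is not accepted as an int.
        if type(value) is not expected_type:
            raise ValueError(f"argument {name!r} must be {expected_type.__name__}")
        extras[extra_key] = value

    # A plain description of the intent; an Android app would build the
    # actual Intent object from the action and extras.
    return {"action": spec["action"], "extras": extras}


if __name__ == "__main__":
    output = '{"name": "set_alarm", "arguments": {"hour": 7, "minutes": 30}}'
    print(call_to_intent(output))
```

Under these assumptions, accuracy on the task comes down to the model choosing the right function name and filling every argument with a correctly typed value, which is what a validation step like this would check before the intent is dispatched.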

Code & Models

Repositories

ubiquitouslearning/droidcall
Official

Datasets

mllmTeam/DroidCall
dataset · 215 downloads

Videos

DroidCall: A Dataset for LLM-powered Android Intent Invocation

Taxonomy

Topics: Advanced Malware Detection Techniques · Mobile and Web Applications · Advanced Data Storage Technologies