Improving Random Testing via LLM-powered UI Tarpit Escaping for Mobile Apps

Mengqian Xu; Yiheng Xiong; Le Chang; Ting Su; Chengcheng Wan; Weikai Miao

arXiv:2604.06763·cs.SE·April 14, 2026

Improving Random Testing via LLM-powered UI Tarpit Escaping for Mobile Apps

Mengqian Xu, Yiheng Xiong, Le Chang, Ting Su, Chengcheng Wan, Weikai Miao

PDF

TL;DR

This paper introduces a hybrid testing method using large language models to escape UI tarpits in mobile app testing, significantly improving coverage and bug detection.

Contribution

It proposes a novel LLM-powered approach to identify and escape UI tarpits during random GUI testing, enhancing test effectiveness for mobile apps.

Findings

01

Achieved 54.8% and 44.8% coverage improvements over baselines.

02

Detected 75 total bugs, including 34 new ones.

03

Found more bugs in WeChat than traditional random testing.

Abstract

Random GUI testing is a widely-used technique for testing mobile apps. However, its effectiveness is limited by the notorious issue -- UI exploration tarpits, where the exploration is trapped in local UI regions, thus impeding test coverage and bug discovery. In this experience paper, we introduce LLM-powered random GUI Testing, a novel hybrid testing approach to mitigating UI tarpits during random testing. Our approach monitors UI similarity to identify tarpits and query LLMs to suggest promising events for escaping the encountered tarpits. We implement our approach on top of two different automated input generation (AIG) tools for mobile apps: (1) HybridMonkey upon Monkey, a state-of-the-practice tool; and (2) HybridDroidbot upon Droidbot, a state-of-the-art tool. We evaluated them on 12 popular, real-world apps. The results show that HybridMonkey and HybridDroidbot outperform all…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.