Improving Random Testing via LLM-powered UI Tarpit Escaping for Mobile Apps
Mengqian Xu, Yiheng Xiong, Le Chang, Ting Su, Chengcheng Wan, Weikai Miao

TL;DR
This paper introduces a hybrid testing method using large language models to escape UI tarpits in mobile app testing, significantly improving coverage and bug detection.
Contribution
It proposes a novel LLM-powered approach to identify and escape UI tarpits during random GUI testing, enhancing test effectiveness for mobile apps.
Findings
Achieved 54.8% and 44.8% coverage improvements over baselines.
Detected 75 total bugs, including 34 new ones.
Found more bugs in WeChat than traditional random testing.
Abstract
Random GUI testing is a widely-used technique for testing mobile apps. However, its effectiveness is limited by the notorious issue -- UI exploration tarpits, where the exploration is trapped in local UI regions, thus impeding test coverage and bug discovery. In this experience paper, we introduce LLM-powered random GUI Testing, a novel hybrid testing approach to mitigating UI tarpits during random testing. Our approach monitors UI similarity to identify tarpits and query LLMs to suggest promising events for escaping the encountered tarpits. We implement our approach on top of two different automated input generation (AIG) tools for mobile apps: (1) HybridMonkey upon Monkey, a state-of-the-practice tool; and (2) HybridDroidbot upon Droidbot, a state-of-the-art tool. We evaluated them on 12 popular, real-world apps. The results show that HybridMonkey and HybridDroidbot outperform all…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
