Enabling Cost-Effective UI Automation Testing with Retrieval-Based LLMs: A Case Study in WeChat
Sidong Feng, Haochuan Lu, Jianqin Jiang, Ting Xiong, Likun Huang,, Yinglin Liang, Xiaoqin Li, Yuetang Deng, Aldeida Aleti

TL;DR
This paper presents CAT, a retrieval-augmented approach using LLMs for cost-effective UI automation testing in industry apps, demonstrated through a case study on WeChat that achieves high automation accuracy at low cost.
Contribution
The paper introduces CAT, a novel method combining retrieval-augmented generation and machine learning with LLMs to improve industrial UI testing efficiency and cost-effectiveness.
Findings
Achieves 90% UI automation with $0.34 cost
Outperforms state-of-the-art in industrial app testing
Detects 141 bugs in WeChat testing platform
Abstract
UI automation tests play a crucial role in ensuring the quality of mobile applications. Despite the growing popularity of machine learning techniques to generate these tests, they still face several challenges, such as the mismatch of UI elements. The recent advances in Large Language Models (LLMs) have addressed these issues by leveraging their semantic understanding capabilities. However, a significant gap remains in applying these models to industrial-level app testing, particularly in terms of cost optimization and knowledge limitation. To address this, we introduce CAT to create cost-effective UI automation tests for industry apps by combining machine learning and LLMs with best practices. Given the task description, CAT employs Retrieval Augmented Generation (RAG) to source examples of industrial app usage as the few-shot learning context, assisting LLMs in generating the specific…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsService-Oriented Architecture and Web Services · Power Systems and Technologies · Web Data Mining and Analysis
