Automated Soap Opera Testing Directed by LLMs and Scenario Knowledge: Feasibility, Challenges, and Road Ahead
Yanqi Su, Zhenchang Xing, Chong Wang, Chunyang Chen, Xiwei Xu, Qinghua, Lu, Liming Zhu

TL;DR
This paper investigates the potential of automating soap opera testing using large language models and scenario knowledge graphs, highlighting current capabilities, challenges, and future directions for large-scale, automated exploratory testing.
Contribution
It introduces a multi-agent system leveraging LLMs and a Scenario Knowledge Graph to automate soap opera testing, addressing manual testing limitations and outlining future research pathways.
Findings
Automated testing shows promise but lags behind manual testing in scenario coverage.
The system can identify potential bugs but struggles with complex scenario boundaries.
Future work should focus on neural-symbolic integration and human-AI collaboration.
Abstract
Exploratory testing (ET) harnesses tester's knowledge, creativity, and experience to create varying tests that uncover unexpected bugs from the end-user's perspective. Although ET has proven effective in system-level testing of interactive systems, the need for manual execution has hindered large-scale adoption. In this work, we explore the feasibility, challenges and road ahead of automated scenario-based ET (a.k.a soap opera testing). We conduct a formative study, identifying key insights for effective manual soap opera testing and challenges in automating the process. We then develop a multi-agent system leveraging LLMs and a Scenario Knowledge Graph (SKG) to automate soap opera testing. The system consists of three multi-modal agents, Planner, Player, and Detector that collaborate to execute tests and identify potential bugs. Experimental results demonstrate the potential of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Surface Polishing Techniques · Fault Detection and Control Systems · Image and Object Detection Techniques
