A Comprehensive Empirical Evaluation of Generating Test Suites for Mobile Applications with Diversity
Thomas Vogel, Chinh Tran, Lars Grunske

TL;DR
This paper evaluates a new approach, pienzDiv, for generating diverse test suites for mobile apps, showing it improves or matches existing methods without trial-and-error tuning, though it may be slower.
Contribution
It introduces pienzDiv, a diversity-preserving extension of pienz, developed through fitness landscape analysis, to enhance test suite quality without manual configuration tuning.
Findings
pienzDiv maintains test suite diversity and improves fault detection.
It achieves comparable or better coverage than pienz.
It produces longer test sequences and requires more execution time.
Abstract
Context: In search-based software engineering we often use popular heuristics with default configurations, which typically lead to suboptimal results, or we perform experiments to identify configurations on a trial-and-error basis, which may lead to better results for a specific problem. We consider the problem of generating test suites for mobile applications (apps) and rely on \Sapienz, a state-of-the-art approach to this problem that uses a popular heuristic (NSGA-II) with a default configuration. Objective: We want to achieve better results in generating test suites with \Sapienz while avoiding trial-and-error experiments to identify a more suitable configuration of \Sapienz. Method: We conducted a fitness landscape analysis of \Sapienz to analytically understand the search problem, which allowed us to make informed decisions about the heuristic and configuration of \Sapienz when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
