TL;DR
This paper presents a comprehensive taxonomy of 91 information attributes used in test case prioritisation, critically analyzing their applicability and the role of machine learning, with implications for industrial use and cost-benefit considerations.
Contribution
It introduces the TePIA taxonomy classifying information attributes for TCP and evaluates ML-based methods in industrial contexts, highlighting challenges related to information access and applicability.
Findings
High number of attributes may hinder industrial adoption
Assuming easy access to SUT code can limit real-world applicability
ML-based TCP effectiveness depends on information availability and cost
Abstract
Most software companies have extensive test suites and re-run parts of them continuously to ensure recent changes have no adverse effects. Since test suites are costly to execute, industry needs methods for test case prioritisation (TCP). Recently, TCP methods use machine learning (ML) to exploit the information known about the system under test (SUT) and its test cases. However, the value added by ML-based TCP methods should be critically assessed with respect to the cost of collecting the information. This paper analyses two decades of TCP research, and presents a taxonomy of 91 information attributes that have been used. The attributes are classified with respect to their information sources and the characteristics of their extraction process. Based on this taxonomy, TCP methods validated with industrial data and those applying ML are analysed in terms of information availability,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
