Predicting Rankings of Software Verification Competitions
Mike Czech, Eyke H\"ullermeier, Marie-Christine Jakobs, Heike Wehrheim

TL;DR
This paper introduces a machine learning method that predicts the rankings of software verification tools on specific tasks using graph-based code representations, outperforming previous feature-based approaches.
Contribution
The paper presents a novel label ranking approach with graph kernels for predicting verification tool rankings, improving accuracy over existing methods.
Findings
High predictive accuracy on SV-COMP data sets
Outperforms previous feature-based ranking methods
Effective generalization to unseen verification tasks
Abstract
Software verification competitions, such as the annual SV-COMP, evaluate software verification tools with respect to their effectivity and efficiency. Typically, the outcome of a competition is a (possibly category-specific) ranking of the tools. For many applications, such as building portfolio solvers, it would be desirable to have an idea of the (relative) performance of verification tools on a given verification task beforehand, i.e., prior to actually running all tools on the task. In this paper, we present a machine learning approach to predicting rankings of tools on verification tasks. The method builds upon so-called label ranking algorithms, which we complement with appropriate kernels providing a similarity measure for verification tasks. Our kernels employ a graph representation for software source code that mixes elements of control flow and program dependence graphs with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Engineering Techniques and Practices
