Streamlining Software Reviews: Efficient Predictive Modeling with Minimal Examples

Tim Menzies; Andre Lustosa

arXiv:2405.12920·cs.SE·May 22, 2024

TL;DR

This paper introduces a novel approach for software review using predictive models trained on minimal examples, enabling efficient decision-making and automation in software analysis tasks.

Contribution

It demonstrates that effective predictive models for software review can be built with as few as 12 to 30 labeled examples, a significant reduction in data requirements.

Findings

01 Models trained with minimal data perform well across diverse case studies.

02 The approach reduces SME effort in software review processes.

03 Open-source code and data are provided for reproducibility.

Abstract

This paper proposes a new challenge problem for software analytics. In the process we shall call "software review", a panel of SMEs (subject matter experts) reviews examples of software behavior to recommend how to improve that software's operation. SME time is usually extremely limited so, ideally, this panel can complete this optimization task after looking at just a small number of very informative examples. To support this review process, we explore methods that train a predictive model to guess if some oracle will like/dislike the next example. Such a predictive model can work with the SMEs to guide them in their exploration of all the examples. Also, after the panelists leave, that model can be used as an oracle in place of the panel (to handle new examples while the panelists are busy elsewhere). In 31 case studies (ranging from high-level decisions about software…
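
The review loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm and is not taken from the timm/ez repository; it is a generic pool-based loop that assumes a nearest-centroid model, an uncertainty-based choice of the next example, and a label budget of 20 (picked from the 12 to 30 range quoted above). The names `review`, `make_scorer`, `pool`, and `oracle` are hypothetical, and the oracle here is a stand-in function rather than a real SME panel.

```python
import random

def centroid(rows):
    """Mean vector of a non-empty list of equal-length numeric rows."""
    return [sum(col) / len(rows) for col in zip(*rows)]

def dist2(a, b):
    """Squared Euclidean distance between two rows."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def make_scorer(liked, disliked):
    """Return f(row); f(row) > 0 means row is nearer the liked centroid."""
    if not liked or not disliked:
        return lambda row: 0.0  # no evidence for one class yet
    cl, cd = centroid(liked), centroid(disliked)
    return lambda row: dist2(row, cd) - dist2(row, cl)

def review(pool, oracle, budget=20, warmup=4, seed=1):
    """Ask the oracle about at most `budget` rows; return (surrogate, labels used)."""
    rng = random.Random(seed)
    todo = list(pool)
    rng.shuffle(todo)
    liked, disliked = [], []
    for asked in range(min(budget, len(todo))):
        if asked >= warmup:
            # Assumption: query the row the current model is least sure about.
            f = make_scorer(liked, disliked)
            todo.sort(key=lambda row: abs(f(row)))
        row = todo.pop(0)
        (liked if oracle(row) else disliked).append(row)
    f = make_scorer(liked, disliked)
    # The surrogate stands in for the oracle once the panel has left.
    return (lambda row: f(row) > 0), len(liked) + len(disliked)

# Toy data (hypothetical): 100 one-dimensional examples, oracle likes values above 0.5.
pool = [(i / 100,) for i in range(100)]
oracle = lambda row: row[0] > 0.5
surrogate, asked = review(pool, oracle)
print(asked, surrogate((0.99,)), surrogate((0.0,)))
```

The returned `surrogate` plays the role of the model that handles new examples in place of the panel; swapping in a different learner or query rule only requires changing `make_scorer` and the sort key.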

Code & Models

Repositories

timm/ez (Official)

Taxonomy

Topics: Scientific Computing and Data Management · Semantic Web and Ontologies