Is this Build Failure Related to my Patch? An Empirical Study of Unrelated Build Failures in Continuous Integration
Andie Huang, Daniel Alencar da Costa, Grant Dick, Mariam El Mezouar

TL;DR
This study analyzes and predicts unrelated build failures in CI systems using machine learning, aiming to reduce developer effort in identifying failures caused by unrelated issues.
Contribution
It introduces semi-supervised PU learning models that effectively predict unrelated build failures using features from issue reports, comments, and commits.
Findings
Models achieve up to 0.88 precision and 0.97 AUC.
Unrelated test failures account for 20% of unrelated failures.
Features like CI latency and error message repetition are key indicators.
Abstract
Continuous Integration (CI) systems often run many builds concurrently. In this setting, a legitimate build failure may not be caused by the code push that triggered it. Such unrelated build failures can waste developer effort because developers must determine whether the failure is actionable for their current change. We study 77,354 CI build failures from seven open source Apache projects to understand and predict unrelated build failures. We find that developers spend a median of 4 hours identifying whether a failure is related or unrelated to their push. We also perform a document analysis of 371 confirmed unrelated build failures sampled from 10,316 potentially unrelated failures. The analysis shows that unrelated test failures account for 20% of the cases in which developers classify build failures as unrelated. To predict unrelated build failures, we extract 33 features from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
