# Predicting Merge Conflicts in Collaborative Software Development

**Authors:** Moein Owhadi-Kareshk, Sarah Nadi, Julia Rubin

arXiv: 1907.06274 · 2019-07-16

## TL;DR

This paper demonstrates that machine learning can effectively predict safe merge scenarios in collaborative software development, significantly reducing the computational cost of conflict detection.

## Contribution

The study introduces a lightweight classifier using Git features to predict merge conflicts, showing high accuracy in large-scale GitHub data across multiple programming languages.

## Key findings

- High F1-scores (0.95-0.97) for predicting safe merges
- Moderate F1-scores (0.57-0.68) for conflicting merges
- Feasibility of conflict prediction in practical settings

## Abstract

Background. During collaborative software development, developers often use branches to add features or fix bugs. When merging changes from two branches, conflicts may occur if the changes are inconsistent. Developers need to resolve these conflicts before completing the merge, which is an error-prone and time-consuming process. Early detection of merge conflicts, which warns developers about resolving conflicts before they become large and complicated, is among the ways of dealing with this problem. Existing techniques do this by continuously pulling and merging all combinations of branches in the background to notify developers as soon as a conflict occurs, which is a computationally expensive process. One potential way for reducing this cost is to use a machine-learning based conflict predictor that filters out the merge scenarios that are not likely to have conflicts, ie safe merge scenarios. Aims. In this paper, we assess if conflict prediction is feasible. Method. We design a classifier for predicting merge conflicts, based on 9 light-weight Git feature sets. To evaluate our predictor, we perform a large-scale study on 267, 657 merge scenarios from 744 GitHub repositories in seven programming languages. Results. Our results show that we achieve high f1-scores, varying from 0.95 to 0.97 for different programming languages, when predicting safe merge scenarios. The f1-score is between 0.57 and 0.68 for the conflicting merge scenarios. Conclusions. Predicting merge conflicts is feasible in practice, especially in the context of predicting safe merge scenarios as a pre-filtering step for speculative merging.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1907.06274/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/1907.06274/full.md

## References

42 references — full list in the complete paper: https://tomesphere.com/paper/1907.06274/full.md

---
Source: https://tomesphere.com/paper/1907.06274