Effect of Technical and Social Factors on Pull Request Quality for the NPM Ecosystem
Tapajit Dey, Audris Mockus

TL;DR
This study analyzes technical and social factors influencing pull request acceptance in the NPM ecosystem, identifying key predictors and their complex relationships using a large dataset and predictive modeling.
Contribution
It introduces a comprehensive ecosystem-wide analysis of PR acceptance factors, including novel measures, and models their complex effects with high predictive accuracy.
Findings
Number of PRs and acceptance rate are significant predictors.
Signals during the PR review phase greatly influence acceptance.
Most predictors have non-linear effects on acceptance probability.
Abstract
Pull request (PR) based development, which is a norm for the social coding platforms, entails the challenge of evaluating the contributions of, often unfamiliar, developers from across the open source ecosystem and, conversely, submitting a contribution to a project with unfamiliar maintainers. Previous studies suggest that the decision of accepting or rejecting a PR may be influenced by a diverging set of technical and social factors, but often focus on relatively few projects, do not consider ecosystem-wide measures, or the possible non-monotonic relationships between the predictors and PR acceptance probability. We aim to shed light on this important decision making process by testing which measures significantly affect the probability of PR acceptance on a significant fraction of a large ecosystem, rank them by their relative importance in predicting PR acceptance, and determine the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
