PatchTrack: A Comprehensive Analysis of ChatGPT's Influence on Pull Request Outcomes
Daniel Ogenrwot, John Businge

TL;DR
This study analyzes how developers evaluate, adapt, and integrate ChatGPT-generated code in pull requests, revealing that full adoption is rare and AI influences workflows beyond direct code integration.
Contribution
It provides empirical insights into AI-mediated decision-making in software development, highlighting patterns of integration and influence of ChatGPT in pull request workflows.
Findings
Median integration rate of AI-generated code is 25%.
Developers treat AI output as a starting point, not final code.
AI influences workflows through guidance, documentation, and debugging.
Abstract
The rapid adoption of large language models (LLMs) like ChatGPT has introduced new dynamics in software development, particularly within pull request workflows. While prior research has examined the quality of AI-generated code, less is known about how developers evaluate, adapt, and integrate these suggestions in real-world collaboration. We analyze 338 pull requests from 255 GitHub repositories containing self-admitted ChatGPT usage, comprising 645 AI-generated snippets and 3,486 developer-authored patches. To support this analysis at scale, we use PatchTrack, an automated classifier that identifies whether AI-generated patches were applied, partially reused, or not integrated. Our findings reveal that full adoption of ChatGPT-generated code is uncommon: the median integration rate is 25%. Qualitative analysis of 89 pull requests with integrated patches reveals recurring patterns of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
