The Unexplored Treasure Trove of Phabricator Code Review
Gunnar Kudrjavets (University of Groningen), Nachiappan Nagappan, (Microsoft Research), Ayushi Rastogi (University of Groningen)

TL;DR
This paper presents a new dataset of 317,476 Phabricator code reviews from five major projects, enabling detailed analysis of code review history and facilitating future research in this area.
Contribution
We mined and compiled the first large-scale, publicly accessible dataset of Phabricator code reviews, overcoming data retrieval challenges and providing formats for broad research use.
Findings
Dataset includes 317,476 code reviews from five projects.
Enables granular analysis of code review history.
Facilitates future research with accessible data formats.
Abstract
Phabricator is a modern code collaboration tool used by popular projects like FreeBSD and Mozilla. However, unlike the other well-known code review environments, such as Gerrit or GitHub, there is no readily accessible public code review dataset for Phabricator. This paper describes our experience mining code reviews from five different projects that use Phabricator (Blender, FreeBSD, KDE, LLVM, and Mozilla). We discuss the challenges associated with the data retrieval process and our solutions, resulting in a dataset with details regarding 317,476 Phabricator code reviews. Our dataset is available in both JSON and MySQL database dump formats. The dataset enables analyses of the history of code reviews at a more granular level than other platforms. In addition, given that the projects we mined are publicly accessible via the Conduit API, our dataset can be used as a foundation to fetch…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Advanced Malware Detection Techniques · Web Application Security Vulnerabilities
