git2net - Mining Time-Stamped Co-Editing Networks from Large git Repositories
Christoph Gote, Ingo Scholtes, Frank Schweitzer

TL;DR
git2net is a scalable tool that extracts detailed, time-stamped co-editing networks from large git repositories, revealing fine-grained developer collaboration patterns through text mining of code changes.
Contribution
It introduces a novel method for constructing high-resolution, directed, weighted developer networks based on detailed commit history analysis, surpassing traditional co-authorship models.
Findings
Enables analysis of detailed code ownership and collaboration.
Provides high-resolution, time-stamped developer interaction data.
Applied successfully to open source and commercial projects.
Abstract
Data from software repositories have become an important foundation for the empirical study of software engineering processes. A recurring theme in the repository mining literature is the inference of developer networks capturing e.g. collaboration, coordination, or communication from the commit history of projects. Most of the studied networks are based on the co-authorship of software artefacts defined at the level of files, modules, or packages. While this approach has led to insights into the social aspects of software development, it neglects detailed information on code changes and code ownership, e.g. which exact lines of code have been authored by which developers, that is contained in the commit log of software projects. Addressing this issue, we introduce git2net, a scalable python software that facilitates the extraction of fine-grained co-editing networks in large git…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Open Source Software Innovations · Wikis in Education and Collaboration
