Worldwide Gender Differences in Public Code Contributions
Davide Rossi, Stefano Zacchiroli (LTCI)

TL;DR
This study analyzes 50 years of public code contributions worldwide, revealing a steady increase in female participation overall, regional differences, and a decline during the COVID-19 pandemic.
Contribution
It provides a large-scale, longitudinal analysis of gender and geographic trends in open source software contributions, filling a significant research gap.
Findings
Female contribution ratio is low but steadily increasing globally.
Growth in female participation is observed across all world regions.
COVID-19 pandemic temporarily decreased female contributions.
Abstract
Gender imbalance is a well-known phenomenon observed throughout sciences which is particularly severe in software development and Free/Open Source Software communities. Little is know yet about the geography of this phenomenon in particular when considering large scales for both its time and space dimensions. We contribute to fill this gap with a longitudinal study of the population of contributors to publicly available software source code. We analyze the development history of 160 million software projects for a total of 2.2 billion commits contributed by 43 million distinct authors over a period of 50 years. We classify author names by gender using name frequencies and author geographical locations using heuristics based on email addresses and time zones. We study the evolution over time of contributions to public code by gender and by world region. For the world overall, we confirm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
