The Types, Roles, and Practices of Documentation in Data Analytics Open Source Software Libraries: A Collaborative Ethnography of Documentation Work
R. Stuart Geiger, Nelle Varoquaux, Charlotte Mazel-Cabasse, Chris Holdgraf

TL;DR
This study explores the diverse formats, roles, and social dynamics of documentation work in open source data analytics libraries through ethnographic research, highlighting challenges and motivations of contributors.
Contribution
It provides a detailed ethnographic analysis of documentation practices, roles, and contributor motivations in data analytics OSS libraries, revealing social and technical barriers.
Findings
Documentation serves educational, promotional, and organizational roles.
Documentation work requires diverse skills and faces social/technical barriers.
Motivation for documentation work is often lower than for coding tasks.
Abstract
Computational research and data analytics increasingly rely on complex ecosystems of open source software (OSS) "libraries" -- curated collections of reusable code that programmers import to perform a specific task. Software documentation for these libraries is crucial in helping programmers/analysts know what libraries are available and how to use them. Yet documentation for open source software libraries is widely considered low-quality. This article is a collaboration between CSCW researchers and contributors to data analytics OSS libraries, based on ethnographic fieldwork and qualitative interviews. We examine several issues concerning the formats, practices, and challenges of documentation in these largely volunteer-based projects. There are many different kinds and formats of documentation that exist around such libraries, which play a variety of educational, promotional, and…
