Technical Debt in the Peer-Review Documentation of R Packages: a rOpenSci Case Study
Zadia Codabux, Melina Vidoni, Fatemeh H. Fard

TL;DR
This study investigates technical debt in the peer-review documentation of R packages, revealing prevalent documentation debt and role-specific differences, thus pioneering analysis in scientific software peer-review.
Contribution
It introduces a taxonomy of technical debt in R package peer-reviews and provides the first empirical analysis of this kind in scientific software.
Findings
Documentation debt is the most common type of technical debt.
Different roles report different types of technical debt.
The study provides a publicly available dataset for future research.
Abstract
Context: Technical Debt is a metaphor used to describe code that is "not quite right." Although TD studies have gained momentum, TD has yet to be studied as thoroughly in non-Object-Oriented (OO) or scientific software such as R. R is a multi-paradigm programming language, whose popularity in data science and statistical applications has amplified in recent years. Due to R's inherent ability to expand through user-contributed packages, several community-led organizations were created to organize and peer-review packages in a concerted effort to increase their quality. Nonetheless, it is well-known that most R users do not have a technical programming background, being from multiple disciplines. Objective: The goal of this study is to investigate TD in the peer-review documentation of R packages led by rOpenSci. Method: We collected over 5000 comments from 157 packages that had been…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Software Engineering Research
