Best Practices in Statistical Computing

Ricardo Sanchez; Beth Ann Griffin; Joseph Pane; Daniel McCaffrey

arXiv:2101.11857·stat.CO·August 9, 2021

TL;DR

This paper emphasizes the importance of best practices in statistical computing, advocating for transparency, documentation, and quality assurance to improve reproducibility and reduce errors in research.

Contribution

It provides a comprehensive set of guidelines for implementing code quality assurance processes in statistical research.

Findings

1. Implementing QA steps enhances reproducibility.
2. Adherence to coding standards reduces errors.
3. Regular testing improves data integrity.

Abstract

The world is becoming increasingly complex, both in the rich sources of data we have access to and in the statistical and computational methods we can use on those data. These factors create an ever-increasing risk of errors in our code and make our findings more sensitive to data preparation and to the execution of complex statistical and computing methods. The consequences of coding and data mistakes can be substantial. Openness (e.g., providing others with data and code) and transparency (e.g., requiring that data processing and code follow standards) are two key ways to alleviate concerns about replicability and errors. In this paper, we describe the key steps for implementing a code quality assurance (QA) process for researchers to follow to improve their coding practices throughout a project to assure the quality of the final data, code, analyses and ultimately…

Code & Models

Repositories

jpane24/code-qa (Official)