Eliciting Best Practices for Collaboration with Computational Notebooks

Luigi Quaranta; Fabio Calefato; Filippo Lanubile

arXiv:2202.07233·cs.HC·February 16, 2022

Eliciting Best Practices for Collaboration with Computational Notebooks

Luigi Quaranta, Fabio Calefato, Filippo Lanubile

PDF

1 Repo

TL;DR

This paper identifies and evaluates best practices for collaborative data science using computational notebooks, highlighting gaps between awareness and consistent adoption due to tool limitations.

Contribution

It provides a catalog of best practices for collaborative notebook use, assesses their awareness among data scientists, and analyzes real-world adoption patterns.

Findings

01

Experts are aware of most best practices.

02

Adoption varies depending on context and tool support.

03

Some best practices are often unfeasible or counterproductive.

Abstract

Despite the widespread adoption of computational notebooks, little is known about best practices for their usage in collaborative contexts. In this paper, we fill this gap by eliciting a catalog of best practices for collaborative data science with computational notebooks. With this aim, we first look for best practices through a multivocal literature review. Then, we conduct interviews with professional data scientists to assess their awareness of these best practices. Finally, we assess the adoption of best practices through the analysis of 1,380 Jupyter notebooks retrieved from the Kaggle platform. Findings reveal that experts are mostly aware of the best practices and tend to adopt them in their daily work. Nonetheless, they do not consistently follow all the recommendations as, depending on specific contexts, some are deemed unfeasible or counterproductive due to the lack of proper…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

collab-uniba/pynblint
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAttentive Walk-Aggregating Graph Neural Network