TL;DR
This paper introduces NBSafety, a Jupyter kernel that uses runtime tracing and static analysis to automatically manage cell lineage, preventing errors and improving reproducibility in computational notebooks.
Contribution
NBSafety is a novel system that enhances notebook safety by automatically tracking execution lineage and preventing errors without altering existing notebook semantics.
Findings
NBSafety identified 117 safety errors in 666 real notebook sessions.
Cells flagged by NBSafety were over 7 times more likely to be re-executed by users.
NBSafety effectively prevents errors while preserving notebook flexibility.
Abstract
Computational notebooks have emerged as the platform of choice for data science and analytical workflows, enabling rapid iteration and exploration. By keeping intermediate program state in memory and segmenting units of execution into so-called "cells", notebooks allow users to execute their workflows interactively and enjoy particularly tight feedback. However, as cells are added, removed, reordered, and rerun, this hidden intermediate state accumulates in a way that is not necessarily correlated with the notebook's visible code, making execution behavior difficult to reason about, and leading to errors and lack of reproducibility. We present NBSafety, a custom Jupyter kernel that uses runtime tracing and static analysis to automatically manage lineage associated with cell execution and global notebook state. NBSafety detects and prevents errors that users make during unaided notebook…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
