Charting EDA: Characterizing Interactive Visualization Use in Computational Notebooks with a Mixed-Methods Formalism
Dylan Wootton, Amy Rae Fox, Evan Peck, Arvind Satyanarayan

TL;DR
This study introduces a formalism to analyze how interactive visualizations influence data analysts' insights in computational notebooks, revealing that they enable earlier, more complex discoveries and serve as planning aids during EDA.
Contribution
The paper presents a novel mixed-methods formalism for characterizing EDA processes and demonstrates its application in revealing the roles of interactive visualizations in data analysis.
Findings
Interactive visualizations lead to earlier insights.
Some representations act as planning aids.
A small subset of representations drives most observations.
Abstract
Interactive visualizations are powerful tools for Exploratory Data Analysis (EDA), but how do they affect the observations analysts make about their data? We conducted a qualitative experiment with 13 professional data scientists analyzing two datasets with Jupyter notebooks, collecting a rich dataset of interaction traces and think-aloud utterances. By qualitatively coding participant utterances, we introduce a formalism that describes EDA as a sequence of analysis states, where each state is comprised of either a representation an analyst constructs (e.g., the output of a data frame, an interactive visualization, etc.) or an observation the analyst makes (e.g., about missing data, the relationship between variables, etc.). By applying our formalism to our dataset, we identify that interactive visualizations, on average, lead to earlier and more complex insights about relationships…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Educational Games and Gamification · Multimedia Communication and Technology
