Jupyter Scatter: Interactive Exploration of Large-Scale Datasets
Fritz Lekschas, Trevor Manz

TL;DR
Jupyter Scatter is a scalable, interactive scatterplot tool integrated with Jupyter environments that enables efficient exploration of large datasets with up to twenty million points, supporting fast selections and easy integration.
Contribution
It introduces a highly scalable and user-friendly scatterplot widget capable of visualizing and interacting with large-scale datasets in Jupyter environments.
Findings
Supports up to twenty million points
Provides fast point selection and interactivity
Integrates seamlessly with Pandas and Matplotlib
Abstract
Jupyter Scatter is a scalable, interactive, and interlinked scatterplot widget for exploring datasets in Jupyter Notebook/Lab, Colab, and VS Code. Its goal is to simplify the visual exploration, analysis, and comparison of large-scale bivariate datasets. Jupyter Scatter can render up to twenty million points, supports fast point selections, integrates with Pandas DataFrame and Matplotlib, uses perceptually-effective default settings, and offers a user-friendly API.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data Technologies and Applications · Data Visualization and Analytics
