PyGWalker: On-the-fly Assistant for Exploratory Visual Data Analysis
Yue Yu, Leixian Shen, Fei Long, Huamin Qu, Hao Chen

TL;DR
PyGWalker is a Python library that provides real-time, user-friendly visual data analysis assistance, bridging the gap between programmatic and visual data exploration in computational environments.
Contribution
It introduces a lightweight GUI tool with a flexible architecture supporting various environments, enhancing exploratory data analysis workflows.
Findings
Over 612,000 downloads on PyPI
More than 10,500 GitHub stars as of June 2024
Widespread adoption in data science community
Abstract
Exploratory visual data analysis tools empower data analysts to efficiently and intuitively explore data insights throughout the entire analysis cycle. However, the gap between common programmatic analysis (e.g., within computational notebooks) and exploratory visual analysis leads to a disjointed and inefficient data analysis experience. To bridge this gap, we developed PyGWalker, a Python library that offers on-the-fly assistance for exploratory visual data analysis. It features a lightweight and intuitive GUI with a shelf builder modality. Its loosely coupled architecture supports multiple computational environments to accommodate varying data sizes. Since its release in February 2023, PyGWalker has gained much attention, with 612k downloads on PyPI and over 10.5k stars on GitHub as of June 2024. This demonstrates its value to the data science and visualization community, with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Data Visualization and Analytics
