Data Formulator: AI-powered Concept-driven Visualization Authoring
Chenglong Wang, John Thompson, Bongshin Lee

TL;DR
Data Formulator is an AI-powered visualization authoring tool. Users define the data concepts they want to visualize in natural language or by example, and an AI agent performs the data transformation needed to produce the chart, lowering the expertise required to create visualizations.
Contribution
The paper proposes concept binding, a visualization paradigm that separates high-level visualization intents from low-level data transformation steps, and implements it in Data Formulator so that authors can create visualizations without extensive data transformation expertise.
Findings
Participants successfully created complex visualizations using Data Formulator.
The tool improved understanding and inspection of data transformations.
User feedback indicated ease of use and effectiveness.
Abstract
With most modern visualization tools, authors need to transform their data into tidy formats to create visualizations they want. Because this requires experience with programming or separate data processing tools, data transformation remains a barrier in visualization authoring. To address this challenge, we present a new visualization paradigm, concept binding, that separates high-level visualization intents and low-level data transformation steps, leveraging an AI agent. We realize this paradigm in Data Formulator, an interactive visualization authoring tool. With Data Formulator, authors first define data concepts they plan to visualize using natural languages or examples, and then bind them to visual channels. Data Formulator then dispatches its AI-agent to automatically transform the input data to surface these concepts and generate desired visualizations. When presenting the…
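To make the tidy-data barrier concrete, here is a minimal sketch in plain Python of the kind of reshaping and derivation an author would otherwise write by hand before charting. The table, the column names, and the helpers `to_tidy` and `derive` are hypothetical illustrations chosen for this example; they are not Data Formulator's code, its API, or its generated output.

```python
# Hypothetical wide-format input: one column per city (illustrative values only).
wide_rows = [
    {"date": "2020-01-01", "Seattle": 51, "Atlanta": 45},
    {"date": "2020-01-02", "Seattle": 45, "Atlanta": 47},
]


def to_tidy(rows, id_key, value_keys, var_name, value_name):
    """Reshape wide rows into tidy rows: one observation per row."""
    tidy = []
    for row in rows:
        for key in value_keys:
            tidy.append({id_key: row[id_key], var_name: key, value_name: row[key]})
    return tidy


def derive(rows, name, fn):
    """Return copies of the rows with one extra column computed by fn(row)."""
    return [{**row, name: fn(row)} for row in rows]


# A reshaped concept: "temperature" per "city", suitable for a color-encoded line chart.
tidy_rows = to_tidy(wide_rows, "date", ["Seattle", "Atlanta"], "city", "temperature")

# A derived concept: the per-day difference between the two cities.
with_diff = derive(wide_rows, "difference", lambda r: r["Seattle"] - r["Atlanta"])
```

In the concept binding paradigm described above, the author would instead name concepts such as `temperature` or `difference` and bind them to visual channels, leaving steps like these to the AI agent.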
Taxonomy
Topics: Data Visualization and Analytics · Scientific Computing and Data Management · Computational Physics and Python Applications
