Foresight: Rapid Data Exploration Through Guideposts
\c{C}a\u{g}atay Demiralp, Peter J. Haas, Srinivasan, Parthasarathy, Tejaswini Pedapati

TL;DR
Foresight is a visualization recommender system that enables rapid, guided exploration of large high-dimensional datasets through the use of guideposts, which highlight significant data features and facilitate efficient navigation.
Contribution
The paper introduces Foresight, a novel system that automates the identification of key data features and supports interactive exploration via guideposts, improving upon manual EDA methods.
Findings
Foresight enables faster data exploration compared to traditional tools.
Guideposts help users identify significant data patterns efficiently.
The system is validated through user studies and performance benchmarks.
Abstract
Current tools for exploratory data analysis (EDA) require users to manually select data attributes, statistical computations and visual encodings. This can be daunting for large-scale, complex data. We introduce Foresight, a visualization recommender system that helps the user rapidly explore large high-dimensional datasets through "guideposts." A guidepost is a visualization corresponding to a pronounced instance of a statistical descriptor of the underlying data, such as a strong linear correlation between two attributes, high skewness or concentration about the mean of a single attribute, or a strong clustering of values. For each descriptor, Foresight initially presents visualizations of the "strongest" instances, based on an appropriate ranking metric. Given these initial guideposts, the user can then look at "nearby" guideposts by issuing "guidepost queries" containing constraints…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Data Management and Algorithms · Advanced Database Systems and Queries
