STULL: Unbiased Online Sampling for Visual Exploration of Large Spatiotemporal Data
Guizhen Wang, Jingjing Guo, Mingjie Tang, Jos\'e Florencio de Queiroz, Neto, Calvin Yau, Anas Daghistani, Morteza Karimzadeh, Walid G. Aref, David, S. Ebert

TL;DR
This paper introduces STULL, an unbiased online sampling method for large spatiotemporal datasets that improves the accuracy of visual analytics by ensuring equal selection probability for qualifying data points, outperforming biased methods.
Contribution
The paper presents a novel unbiased sampling approach with a new data index and retrieval plan, enhancing the accuracy of interactive visualizations of large spatiotemporal data.
Findings
Samples are at least 50% more accurate in representing spatial distribution.
The approach enables more accurate approximate visualizations.
It outperforms existing biased sampling techniques in experiments.
Abstract
Online sampling-supported visual analytics is increasingly important, as it allows users to explore large datasets with acceptable approximate answers at interactive rates. However, existing online spatiotemporal sampling techniques are often biased, as most researchers have primarily focused on reducing computational latency. Biased sampling approaches select data with unequal probabilities and produce results that do not match the exact data distribution, leading end users to incorrect interpretations. In this paper, we propose a novel approach to perform unbiased online sampling of large spatiotemporal data. The proposed approach ensures the same probability of selection to every point that qualifies the specifications of a user's multidimensional query. To achieve unbiased sampling for accurate representative interactive visualizations, we design a novel data index and an associated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Data Management and Algorithms · Human Mobility and Location-Based Analysis
