An Adaptive Benchmark for Modeling User Exploration of Large Datasets
Joanna Purich, Anthony Wise, Leilani Battle

TL;DR
This paper introduces SIMBA, a flexible benchmark that models user exploration in data analysis, enabling evaluation of database systems' performance in realistic, goal-oriented exploration scenarios.
Contribution
We propose a novel simulation-based benchmark that models user analysis goals and interactions, providing a more realistic assessment of DBMS performance during data exploration.
Findings
SIMBA can simulate diverse user exploration behaviors.
The benchmark reveals performance gaps in DBMSs not detected by traditional methods.
Experimental results demonstrate SIMBA's effectiveness across multiple systems and scenarios.
Abstract
In this paper, we present a new DBMS performance benchmark that can simulate user exploration with any specified dashboard design made of standard visualization and interaction components. The distinguishing feature of our SImulation-BAsed (or SIMBA) benchmark is its ability to model user analysis goals as a set of SQL queries to be generated through a valid sequence of user interactions, as well as measure the completion of analysis goals by testing for equivalence between the user's previous queries and their goal queries. In this way, the SIMBA benchmark can simulate how an analyst opportunistically searches for interesting insights at the beginning of an exploration session and eventually hones in on specific goals towards the end. To demonstrate the versatility of the SIMBA benchmark, we use it to test the performance of four DBMSs with six different dashboard specifications and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Advanced Database Systems and Queries · Peer-to-Peer Network Technologies
