Understanding Queries by Conditional Instances
Amir Gilad, Zhengjie Miao, Sudeepa Roy, Jun Yang

TL;DR
This paper introduces a method for generating simple, representative data instances that illustrate how relational queries are satisfied, aiding understanding and debugging of complex queries.
Contribution
It formalizes the concept of representative instances for relational queries, proposes algorithms to compute minimal collections of such instances, and demonstrates their practical effectiveness.
Findings
The proposed algorithms generate representative instances efficiently.
A user study shows that the instances help users debug queries.
The method summarizes the different ways in which a query can be satisfied.
Abstract
A powerful way to understand a complex query is by observing how it operates on data instances. However, specific database instances are not ideal for such observations: they often include large amounts of superfluous details that are not only irrelevant to understanding the query but also cause cognitive overload; and one specific database may not be enough. Given a relational query, is it possible to provide a simple and generic "representative" instance that (1) illustrates how the query can be satisfied, (2) summarizes all specific instances that would satisfy the query in the same way by abstracting away unnecessary details? Furthermore, is it possible to find a collection of such representative instances that together completely characterize all possible ways in which the query can be satisfied? This paper takes initial steps towards answering these questions. We design what these…
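To make the idea of a "representative" instance concrete, here is a minimal sketch in Python. It is not the paper's algorithm. It assumes a conditional instance can be modeled as tuples that contain variables, plus a condition over those variables, in the style of classical conditional tables. The schema (`Likes`, `Serves`), the query, and all helper names are hypothetical and chosen only for illustration.

```python
from typing import Callable, Dict, List, Optional, Tuple

Row = Tuple[object, ...]
Assignment = Dict[str, object]
Database = Dict[str, List[Row]]


def is_var(value: object) -> bool:
    """Variables are written as strings starting with '?' (an assumption of this sketch)."""
    return isinstance(value, str) and value.startswith("?")


def ground(relations: Database,
           condition: Callable[[Assignment], bool],
           assignment: Assignment) -> Optional[Database]:
    """Replace variables by concrete values; return None if the condition fails."""
    if not condition(assignment):
        return None
    return {
        name: [tuple(assignment[v] if is_var(v) else v for v in row) for row in rows]
        for name, rows in relations.items()
    }


# Hypothetical schema: Likes(drinker, beer), Serves(bar, beer, price).
# One abstract instance stands for every concrete database of this shape.
C_RELATIONS: Database = {
    "Likes": [("?d", "?b")],
    "Serves": [("?x", "?b", "?p")],
}


def c_condition(a: Assignment) -> bool:
    """The only detail that matters for the query below: the price is under 5."""
    return a["?p"] < 5


def query(db: Database) -> List[object]:
    """Toy query: drinkers who like a beer that some bar serves for under 5."""
    return sorted({
        d
        for (d, b) in db["Likes"]
        for (_, b2, p) in db["Serves"]
        if b == b2 and p < 5
    })


# A grounding that meets the condition yields a concrete instance satisfying the query.
db = ground(C_RELATIONS, c_condition,
            {"?d": "Ann", "?b": "IPA", "?x": "Joe's", "?p": 4})
result = query(db)

# A grounding that violates the condition is not represented by this instance.
rejected = ground(C_RELATIONS, c_condition,
                  {"?d": "Ann", "?b": "IPA", "?x": "Joe's", "?p": 7})
```

In this sketch the names of the drinker, beer, and bar are abstracted into variables because they do not affect whether the query is satisfied; only the equality on the beer and the price condition remain.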
Taxonomy
Topics: Advanced Database Systems and Queries · Data Stream Mining Techniques · Semantic Web and Ontologies
