Semantics and Multi-Query Optimization Algorithms for the Analyze Operator
Marios Iakovidis, Panos Vassiliadis

TL;DR
This paper introduces the ANALYZE operator, a novel cube querying tool with formal semantics that enhances multi-query optimization for data analysis by providing a comprehensive view and efficient execution strategies.
Contribution
The paper defines the semantics of the ANALYZE operator, proves an optimization method to merge queries, and proposes three algorithms for efficient execution.
Findings
Mid-MQO consistently performs well across contexts.
Max-MQO excels with large, overlapping sibling queries.
Min-MQO provides a baseline without optimization.
Abstract
In their hunt for highlights, i.e., interesting patterns in the data, data analysts have to issue groups of related queries and manually combine their results. To the extent that the analyst's goals are based on an intention on what to discover (e.g., contrast a query result to peer ones, verify a pattern to a broader range of data in the data space, etc), the integration of intentional query operators in analytical engines can enhance the efficiency of these analytical tasks. In this paper, we introduce, with well-defined semantics, the ANALYZE operator, a novel cube querying intentional operator that provides a 360 view of data. We define the semantics of an ANALYZE query as a tuple of five internal, facilitator cube queries, that (a) report on the specifics of a particular subset of the data space, which is part of the query specification, and to which we refer as the original query,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Cloud Computing and Resource Management · Machine Learning and Data Classification
