Synthesizing Document Database Queries using Collection Abstractions
Qikang Liu, Yang He, Yanwen Cai, Byeongguk Kwak, Yuepeng Wang

TL;DR
This paper introduces a synthesis technique that automatically generates document database queries from input-output examples using a new algebraic language and collection abstractions, enabling efficient query generation.
Contribution
It presents a novel synthesis method with a domain-specific language and collection abstractions to efficiently generate complex document database queries from examples.
Findings
Successfully synthesized 108 out of 110 benchmarks.
Queries generated within tens of seconds on average.
Effective pruning of search space through collection abstractions.
Abstract
Document databases are increasingly popular in various applications, but their queries are challenging to write due to the flexible and complex data model underlying document databases. This paper presents a synthesis technique that aims to generate document database queries from input-output examples automatically. A new domain-specific language is designed to express a representative set of document database queries in an algebraic style. Furthermore, the synthesis technique leverages a novel abstraction of collections for deduction to efficiently prune the search space and quickly generate the target query. An evaluation of 110 benchmarks from various sources shows that the proposed technique can synthesize 108 benchmarks successfully. On average, the synthesizer can generate document database queries from a small number of input-output examples within tens of seconds.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Semantic Web and Ontologies · Data Management and Algorithms
