TL;DR
This paper formalizes MongoDB's data model and query language, analyzing its expressivity and computational complexity, revealing equivalences with nested relational algebra and establishing complexity bounds.
Contribution
It provides the first formalization of MongoDB's data model and core query language, with complexity analysis and expressivity characterization.
Findings
MQuery is equivalent to nested relational algebra.
Complexity bounds range from LOGSPACE to exponential time.
Characterization of nested relational algebra query evaluation complexity.
Abstract
A significant number of novel database architectures and data models have been proposed during the last decade. While some of these new systems have gained in popularity, they lack a proper formalization, and a precise understanding of the expressivity and the computational properties of the associated query languages. In this paper, we aim at filling this gap, and we do so by considering MongoDB, a widely adopted document database managing complex (tree structured) values represented in a JSON-based data model, equipped with a powerful query mechanism. We provide a formalization of the MongoDB data model, and of a core fragment, called MQuery, of the MongoDB query language. We study the expressivity of MQuery, showing its equivalence with nested relational algebra. We further investigate the computational complexity of significant fragments of it, obtaining several (tight) bounds in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
