The BigDAWG Architecture
Vijay Gadepally, Jennie Duggan, Aaron Elmore, Jeremy Kepner, Samuel, Madden, Tim Mattson, Michael Stonebraker

TL;DR
BigDAWG is a polystore system architecture that enables seamless integration and querying across diverse database systems with different data models, demonstrated on medical data.
Contribution
This paper introduces the BigDAWG architecture, supporting multi-engine data management with a uniform interface and application to complex medical datasets.
Findings
Successful implementation of BigDAWG architecture
Application to MIMIC II medical dataset
Future plans for cross-system query mechanics
Abstract
BigDAWG is a polystore system designed to work on complex problems that naturally span across different processing or storage engines. BigDAWG provides an architecture that supports diverse database systems working with different data models, support for the competing notions of location transparency and semantic completeness via islands of information and a middleware that provides a uniform multi-island interface. In this article, we describe the current architecture of BigDAWG, its application on the MIMIC II medical dataset, and our plans for the mechanics of cross-system queries. During the presentation, we will also deliver a brief demonstration of the current version of BigDAWG.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Data Management and Algorithms · Data Quality and Management
