Towards a Unified Architecture for in-RDBMS Analytics
Xixuan Feng, Arun Kumar, Ben Recht, Christopher R\'e

TL;DR
This paper proposes a unified architecture for in-database analytics that simplifies integration, improves performance, and enables generic optimization across multiple techniques in RDBMSs.
Contribution
It introduces a unified architecture for in-database analytics, enabling generic performance optimization and easy integration of new techniques with minimal code changes.
Findings
Achieves competitive or higher performance compared to native tools
Requires only a few dozen lines of code to add new techniques
Demonstrates feasibility across commercial and open-source RDBMSs
Abstract
The increasing use of statistical data analysis in enterprise applications has created an arms race among database vendors to offer ever more sophisticated in-database analytics. One challenge in this race is that each new statistical technique must be implemented from scratch in the RDBMS, which leads to a lengthy and complex development process. We argue that the root cause for this overhead is the lack of a unified architecture for in-database analytics. Our main contribution in this work is to take a step towards such a unified architecture. A key benefit of our unified architecture is that performance optimizations for analytics techniques can be studied generically instead of an ad hoc, per-technique fashion. In particular, our technical contributions are theoretical and empirical studies of two key factors that we found impact performance: the order data is stored, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Distributed systems and fault tolerance · Cloud Computing and Resource Management
