Serving Deep Learning Model in Relational Databases
Lixi Zhou, Qi Lin, Kanchan Chowdhury, Saif Masood, Alexandre, Eichenberger, Hong Min, Alexander Sim, Jie Wang, Yida Wang, Kesheng Wu,, Binhang Yuan, Jia Zou

TL;DR
This paper explores architectures for serving deep learning models within relational databases, analyzing their strengths, limitations, and integration challenges to enable efficient data-intensive DL inference.
Contribution
It provides a comprehensive analysis of DL-centric, UDF-centric, and relation-centric architectures, proposing strategies for their integration within RDBMS.
Findings
Identifies key gaps hindering architecture integration
Proposes pathways for seamless architecture combination
Lays groundwork for a new RDBMS supporting DL inference
Abstract
Serving deep learning (DL) models on relational data has become a critical requirement across diverse commercial and scientific domains, sparking growing interest recently. In this visionary paper, we embark on a comprehensive exploration of representative architectures to address the requirement. We highlight three pivotal paradigms: The state-of-the-art DL-centric architecture offloads DL computations to dedicated DL frameworks. The potential UDF-centric architecture encapsulates one or more tensor computations into User Defined Functions (UDFs) within the relational database management system (RDBMS). The potential relation-centric architecture aims to represent a large-scale tensor computation through relational operators. While each of these architectures demonstrates promise in specific use scenarios, we identify urgent requirements for seamless integration of these architectures…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Software System Performance and Reliability · Parallel Computing and Optimization Techniques
