In-RDBMS Hardware Acceleration of Advanced Analytics
Divya Mahajan, Joon Kyung Kim, Jacob Sacks, Adel Ardalan, Arun Kumar,, Hadi Esmaeilzadeh

TL;DR
This paper introduces DAnA, a hardware acceleration framework that automatically maps advanced analytics queries to FPGA accelerators within a database, significantly speeding up data processing without requiring hardware design expertise.
Contribution
DAnA is the first system to automatically generate FPGA-based accelerators from high-level analytics queries embedded in SQL and Python, integrating seamlessly with databases.
Findings
8.3x average speedup on real datasets
28.2x maximum speedup achieved
4.0x faster than Apache MADLib on Greenplum
Abstract
The data revolution is fueled by advances in machine learning, databases, and hardware design. Programmable accelerators are making their way into each of these areas independently. As such, there is a void of solutions that enables hardware acceleration at the intersection of these disjoint fields. This paper sets out to be the initial step towards a unifying solution for in-Database Acceleration of Advanced Analytics (DAnA). Deploying specialized hardware, such as FPGAs, for in-database analytics currently requires hand-designing the hardware and manually routing the data. Instead, DAnA automatically maps a high-level specification of advanced analytics queries to an FPGA accelerator. The accelerator implementation is generated for a User Defined Function (UDF), expressed as a part of an SQL query using a Python-embedded Domain-Specific Language (DSL). To realize an efficient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
