In-RDBMS Hardware Acceleration of Advanced Analytics

Divya Mahajan; Joon Kyung Kim; Jacob Sacks; Adel Ardalan; Arun Kumar,; Hadi Esmaeilzadeh

arXiv:1801.06027·cs.DB·September 19, 2018

In-RDBMS Hardware Acceleration of Advanced Analytics

Divya Mahajan, Joon Kyung Kim, Jacob Sacks, Adel Ardalan, Arun Kumar,, Hadi Esmaeilzadeh

PDF

TL;DR

This paper introduces DAnA, a hardware acceleration framework that automatically maps advanced analytics queries to FPGA accelerators within a database, significantly speeding up data processing without requiring hardware design expertise.

Contribution

DAnA is the first system to automatically generate FPGA-based accelerators from high-level analytics queries embedded in SQL and Python, integrating seamlessly with databases.

Findings

01

8.3x average speedup on real datasets

02

28.2x maximum speedup achieved

03

4.0x faster than Apache MADLib on Greenplum

Abstract

The data revolution is fueled by advances in machine learning, databases, and hardware design. Programmable accelerators are making their way into each of these areas independently. As such, there is a void of solutions that enables hardware acceleration at the intersection of these disjoint fields. This paper sets out to be the initial step towards a unifying solution for in-Database Acceleration of Advanced Analytics (DAnA). Deploying specialized hardware, such as FPGAs, for in-database analytics currently requires hand-designing the hardware and manually routing the data. Instead, DAnA automatically maps a high-level specification of advanced analytics queries to an FPGA accelerator. The accelerator implementation is generated for a User Defined Function (UDF), expressed as a part of an SQL query using a Python-embedded Domain-Specific Language (DSL). To realize an efficient…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.