ElasticAI: Creating and Deploying Energy-Efficient Deep Learning   Accelerator for Pervasive Computing

Chao Qian; Tianheng Ling; Gregor Schiele

arXiv:2409.09044·cs.AR·September 17, 2024

ElasticAI: Creating and Deploying Energy-Efficient Deep Learning Accelerator for Pervasive Computing

Chao Qian, Tianheng Ling, Gregor Schiele

PDF

TL;DR

ElasticAI introduces a workflow with tools for automatically creating and deploying energy-efficient deep learning accelerators on embedded FPGAs, facilitating deployment in pervasive computing devices.

Contribution

The paper presents ElasticAI-Workflow, a novel system that automates the design and deployment of FPGA-based DL accelerators for embedded devices.

Findings

01

Successful automatic generation of FPGA accelerators

02

Verified performance on embedded FPGA platform

03

Potential for energy-efficient DL deployment in pervasive computing

Abstract

Deploying Deep Learning (DL) on embedded end devices is a scorching trend in pervasive computing. Since most Microcontrollers on embedded devices have limited computing power, it is necessary to add a DL accelerator. Embedded Field Programmable Gate Arrays (FPGAs) are suitable for deploying DL accelerators for embedded devices, but developing an energy-efficient DL accelerator on an FPGA is not easy. Therefore, we propose the ElasticAI-Workflow that aims to help DL developers to create and deploy DL models as hardware accelerators on embedded FPGAs. This workflow consists of two key components: the ElasticAI-Creator and the Elastic Node. The former is a toolchain for automatically generating DL accelerators on FPGAs. The latter is a hardware platform for verifying the performance of the generated accelerators. With this combination, the performance of the accelerator can be sufficiently…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.