ONNX-to-Hardware Design Flow for the Generation of Adaptive   Neural-Network Accelerators on FPGAs

Federico Manca; Francesco Ratto

arXiv:2309.13321·cs.AR·September 26, 2023

ONNX-to-Hardware Design Flow for the Generation of Adaptive Neural-Network Accelerators on FPGAs

Federico Manca, Francesco Ratto

PDF

Open Access

TL;DR

This paper explores a new FPGA-based design flow for adaptive neural network accelerators, combining existing toolchains with approximate computing to enable lightweight, flexible, and energy-efficient NN inference at the edge.

Contribution

It introduces a novel approach that integrates adaptive toolchains with approximate computing for FPGA-based neural network acceleration.

Findings

01

Reviewed existing frameworks with streaming architecture

02

Proposed a combined approach for adaptivity and efficiency

03

Lays groundwork for lightweight edge NN inference

Abstract

Neural Networks (NN) provide a solid and reliable way of executing different types of applications, ranging from speech recognition to medical diagnosis, speeding up onerous and long workloads. The challenges involved in their implementation at the edge include providing diversity, flexibility, and sustainability. That implies, for instance, supporting evolving applications and algorithms energy-efficiently. Using hardware or software accelerators can deliver fast and efficient computation of the \acp{nn}, while flexibility can be exploited to support long-term adaptivity. Nonetheless, handcrafting an NN for a specific device, despite the possibility of leading to an optimal solution, takes time and experience, and that's why frameworks for hardware accelerators are being developed. This work-in-progress study focuses on exploring the possibility of combining the toolchain proposed by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Parallel Computing and Optimization Techniques · Neural Networks and Applications