TL;DR
This paper introduces the concept of direct hardware mapping (DHM) for CNNs on FPGAs, demonstrating its feasibility and providing a tool to automatically generate hardware descriptions, enabling efficient CNN deployment on embedded FPGA devices.
Contribution
It proposes the novel approach of directly mapping CNN graphs onto FPGA resources and presents the HADDOC2 tool for automatic hardware generation.
Findings
Feasibility of direct hardware mapping for CNNs on FPGAs.
Development of the HADDOC2 tool for automatic hardware synthesis.
Potential for improved CNN acceleration on embedded FPGA platforms.
Abstract
Deep Convolutional Neural Networks (CNNs) are the state-of-the-art in image classification. Since CNN feed forward propagation involves highly regular parallel computation, it benefits from a significant speed-up when running on fine grain parallel programmable logic devices. As a consequence, several studies have proposed FPGA-based accelerators for CNNs. However, because of the large computationalpower required by CNNs, none of the previous studies has proposed a direct mapping of the CNN onto the physical resources of an FPGA, allocating each processing actor to its own hardware instance.In this paper, we demonstrate the feasibility of the so called direct hardware mapping (DHM) and discuss several tactics we explore to make DHM usable in practice. As a proof of concept, we introduce the HADDOC2 open source tool, that automatically transforms a CNN description into a synthesizable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
