FFCNN: Fast FPGA based Acceleration for Convolution neural network inference
F. Keddous, H-N. Nguyen, and A. Nakib

TL;DR
This paper introduces FFCNN, an efficient FPGA-based OpenCL accelerator for large-scale CNN inference, utilizing deep pipelining, data reuse, and task mapping to enhance performance and resource efficiency.
Contribution
The paper presents a novel FPGA implementation of CNN inference using OpenCL, with techniques for data reuse and task mapping to improve efficiency and performance.
Findings
Significantly improved performance over existing accelerators.
Efficient resource utilization on FPGA hardware.
Effective data reuse and task mapping techniques.
Abstract
We present a new efficient OpenCL-based Accelerator for large scale Convolutional Neural Networks called Fast Inference on FPGAs for Convolution Neural Network (FFCNN). FFCNN is based on a deeply pipelined OpenCL kernels architecture. As pointed out before, high-level synthesis tools such as the OpenCL framework can easily port codes originally designed for CPUs and GPUs to FPGAs, but it is still difficult to make OpenCL codes run efficiently on FPGAs. This work aims to propose an efficient FPGA implementation of OpenCL High-Performance Computing Applications. To do so, a Data reuse and task mapping techniques are also presented to improve design efficiency. In addition, the following motivations were taken into account when developing FFCNN: 1) FFCNN has been designed to be easily implemented on Intel OpenCL SDK based FPGA design flow. 2) In FFFCN, different techniques have been…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · CCD and CMOS Imaging Sensors
MethodsConvolution
