An FPGA-based Solution for Convolution Operation Acceleration

Trung Dinh Pham; Bao Gia Bach; Lam Trinh Luu; Minh Dinh Nguyen; Hai Duc Pham; Khoa Bui Anh; Xuan Quang Nguyen; Cuong Pham Quoc

arXiv:2206.04520·cs.AR·February 28, 2023

TL;DR

This paper presents an FPGA-based architecture that accelerates the convolution operation in convolutional neural networks for edge-AI deployment, reporting 0.224 GOPS for a single computing core and 4.48 GOPS when the board is fully utilized.

Contribution

It introduces an FPGA IP core that processes one convolutional layer at a time. The design is written in Verilog HDL so that it can be deployed across FPGA families, and it targets edge-AI applications.

Findings

01. Single core achieves 0.224 GOPS

02. Full utilization yields 4.48 GOPS

03. Demonstrates feasibility for edge-AI hardware acceleration
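
The two throughput figures above can be related with a short calculation. The sketch below only divides the reported numbers. Reading the result as a core count assumes throughput scales linearly with the number of cores, which this page does not state; the function name is hypothetical.

```python
# Reported figures from the findings (GOPS = giga-operations per second).
SINGLE_CORE_GOPS = 0.224
FULL_BOARD_GOPS = 4.48


def implied_scaling_factor(full_gops: float, single_gops: float) -> float:
    """Ratio of full-board throughput to single-core throughput.

    ASSUMPTION: interpreting this ratio as a number of cores is only valid
    if throughput scales linearly with core count; the source does not say so.
    """
    return full_gops / single_gops


ratio = implied_scaling_factor(FULL_BOARD_GOPS, SINGLE_CORE_GOPS)
print(f"Full board is about {ratio:.1f}x a single core")
```

Under the linear-scaling assumption, the ratio would correspond to roughly twenty single cores running in parallel on the board.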

Abstract

Hardware-based acceleration is widely used to speed up computationally intensive mathematical operations. This paper proposes an FPGA-based architecture to accelerate the convolution operation, a complex and expensive computing step that appears in many Convolutional Neural Network models. We target the design to the standard convolution operation, intending to launch the product as an edge-AI solution. The project's purpose is to produce an FPGA IP core that can process one convolutional layer at a time. Because Verilog HDL is the primary design language for the architecture, system developers can deploy the IP core on various FPGA families. The experimental results show that our single computing core, synthesized on a simple edge computing FPGA board, can offer 0.224 GOPS. When the board is fully utilized, 4.48 GOPS can be achieved.
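
For readers unfamiliar with the operation being accelerated, the following is a minimal software reference for a standard convolution layer, not the authors' hardware design. It assumes stride 1, no padding, and the CNN convention of not flipping the kernel (cross-correlation). The operation count treats each multiply-accumulate as two operations, a common convention that may differ from how the paper counts GOPS. All function names are hypothetical.

```python
def conv2d(inp, kernels):
    """Standard convolution: inp is [C][H][W], kernels is [K][C][R][S].

    ASSUMPTIONS: stride 1, no padding, kernel not flipped.
    Returns the output as a nested list of shape [K][H-R+1][W-S+1].
    """
    channels, height, width = len(inp), len(inp[0]), len(inp[0][0])
    num_k, k_h, k_w = len(kernels), len(kernels[0][0]), len(kernels[0][0][0])
    out_h, out_w = height - k_h + 1, width - k_w + 1
    out = [[[0] * out_w for _ in range(out_h)] for _ in range(num_k)]
    for k in range(num_k):
        for y in range(out_h):
            for x in range(out_w):
                acc = 0
                # Accumulate over all input channels and kernel positions.
                for c in range(channels):
                    for r in range(k_h):
                        for s in range(k_w):
                            acc += inp[c][y + r][x + s] * kernels[k][c][r][s]
                out[k][y][x] = acc
    return out


def conv_ops(channels, height, width, num_k, k_h, k_w):
    """Operation count for one layer, counting each MAC as 2 ops (assumed)."""
    macs = num_k * (height - k_h + 1) * (width - k_w + 1) * channels * k_h * k_w
    return 2 * macs


def seconds_at(ops, gops):
    """Idealized run time of `ops` operations at a sustained rate of `gops`."""
    return ops / (gops * 1e9)


image = [[[1, 2, 3], [4, 5, 6], [7, 8, 9]]]   # one channel, 3x3
kernel = [[[[1, 0], [0, 1]]]]                 # one 2x2 filter
result = conv2d(image, kernel)
print(result)
print(conv_ops(1, 3, 3, 1, 2, 2))
```

The six nested loops are what makes the operation expensive in software, and they are independent enough across output positions to be a natural target for parallel hardware. `seconds_at` gives only an idealized estimate, since it ignores memory transfer and control overhead.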

Code & Models

Repositories

trung-pham-dinh/CNN-on-FPGA (Official)

Taxonomy

Topics: Parallel Computing and Optimization Techniques · Neural Networks and Applications

Methods: Convolution