A Novel FPGA-based CNN Hardware Accelerator: Optimization for Convolutional Layers using Karatsuba Ofman Multiplier

Amit Sarkar

arXiv:2412.20393·cs.AR·December 31, 2024

TL;DR

This paper introduces an FPGA-based CNN hardware accelerator optimized for convolutional layers using the Karatsuba-Ofman multiplier, enhancing speed and resource efficiency in deep learning applications.

Contribution

It presents a novel FPGA architecture for CNN acceleration that integrates the Karatsuba-Ofman multiplier to improve convolution efficiency.

Findings

01 Enhanced multiplication speed with fewer hardware resources

02 Effective implementation on FPGA for AlexNet, VGG16, and VGG19

03 Potential for improved CNN processing performance

Abstract

A new architecture for a CNN hardware accelerator is presented. Convolutional Neural Networks (CNNs) are a subclass of neural networks that have demonstrated outstanding performance in a variety of computer vision applications, including object detection and image classification. Convolution, a mathematical operation that multiplies, shifts, and adds a set of input values with a set of learnable parameters known as filters or kernels, is the fundamental component of a CNN. The Karatsuba Ofman multiplier is known for its ability to perform high-speed multiplication with fewer hardware resources than traditional multipliers. This article examines the use of the Karatsuba Ofman multiplier on FPGA in the prominent CNN designs AlexNet, VGG16, and VGG19.
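To make the idea in the abstract concrete, here is a minimal software sketch of Karatsuba Ofman multiplication feeding the multiply-accumulate step of a 2D convolution. It is not the paper's FPGA design: the function names `karatsuba` and `conv2d_valid`, the `base_bits` threshold, and the restriction to unsigned integers are assumptions made purely for illustration. The sketch shows the textbook identity in which a product of two split operands needs three sub-products instead of four.

```python
def karatsuba(x: int, y: int, base_bits: int = 4) -> int:
    """Multiply two non-negative integers with the Karatsuba Ofman split.

    base_bits is a hypothetical threshold: operands narrower than this are
    multiplied directly, standing in for a small primitive multiplier.
    """
    if x < (1 << base_bits) or y < (1 << base_bits):
        return x * y

    # Split both operands at the same bit position.
    half = max(x.bit_length(), y.bit_length()) // 2
    mask = (1 << half) - 1
    x_hi, x_lo = x >> half, x & mask
    y_hi, y_lo = y >> half, y & mask

    # Three recursive products instead of the four a schoolbook split needs.
    hi = karatsuba(x_hi, y_hi, base_bits)
    lo = karatsuba(x_lo, y_lo, base_bits)
    cross = karatsuba(x_hi + x_lo, y_hi + y_lo, base_bits) - hi - lo

    # Recombine with shifts and adds only.
    return (hi << (2 * half)) + (cross << half) + lo


def conv2d_valid(image, kernel):
    """2D 'valid' convolution as used in CNNs (no kernel flip, no padding).

    Assumes non-negative integer pixels and weights so that every product
    can go through karatsuba() without sign handling.
    """
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    out = [[0] * out_w for _ in range(out_h)]
    for r in range(out_h):
        for c in range(out_w):
            acc = 0
            for i in range(kh):
                for j in range(kw):
                    acc += karatsuba(image[r + i][c + j], kernel[i][j])
            out[r][c] = acc
    return out
```

In hardware, the saving comes from trading one multiplier for a few extra adders and shifters at each level of the split; whether that trade pays off for a given operand width depends on the target device, which this sketch does not model.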

Taxonomy

Topics: Quantum-Dot Cellular Automata