# A Holistic Approach for Optimizing DSP Block Utilization of a CNN   implementation on FPGA

**Authors:** Kamel Abdelouahab, Cedric Bourrasset, Maxime Pelcat, Fran\c{c}ois, Berry, Jean-Charles Quinton, Jocelyn Serot

arXiv: 1703.09779 · 2018-05-29

## TL;DR

This paper introduces a holistic method combining approximate computing and design space exploration to optimize DSP block utilization in FPGA-based CNN implementations, achieving high efficiency and accuracy.

## Contribution

It presents a novel approach for optimizing FPGA resource usage in CNNs by exploring data representation and topology variations for improved efficiency.

## Key findings

- Achieved 883 classifications/sec at 256x256 resolution.
- Utilized only 8% of available DSP blocks.
- Generated dataflow architectures for 76 CNN topologies.

## Abstract

Deep Neural Networks are becoming the de-facto standard models for image understanding, and more generally for computer vision tasks. As they involve highly parallelizable computations, CNN are well suited to current fine grain programmable logic devices. Thus, multiple CNN accelerators have been successfully implemented on FPGAs. Unfortunately, FPGA resources such as logic elements or DSP units remain limited. This work presents a holistic method relying on approximate computing and design space exploration to optimize the DSP block utilization of a CNN implementation on an FPGA. This method was tested when implementing a reconfigurable OCR convolutional neural network on an Altera Stratix V device and varying both data representation and CNN topology in order to find the best combination in terms of DSP block utilization and classification accuracy. This exploration generated dataflow architectures of 76 CNN topologies with 5 different fixed point representation. Most efficient implementation performs 883 classifications/sec at 256 x 256 resolution using 8% of the available DSP blocks.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1703.09779/full.md

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/1703.09779/full.md

## References

21 references — full list in the complete paper: https://tomesphere.com/paper/1703.09779/full.md

---
Source: https://tomesphere.com/paper/1703.09779