Fixed-Point Convolutional Neural Network for Real-Time Video Processing   in FPGA

Roman Solovyev; Alexander Kustov; Dmitry Telpukhov; Vladimir Rukhlov,; Alexandr Kalinin

arXiv:1808.09945·cs.CV·December 4, 2020

Fixed-Point Convolutional Neural Network for Real-Time Video Processing in FPGA

Roman Solovyev, Alexander Kustov, Dmitry Telpukhov, Vladimir Rukhlov,, Alexandr Kalinin

PDF

2 Repos

TL;DR

This paper introduces a fixed-point convolutional neural network architecture optimized for real-time video processing on FPGAs, emphasizing speed, hardware efficiency, and adaptability for practical camera-based image recognition tasks.

Contribution

The paper presents a novel neural network design with hardware-level optimizations and fixed-point arithmetic, enabling efficient real-time video processing on low-cost FPGAs.

Findings

01

Achieves real-time video processing on inexpensive FPGAs

02

Reduces network complexity with fixed-point arithmetic

03

Demonstrates effective adaptation to new datasets

Abstract

Modern mobile neural networks with a reduced number of weights and parameters do a good job with image classification tasks, but even they may be too complex to be implemented in an FPGA for video processing tasks. The article proposes neural network architecture for the practical task of recognizing images from a camera, which has several advantages in terms of speed. This is achieved by reducing the number of weights, moving from a floating-point to a fixed-point arithmetic, and due to a number of hardware-level optimizations associated with storing weights in blocks, a shift register, and an adjustable number of convolutional blocks that work in parallel. The article also proposed methods for adapting the existing data set for solving a different task. As the experiments showed, the proposed neural network copes well with real-time video processing even on the cheap FPGAs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.