An Energy-Efficient FPGA-based Deconvolutional Neural Networks   Accelerator for Single Image Super-Resolution

Jung-Woo Chang; Keon-Woo Kang; and Suk-Ju Kang

arXiv:1801.05997·cs.DC·December 19, 2018

An Energy-Efficient FPGA-based Deconvolutional Neural Networks Accelerator for Single Image Super-Resolution

Jung-Woo Chang, Keon-Woo Kang, and Suk-Ju Kang

PDF

TL;DR

This paper introduces an energy-efficient FPGA-based accelerator for real-time single image super-resolution using deconvolutional neural networks, achieving high throughput and low power consumption for UHD applications.

Contribution

It proposes novel optimization methods for deconvolutional CNNs and dataflow, along with quantization techniques, enabling efficient high-resolution image processing on FPGA hardware.

Findings

01

Achieves up to 108x throughput improvement over conventional accelerators.

02

Attains energy efficiency of up to 500.2 GOPS/W at SR scale factor 4.

03

Restores high-quality images with reduced data bit-width and fewer parameters.

Abstract

Convolutional neural networks (CNNs) demonstrate excellent performance in various computer vision applications. In recent years, FPGA-based CNN accelerators have been proposed for optimizing performance and power efficiency. Most accelerators are designed for object detection and recognition algorithms that are performed on low-resolution (LR) images. However, real-time image super-resolution (SR) cannot be implemented on a typical accelerator because of the long execution cycles required to generate high-resolution (HR) images, such as those used in ultra-high-definition (UHD) systems. In this paper, we propose a novel CNN accelerator with efficient parallelization methods for SR applications. First, we propose a new methodology for optimizing the deconvolutional neural networks (DCNNs) used for increasing feature maps. Secondly, we propose a novel method to optimize CNN dataflow so…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.