An Energy-Efficient FPGA-based Deconvolutional Neural Networks Accelerator for Single Image Super-Resolution
Jung-Woo Chang, Keon-Woo Kang, and Suk-Ju Kang

TL;DR
This paper introduces an energy-efficient FPGA-based accelerator for real-time single image super-resolution using deconvolutional neural networks, achieving high throughput and low power consumption for UHD applications.
Contribution
It proposes novel optimization methods for deconvolutional CNNs and dataflow, along with quantization techniques, enabling efficient high-resolution image processing on FPGA hardware.
Findings
Achieves up to 108x throughput improvement over conventional accelerators.
Attains energy efficiency of up to 500.2 GOPS/W at SR scale factor 4.
Restores high-quality images with reduced data bit-width and fewer parameters.
Abstract
Convolutional neural networks (CNNs) demonstrate excellent performance in various computer vision applications. In recent years, FPGA-based CNN accelerators have been proposed for optimizing performance and power efficiency. Most accelerators are designed for object detection and recognition algorithms that are performed on low-resolution (LR) images. However, real-time image super-resolution (SR) cannot be implemented on a typical accelerator because of the long execution cycles required to generate high-resolution (HR) images, such as those used in ultra-high-definition (UHD) systems. In this paper, we propose a novel CNN accelerator with efficient parallelization methods for SR applications. First, we propose a new methodology for optimizing the deconvolutional neural networks (DCNNs) used for increasing feature maps. Secondly, we propose a novel method to optimize CNN dataflow so…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
