ACNPU: A 4.75TOPS/W 1080P@30FPS Super Resolution Accelerator with Decoupled Asymmetric Convolution
Tun-Hao Yang, and Tian-Sheuan Chang

TL;DR
ACNPU is an energy-efficient super-resolution accelerator that improves image quality with less complexity, leveraging decoupled asymmetric convolution and innovative hardware design to achieve high performance on resource-limited devices.
Contribution
This paper introduces ACNPU, a novel super-resolution accelerator with a decoupled asymmetric convolution and split-bypass structure, reducing complexity while maintaining high image quality.
Findings
Enhances image quality by 0.34dB with a 27-layer model.
Achieves 4.75 TOPS/W energy efficiency at 1080p@30FPS.
Reduces external DRAM access through holistic model fusion.
Abstract
Deep learning-driven superresolution (SR) outperforms traditional techniques but also faces the challenge of high complexity and memory bandwidth. This challenge leads many accelerators to opt for simpler and shallow models like FSRCNN, compromising performance for real-time needs, especially for resource-limited edge devices. This paper proposes an energy-efficient SR accelerator, ACNPU, to tackle this challenge. The ACNPU enhances image quality by 0.34dB with a 27-layer model, but needs 36\% less complexity than FSRCNN, while maintaining a similar model size, with the \textit{decoupled asymmetric convolution and split-bypass structure}. The hardware-friendly 17K-parameter model enables \textit{holistic model fusion} instead of localized layer fusion to remove external DRAM access of intermediate feature maps. The on-chip memory bandwidth is further reduced with the \textit{input…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Advanced Fluorescence Microscopy Techniques
MethodsOPT · Convolution
