Efficient Super Resolution Using Binarized Neural Network
Yinglan Ma, Hongyu Xiong, Zhe Hu, Lizhuang Ma

TL;DR
This paper introduces a binarized neural network approach for single-image super-resolution that maintains high accuracy while significantly reducing model size and inference time, enabling more efficient deployment on mobile devices.
Contribution
The paper proposes a novel binarization method for residual blocks in super-resolution networks, achieving comparable accuracy with much smaller models and faster inference.
Findings
Binarized SR networks achieve similar quality to real-weight models.
Model size reduced by 80% with binarization.
Inference speed potentially increased by 5 times.
Abstract
Deep convolutional neural networks (DCNNs) have recently demonstrated high-quality results in single-image super-resolution (SR). DCNNs often suffer from over-parametrization and large amounts of redundancy, which results in inefficient inference and high memory usage, preventing massive applications on mobile devices. As a way to significantly reduce model size and computation time, binarized neural network has only been shown to excel on semantic-level tasks such as image classification and recognition. However, little effort of network quantization has been spent on image enhancement tasks like SR, as network quantization is usually assumed to sacrifice pixel-level accuracy. In this work, we explore an network-binarization approach for SR tasks without sacrificing much reconstruction accuracy. To achieve this, we binarize the convolutional filters in only residual blocks, and adopt a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Photoacoustic and Ultrasonic Imaging · Advanced Optical Sensing Technologies
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
