QuickSRNet: Plain Single-Image Super-Resolution Architecture for Faster   Inference on Mobile Platforms

Guillaume Berger; Manik Dhingra; Antoine Mercier; Yashesh; Savani; Sunny Panchal; Fatih Porikli

arXiv:2303.04336·eess.IV·May 16, 2023·1 cites

QuickSRNet: Plain Single-Image Super-Resolution Architecture for Faster Inference on Mobile Platforms

Guillaume Berger, Manik Dhingra, Antoine Mercier, Yashesh, Savani, Sunny Panchal, Fatih Porikli

PDF

Open Access 3 Models

TL;DR

QuickSRNet is a lightweight super-resolution model designed for real-time 1080p upscaling on mobile devices, achieving high accuracy with minimal latency suitable for gaming and video applications.

Contribution

The paper introduces QuickSRNet, a novel architecture that improves accuracy-to-latency trade-offs for mobile super-resolution and includes training techniques for robustness and speed.

Findings

01

Produces 1080p outputs via 2x upscaling in 2.2 ms on a smartphone

02

Outperforms existing architectures in accuracy-to-latency trade-offs

03

Maintains robustness to quantization

Abstract

In this work, we present QuickSRNet, an efficient super-resolution architecture for real-time applications on mobile platforms. Super-resolution clarifies, sharpens, and upscales an image to higher resolution. Applications such as gaming and video playback along with the ever-improving display capabilities of TVs, smartphones, and VR headsets are driving the need for efficient upscaling solutions. While existing deep learning-based super-resolution approaches achieve impressive results in terms of visual quality, enabling real-time DL-based super-resolution on mobile devices with compute, thermal, and power constraints is challenging. To address these challenges, we propose QuickSRNet, a simple yet effective architecture that provides better accuracy-to-latency trade-offs than existing neural architectures for single-image super resolution. We present training tricks to speed up…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Advanced Vision and Imaging · Image Processing Techniques and Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings