Video compression with low complexity CNN-based spatial resolution   adaptation

Di Ma; Fan Zhang; David R. Bull

arXiv:2007.14726·eess.IV·January 21, 2021

Video compression with low complexity CNN-based spatial resolution adaptation

Di Ma, Fan Zhang, David R. Bull

PDF

TL;DR

This paper introduces a low-complexity CNN-based spatial resolution adaptation framework for video compression, achieving over 10% bitrate savings and reducing computational load at encoder and decoder.

Contribution

It proposes a novel approach combining CNN-based down-sampling with Lanczos3 up-sampling, balancing complexity and performance in video compression.

Findings

01

Over 10% bitrate savings compared to HEVC HM

02

29% reduction in encoder complexity

03

10% reduction in decoder complexity

Abstract

It has recently been demonstrated that spatial resolution adaptation can be integrated within video compression to improve overall coding performance by spatially down-sampling before encoding and super-resolving at the decoder. Significant improvements have been reported when convolutional neural networks (CNNs) were used to perform the resolution up-sampling. However, this approach suffers from high complexity at the decoder due to the employment of CNN-based super-resolution. In this paper, a novel framework is proposed which supports the flexible allocation of complexity between the encoder and decoder. This approach employs a CNN model for video down-sampling at the encoder and uses a Lanczos3 filter to reconstruct full resolution at the decoder. The proposed method was integrated into the HEVC HM 16.20 software and evaluated on JVET UHD test sequences using the All Intra…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.