Gated Fusion Network for SAO Filter and Inter Frame Prediction in   Versatile Video Coding

Shiba Kuanar; Dwarikanath Mahapatra; Vassilis Athitsos; K.R Rao

arXiv:2105.12229·eess.IV·May 27, 2021·Signal Process. Image Commun.

Gated Fusion Network for SAO Filter and Inter Frame Prediction in Versatile Video Coding

Shiba Kuanar, Dwarikanath Mahapatra, Vassilis Athitsos, K.R Rao

PDF

Open Access

TL;DR

This paper introduces a deep learning-based filter for VVC that enhances video quality and reduces bit rate by effectively suppressing artifacts without additional signaling overhead.

Contribution

A novel multi-scale CNN model with variable filter size and deconvolution for intra-inter frame quality enhancement in VVC.

Findings

01

Outperforms baseline VVC in BD-BR and BD-PSNR metrics.

02

Achieves 3.762% average bit rate savings.

03

Effectively suppresses visual artifacts like ringing and blocking.

Abstract

To achieve higher coding efficiency, Versatile Video Coding (VVC) includes several novel components, but at the expense of increasing decoder computational complexity. These technologies at a low bit rate often create contouring and ringing effects on the reconstructed frames and introduce various blocking artifacts at block boundaries. To suppress those visual artifacts, the VVC framework supports four post-processing filter operations. The interoperation of these filters introduces extra signaling bits and eventually becomes overhead at higher resolution video processing. In this paper, a novel deep learning-based model is proposed for sample adaptive offset (SAO) nonlinear filtering operation and substantiated the merits of intra-inter frame quality enhancement. We introduced a variable filter size multi-scale CNN (MSCNN) to improve the denoising operation and incorporated strided…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Image and Signal Denoising Methods · Digital Media Forensic Detection