# Deep Frame Prediction for Video Coding

**Authors:** Hyomin Choi, Ivan V. Bajic

arXiv: 1901.00062 · 2019-06-24

## TL;DR

This paper introduces a deep neural network-based frame prediction method for video coding that improves efficiency without requiring motion information, achieving notable bitrate reductions compared to HEVC.

## Contribution

It presents a novel DNN approach for frame prediction in video coding that eliminates the need for motion transfer, enabling both uni- and bi-directional predictions with improved compression.

## Key findings

- Joint DNN reduces luminance bitrate by up to 4.4%.
- Separate DNNs provide additional 0.3%-0.5% bit savings.
- Method outperforms conventional motion-compensated prediction in HEVC.

## Abstract

We propose a novel frame prediction method using a deep neural network (DNN), with the goal of improving video coding efficiency. The proposed DNN makes use of decoded frames, at both encoder and decoder, to predict textures of the current coding block. Unlike conventional inter-prediction, the proposed method does not require any motion information to be transferred between the encoder and the decoder. Still, both uni-directional and bi-directional prediction are possible using the proposed DNN, which is enabled by the use of the temporal index channel, in addition to color channels. In this study, we developed a jointly trained DNN for both uni- and bi- directional prediction, as well as separate networks for uni- and bi-directional prediction, and compared the efficacy of both approaches. The proposed DNNs were compared with the conventional motion-compensated prediction in the latest video coding standard, HEVC, in terms of BD-Bitrate. The experiments show that the proposed joint DNN (for both uni- and bi-directional prediction) reduces the luminance bitrate by about 4.4%, 2.4%, and 2.3% in the Low delay P, Low delay, and Random access configurations, respectively. In addition, using the separately trained DNNs brings further bit savings of about 0.3%-0.5%.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1901.00062/full.md

## Figures

40 figures with captions in the complete paper: https://tomesphere.com/paper/1901.00062/full.md

## References

51 references — full list in the complete paper: https://tomesphere.com/paper/1901.00062/full.md

---
Source: https://tomesphere.com/paper/1901.00062