Improving Deep Video Compression by Resolution-adaptive Flow Coding

Zhihao Hu (1); Zhenghao Chen (2); Dong Xu (2); Guo Lu (3); Wanli; Ouyang (2); Shuhang Gu (2) ((1) College of Software; Beihang University,; China; (2) School of Electrical; Information Engineering; The University; of Sydney; Australia; (3) School of Computer Science & Technology; Beijing; Institute of Technology; China)

arXiv:2009.05982·cs.CV·September 15, 2020·5 cites

Improving Deep Video Compression by Resolution-adaptive Flow Coding

Zhihao Hu (1), Zhenghao Chen (2), Dong Xu (2), Guo Lu (3), Wanli, Ouyang (2), Shuhang Gu (2) ((1) College of Software, Beihang University,, China, (2) School of Electrical, Information Engineering, The University, of Sydney, Australia, (3) School of Computer Science & Technology

PDF

Open Access

TL;DR

This paper introduces Resolution-adaptive Flow Coding (RaFC), a novel framework that enhances deep video compression by adaptively selecting optimal resolutions for flow map encoding at both frame and block levels, improving efficiency.

Contribution

The paper proposes a new multi-resolution approach for flow map compression in deep video codecs, with adaptive resolution selection at frame and block levels based on rate-distortion criteria.

Findings

01

RaFC significantly improves compression efficiency on benchmark datasets.

02

Adaptive resolution selection outperforms fixed-resolution methods.

03

The framework achieves better rate-distortion performance across multiple datasets.

Abstract

In the learning based video compression approaches, it is an essential issue to compress pixel-level optical flow maps by developing new motion vector (MV) encoders. In this work, we propose a new framework called Resolution-adaptive Flow Coding (RaFC) to effectively compress the flow maps globally and locally, in which we use multi-resolution representations instead of single-resolution representations for both the input flow maps and the output motion features of the MV encoder. To handle complex or simple motion patterns globally, our frame-level scheme RaFC-frame automatically decides the optimal flow map resolution for each video frame. To cope different types of motion patterns locally, our block-level scheme called RaFC-block can also select the optimal resolution for each local block of motion features. In addition, the rate-distortion criterion is applied to both RaFC-frame and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Video Coding and Compression Technologies