Generative Models at the Frontier of Compression: A Survey on Generative Face Video Coding

Bolin Chen; Shanzhi Yin; Goluck Konuko; Giuseppe Valenzise; Zihan Zhang; Shiqi Wang; Yan Ye

arXiv:2506.07369·cs.CV·June 10, 2025

Generative Models at the Frontier of Compression: A Survey on Generative Face Video Coding

Bolin Chen, Shanzhi Yin, Goluck Konuko, Giuseppe Valenzise, Zihan Zhang, Shiqi Wang, Yan Ye

PDF

Open Access

TL;DR

This survey reviews the emerging field of Generative Face Video Coding (GFVC), highlighting its potential to revolutionize face video compression through deep generative models, and discusses standardization, benchmarking, and future directions.

Contribution

This paper provides the first comprehensive survey of GFVC, including a review of methods, benchmarking analysis, a new face video database, and insights into standardization and future industrial applications.

Findings

01

GFVC achieves ultra-low bitrate high-fidelity face video reconstruction.

02

Benchmarking reveals key differences among GFVC methods.

03

A large-scale GFVC face video database was constructed for evaluation.

Abstract

The rise of deep generative models has greatly advanced video compression, reshaping the paradigm of face video coding through their powerful capability for semantic-aware representation and lifelike synthesis. Generative Face Video Coding (GFVC) stands at the forefront of this revolution, which could characterize complex facial dynamics into compact latent codes for bitstream compactness at the encoder side and leverages powerful deep generative models to reconstruct high-fidelity face signal from the compressed latent codes at the decoder side. As such, this well-designed GFVC paradigm could enable high-fidelity face video communication at ultra-low bitrate ranges, far surpassing the capabilities of the latest Versatile Video Coding (VVC) standard. To pioneer foundational research and accelerate the evolution of GFVC, this paper presents the first comprehensive survey of GFVC…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Advanced Data Compression Techniques