VCGAN: Video Colorization with Hybrid Generative Adversarial Network

Yuzhi Zhao; Lai-Man Po; Wing-Yin Yu; Yasar Abbas Ur Rehman; Mengyang; Liu; Yujia Zhang; Weifeng Ou

arXiv:2104.12357·cs.CV·May 9, 2023

VCGAN: Video Colorization with Hybrid Generative Adversarial Network

Yuzhi Zhao, Lai-Man Po, Wing-Yin Yu, Yasar Abbas Ur Rehman, Mengyang, Liu, Yujia Zhang, Weifeng Ou

PDF

1 Repo

TL;DR

VCGAN is a novel end-to-end hybrid GAN model that improves video colorization by enhancing temporal consistency and integrating colorization and refinement into a single architecture, leading to higher quality results.

Contribution

The paper introduces VCGAN, a hybrid recurrent GAN that unifies colorization and refinement networks, and incorporates global and placeholder feature extractors for better quality and temporal consistency.

Findings

01

VCGAN outperforms existing methods in color quality.

02

VCGAN achieves superior temporal consistency in videos.

03

The dense long-term loss improves far-frame coherence.

Abstract

We propose a hybrid recurrent Video Colorization with Hybrid Generative Adversarial Network (VCGAN), an improved approach to video colorization using end-to-end learning. The VCGAN addresses two prevalent issues in the video colorization domain: Temporal consistency and unification of colorization network and refinement network into a single architecture. To enhance colorization quality and spatiotemporal consistency, the mainstream of generator in VCGAN is assisted by two additional networks, i.e., global feature extractor and placeholder feature extractor, respectively. The global feature extractor encodes the global semantics of grayscale input to enhance colorization quality, whereas the placeholder feature extractor acts as a feedback connection to encode the semantics of the previous colorized frame in order to maintain spatiotemporal consistency. If changing the input for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhaoyuzhi/VCGAN
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsColorization