Coherent Online Video Style Transfer

Dongdong Chen; Jing Liao; Lu Yuan; Nenghai Yu; Gang Hua

arXiv:1703.09211·cs.CV·March 29, 2017·39 cites

Coherent Online Video Style Transfer

Dongdong Chen, Jing Liao, Lu Yuan, Nenghai Yu, Gang Hua

PDF

Open Access

TL;DR

This paper introduces an end-to-end neural network for online video style transfer that produces temporally coherent stylized videos in near real-time, significantly improving over frame-by-frame methods.

Contribution

It presents the first efficient, end-to-end network for online video style transfer that maintains temporal coherence over long sequences.

Findings

01

Outperforms per-frame baseline qualitatively and quantitatively

02

Achieves comparable coherence to optimization-based methods

03

Runs three orders of magnitude faster than existing approaches

Abstract

Training a feed-forward network for fast neural style transfer of images is proven to be successful. However, the naive extension to process video frame by frame is prone to producing flickering results. We propose the first end-to-end network for online video style transfer, which generates temporally coherent stylized video sequences in near real-time. Two key ideas include an efficient network by incorporating short-term coherence, and propagating short-term coherence to long-term, which ensures the consistency over larger period of time. Our network can incorporate different image stylization networks. We show that the proposed method clearly outperforms the per-frame baseline both qualitatively and quantitatively. Moreover, it can achieve visually comparable coherence to optimization-based video style transfer, but is three orders of magnitudes faster in runtime.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques · Advanced Vision and Imaging