DTSGAN: Learning Dynamic Textures via Spatiotemporal Generative   Adversarial Network

Xiangtian Li; Xiaobo Wang; Zhen Qi; Han Cao; Zhaoyang Zhang; Ao Xiang

arXiv:2412.16948·cs.CV·December 24, 2024·2 cites

DTSGAN: Learning Dynamic Textures via Spatiotemporal Generative Adversarial Network

Xiangtian Li, Xiaobo Wang, Zhen Qi, Han Cao, Zhaoyang Zhang, Ao Xiang

PDF

Open Access

TL;DR

DTSGAN is a novel spatiotemporal GAN that learns from a single dynamic texture to generate diverse, high-quality video sequences with natural motion, advancing dynamic texture synthesis.

Contribution

Introduces DTSGAN, a new method for learning dynamic textures from a single example using a multi-scale pipeline and a data update strategy to enhance diversity.

Findings

01

Generates high-quality dynamic textures with natural motion

02

Outperforms existing methods in diversity and realism

03

Effective in learning from a single sample

Abstract

Dynamic texture synthesis aims to generate sequences that are visually similar to a reference video texture and exhibit specific stationary properties in time. In this paper, we introduce a spatiotemporal generative adversarial network (DTSGAN) that can learn from a single dynamic texture by capturing its motion and content distribution. With the pipeline of DTSGAN, a new video sequence is generated from the coarsest scale to the finest one. To avoid mode collapse, we propose a novel strategy for data updates that helps improve the diversity of generated results. Qualitative and quantitative experiments show that our model is able to generate high quality dynamic textures and natural motion.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Processing and 3D Reconstruction · Generative Adversarial Networks and Image Synthesis · Image Retrieval and Classification Techniques