Pose-guided Generative Adversarial Net for Novel View Action Synthesis

Xianhang Li; Junhao Zhang; Kunchang Li; Shruti Vyas; Yogesh S Rawat

arXiv:2110.07993·cs.CV·December 10, 2021

Pose-guided Generative Adversarial Net for Novel View Action Synthesis

Xianhang Li, Junhao Zhang, Kunchang Li, Shruti Vyas, Yogesh S Rawat

PDF

Open Access

TL;DR

This paper introduces PAS-GAN, a novel pose-guided framework for synthesizing realistic, temporally coherent videos of human actions from unseen viewpoints, leveraging pose transformation and a multi-scale loss.

Contribution

The paper proposes a new pose-guided GAN framework with a recurrent pose transformation and action-separable loss for improved novel view human action synthesis.

Findings

01

Outperforms existing methods on NTU-RGBD and PKU-MMD datasets.

02

Effectively separates action and background in generated videos.

03

Achieves high temporal coherence and realism in synthesized videos.

Abstract

We focus on the problem of novel-view human action synthesis. Given an action video, the goal is to generate the same action from an unseen viewpoint. Naturally, novel view video synthesis is more challenging than image synthesis. It requires the synthesis of a sequence of realistic frames with temporal coherency. Besides, transferring the different actions to a novel target view requires awareness of action category and viewpoint change simultaneously. To address these challenges, we propose a novel framework named Pose-guided Action Separable Generative Adversarial Net (PAS-GAN), which utilizes pose to alleviate the difficulty of this task. First, we propose a recurrent pose-transformation module which transforms actions from the source view to the target view and generates novel view pose sequence in 2D coordinate space. Second, a well-transformed pose sequence enables us to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Human Pose and Action Recognition · Generative Adversarial Networks and Image Synthesis