The ReprGesture entry to the GENEA Challenge 2022

Sicheng Yang; Zhiyong Wu; Minglei Li; Mengchen Zhao; Jiuxin Lin,; Liyang Chen; Weihong Bao

arXiv:2208.12133·cs.HC·August 26, 2022

The ReprGesture entry to the GENEA Challenge 2022

Sicheng Yang, Zhiyong Wu, Minglei Li, Mengchen Zhao, Jiuxin Lin,, Liyang Chen, Weihong Bao

PDF

1 Repo

TL;DR

This paper presents ReprGesture, a multimodal gesture generation system for embodied agents, utilizing adversarial training and diverse feature representations to produce contextually appropriate non-verbal behaviors evaluated in the GENEA challenge.

Contribution

The paper introduces a novel multimodal representation learning approach with adversarial training for gesture generation, advancing the state-of-the-art in non-verbal behavior synthesis.

Findings

01

Effective use of WavLM and FastText features for gesture generation.

02

Adversarial training improves modality-invariant feature learning.

03

System performs well in GENEA challenge evaluations.

Abstract

This paper describes the ReprGesture entry to the Generation and Evaluation of Non-verbal Behaviour for Embodied Agents (GENEA) challenge 2022. The GENEA challenge provides the processed datasets and performs crowdsourced evaluations to compare the performance of different gesture generation systems. In this paper, we explore an automatic gesture generation system based on multimodal representation learning. We use WavLM features for audio, FastText features for text and position and rotation matrix features for gesture. Each modality is projected to two distinct subspaces: modality-invariant and modality-specific. To learn inter-modality-invariant commonalities and capture the characters of modality-specific representations, gradient reversal layer based adversarial classifier and modality reconstruction decoders are used during training. The gesture decoder generates proper gestures…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

YoungSeng/ReprGesture
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsfastText