Generative Adversarial Networks in Human Emotion Synthesis:A Review
Noushin Hajarolasvadi, Miguel Arjona Ram\'irez, Hasan Demirel

TL;DR
This review paper surveys recent advances in using generative adversarial networks for synthesizing human emotions through facial expressions, speech, and cross-modal approaches, highlighting datasets, methods, and future challenges.
Contribution
It provides a comprehensive overview of GAN-based human emotion synthesis, analyzing databases, techniques, and open problems across audio and video modalities.
Findings
GANs effectively synthesize facial expressions and speech emotions.
Cross-modal emotion synthesis shows promising results.
Open research challenges include data quality and model generalization.
Abstract
Synthesizing realistic data samples is of great value for both academic and industrial communities. Deep generative models have become an emerging topic in various research areas like computer vision and signal processing. Affective computing, a topic of a broad interest in computer vision society, has been no exception and has benefited from generative models. In fact, affective computing observed a rapid derivation of generative models during the last two decades. Applications of such models include but are not limited to emotion recognition and classification, unimodal emotion synthesis, and cross-modal emotion synthesis. As a result, we conducted a review of recent advances in human emotion synthesis by studying available databases, advantages, and disadvantages of the generative models along with the related training strategies considering two principal human communication…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
