Fine-tuning Wav2vec for Vocal-burst Emotion Recognition

Dang-Khanh Nguyen; Sudarshan Pant; Ngoc-Huynh Ho; Guee-Sang Lee,; Soo-Huyng Kim; Hyung-Jeong Yang

arXiv:2210.00263·eess.AS·October 4, 2022

Fine-tuning Wav2vec for Vocal-burst Emotion Recognition

Dang-Khanh Nguyen, Sudarshan Pant, Ngoc-Huynh Ho, Guee-Sang Lee,, Soo-Huyng Kim, Hyung-Jeong Yang

PDF

Open Access

TL;DR

This paper explores fine-tuning Wav2vec for recognizing emotions from non-verbal vocal bursts like laughs and cries, demonstrating promising results in a new affective computing challenge.

Contribution

It introduces a fine-tuning approach of Wav2vec for vocal-burst emotion recognition, a novel application in affective computing.

Findings

01

Achieved promising results compared to baseline models

02

Demonstrated effectiveness of fine-tuned Wav2vec for emotion recognition

03

Contributed to the new A-VB competition tasks

Abstract

The ACII Affective Vocal Bursts (A-VB) competition introduces a new topic in affective computing, which is understanding emotional expression using the non-verbal sound of humans. We are familiar with emotion recognition via verbal vocal or facial expression. However, the vocal bursts such as laughs, cries, and signs, are not exploited even though they are very informative for behavior analysis. The A-VB competition comprises four tasks that explore non-verbal information in different spaces. This technical report describes the method and the result of SclabCNU Team for the tasks of the challenge. We achieved promising results compared to the baseline model provided by the organizers.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEmotion and Mood Recognition