A Siamese Neural Network with Modified Distance Loss For Transfer Learning in Speech Emotion Recognition
Kexin Feng, Theodora Chaspari

TL;DR
This paper introduces a modified distance loss for Siamese neural networks to improve transfer learning in speech emotion recognition, effectively leveraging source data and pairwise sample differences to enhance model fine-tuning.
Contribution
It proposes a novel distance loss function for Siamese networks and demonstrates its effectiveness in transfer learning for speech emotion recognition.
Findings
Distance loss improves Siamese network fine-tuning.
Source data selection impacts performance more than layer freezing.
Siamese networks show promise in transfer learning for emotion recognition.
Abstract
Automatic emotion recognition plays a significant role in the process of human computer interaction and the design of Internet of Things (IOT) technologies. Yet, a common problem in emotion recognition systems lies in the scarcity of reliable labels. By modeling pairwise differences between samples of interest, a Siamese network can help to mitigate this challenge since it requires fewer samples than traditional deep learning methods. In this paper, we propose a distance loss, which can be applied on the Siamese network fine-tuning, by optimizing the model based on the relevant distance between same and difference class pairs. Our system use samples from the source data to pre-train the weights of proposed Siamese neural network, which are fine-tuned based on the target data. We present an emotion recognition task that uses speech, since it is one of the most ubiquitous and frequently…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Speech and Audio Processing · Music and Audio Processing
MethodsSiamese Network
