Improving Automatic Emotion Recognition from speech using Rhythm and   Temporal feature

Mayank Bhargava; Tim Polzehl

arXiv:1303.1761·cs.CV·March 8, 2013·30 cites

Improving Automatic Emotion Recognition from speech using Rhythm and Temporal feature

Mayank Bhargava, Tim Polzehl

PDF

Open Access

TL;DR

This paper enhances automatic emotion recognition from speech by integrating rhythm and temporal features, achieving improved accuracy over traditional methods by leveraging linguistic insights and acoustic features.

Contribution

It introduces the use of rhythm and temporal features from linguistic analysis into speech emotion recognition, combined with segmentation and feature selection techniques.

Findings

01

Achieved 80.60% recognition rate on Berlin Emotion Database

02

Demonstrated the effectiveness of rhythm and temporal features in emotion recognition

03

Improved accuracy over traditional feature-based approaches

Abstract

This paper is devoted to improve automatic emotion recognition from speech by incorporating rhythm and temporal features. Research on automatic emotion recognition so far has mostly been based on applying features like MFCCs, pitch and energy or intensity. The idea focuses on borrowing rhythm features from linguistic and phonetic analysis and applying them to the speech signal on the basis of acoustic knowledge only. In addition to this we exploit a set of temporal and loudness features. A segmentation unit is employed in starting to separate the voiced/unvoiced and silence parts and features are explored on different segments. Thereafter different classifiers are used for classification. After selecting the top features using an IGR filter we are able to achieve a recognition rate of 80.60 % on the Berlin Emotion Database for the speaker dependent framework.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEmotion and Mood Recognition · Speech and Audio Processing · Speech Recognition and Synthesis