A Simple Attention-Based Mechanism for Bimodal Emotion Classification
Mazen Elabd, Sardar Jaf

TL;DR
This paper introduces a novel bimodal deep learning architecture with attention mechanisms that combines text and speech data to improve emotion classification accuracy, outperforming existing systems.
Contribution
The paper presents a new attention-based bimodal deep learning architecture trained on text and speech data for emotion classification, demonstrating superior performance.
Findings
Bimodal architectures outperform unimodal ones in emotion classification.
Attention mechanisms enhance model performance.
Proposed architecture surpasses several state-of-the-art systems.
Abstract
Big data contain rich information for machine learning algorithms to utilize when learning important features during classification tasks. Human beings express their emotion using certain words, speech (tone, pitch, speed) or facial expression. Artificial Intelligence approach to emotion classification are largely based on learning from textual information. However, public datasets containing text and speech data provide sufficient resources to train machine learning algorithms for the tack of emotion classification. In this paper, we present novel bimodal deep learning-based architectures enhanced with attention mechanism trained and tested on text and speech data for emotion classification. We report details of different deep learning based architectures and show the performance of each architecture including rigorous error analyses. Our finding suggests that deep learning based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition
MethodsSoftmax · Attention Is All You Need
