LPGNet: A Lightweight Network with Parallel Attention and Gated Fusion for Multimodal Emotion Recognition

Zhining He; Yang Xiao

arXiv:2508.08925·eess.AS·August 13, 2025

Open Access

TL;DR

LPGNet is a lightweight, efficient multimodal emotion recognition model that uses parallel attention and gated fusion to improve accuracy and generalization without relying on speaker information.

Contribution

The paper introduces LPGNet, a lightweight network that combines parallel attention with gated fusion to reduce both computational cost and dependence on speaker information in multimodal emotion recognition.

Findings

01 Achieves over 87% accuracy and F1-score on IEMOCAP

02 Outperforms baseline models with fewer parameters

03 Generalizes better across different speakers

Abstract

Emotion recognition in conversations (ERC) aims to predict the emotional state of each utterance by using multiple input types, such as text and audio. While Transformer-based models have shown strong performance in this task, they often face two major issues: high computational cost and heavy dependence on speaker information. These problems reduce their ability to generalize in real-world conversations. To solve these challenges, we propose LPGNet, a Lightweight network with Parallel attention and Gated fusion for multimodal ERC. The main part of LPGNet is the Lightweight Parallel Interaction Attention (LPIA) module. This module replaces traditional stacked Transformer layers with parallel dot-product attention, which can model both within-modality and between-modality relationships more efficiently. To improve emotional feature learning, LPGNet also uses a dual-gated fusion method.…
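
To make the two ideas named in the abstract more concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the function names (`attention`, `parallel_interaction`, `gated_fusion`), the choice of four branches, and the single sigmoid gate are all assumptions made for illustration, and the paper's dual-gated fusion is not reproduced because the abstract is truncated before describing it. The sketch also assumes text and audio features have already been projected to the same dimension.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of scores."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention over lists of equal-length vectors."""
    scale = math.sqrt(len(keys[0]))
    out = []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in keys]
        weights = softmax(scores)
        out.append([
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ])
    return out


def parallel_interaction(text, audio):
    """Hypothetical parallel layout: every branch reads the raw inputs,
    so no branch waits on the output of another (unlike stacked layers)."""
    t_self = attention(text, text, text)      # within-modality (text)
    a_self = attention(audio, audio, audio)   # within-modality (audio)
    t_cross = attention(text, audio, audio)   # text queries attend to audio
    a_cross = attention(audio, text, text)    # audio queries attend to text
    return t_self, a_self, t_cross, a_cross


def gated_fusion(x, y, gate_weights, gate_bias=0.0):
    """Generic single-gate fusion (an assumption, not the paper's dual gate):
    g = sigmoid(w . [x; y] + b), output = g * x + (1 - g) * y."""
    fused = []
    for xi, yi in zip(x, y):
        z = sum(w * v for w, v in zip(gate_weights, xi + yi)) + gate_bias
        g = 1.0 / (1.0 + math.exp(-z))
        fused.append([g * a + (1.0 - g) * b for a, b in zip(xi, yi)])
    return fused


# Toy features: three utterances, two dimensions per modality.
text = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
audio = [[0.5, 0.5], [0.2, 0.8], [0.9, 0.1]]
t_self, a_self, t_cross, a_cross = parallel_interaction(text, audio)
fused_text = gated_fusion(t_self, t_cross, gate_weights=[0.0] * 4)
```

With all gate weights set to zero the gate sits at 0.5, so `fused_text` is simply the average of the within-modality and between-modality branches; in a trained model the gate weights would be learned so that each utterance can favor one branch over the other.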

Taxonomy

Topics: Emotion and Mood Recognition · Sentiment Analysis and Opinion Mining · Speech Recognition and Synthesis