Federated Dialogue-Semantic Diffusion for Emotion Recognition under Incomplete Modalities

Xihang Qiu; Jiarong Cheng; Yuhao Fang; Wanpeng Zhang; Yao Lu; Ye Zhang; Chun Li

arXiv:2511.00344·cs.CV·November 4, 2025

Federated Dialogue-Semantic Diffusion for Emotion Recognition under Incomplete Modalities

Xihang Qiu, Jiarong Cheng, Yuhao Fang, Wanpeng Zhang, Yao Lu, Ye Zhang, Chun Li

PDF

Open Access

TL;DR

This paper introduces FedDISC, a federated learning framework for multimodal emotion recognition that effectively handles missing modalities by integrating diffusion models, dialogue context, and semantic consistency, outperforming existing methods.

Contribution

The paper presents a novel federated diffusion-based framework for missing-modality recovery in emotion recognition, incorporating dialogue context and semantic alignment, with a new alternating training strategy.

Findings

01

FedDISC outperforms existing methods on IEMOCAP, CMUMOSI, and CMUMOSEI datasets.

02

The framework effectively handles diverse missing modality patterns.

03

Semantic consistency is maintained through dialogue-aware diffusion models.

Abstract

Multimodal Emotion Recognition in Conversations (MERC) enhances emotional understanding through the fusion of multimodal signals. However, unpredictable modality absence in real-world scenarios significantly degrades the performance of existing methods. Conventional missing-modality recovery approaches, which depend on training with complete multimodal data, often suffer from semantic distortion under extreme data distributions, such as fixed-modality absence. To address this, we propose the Federated Dialogue-guided and Semantic-Consistent Diffusion (FedDISC) framework, pioneering the integration of federated learning into missing-modality recovery. By federated aggregation of modality-specific diffusion models trained on clients and broadcasting them to clients missing corresponding modalities, FedDISC overcomes single-client reliance on modality completeness. Additionally, the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEmotion and Mood Recognition · Sentiment Analysis and Opinion Mining · Speech and dialogue systems