Emotion-Driven Personalized Recommendation for AI-Generated Content Using Multi-Modal Sentiment and Intent Analysis
Zheqi Hu, Xuanjing Chen, Jinlin Hu

TL;DR
This paper introduces a multi-modal emotion and intent recognition model that enhances AI-generated content recommendations by incorporating real-time emotional and intentional user states, leading to improved engagement and satisfaction.
Contribution
It presents a novel cross-modal transformer framework integrating visual, auditory, and textual data for emotion and intent recognition in personalized AIGC recommendations.
Findings
4.3% improvement in F1-score over baselines
12.3% reduction in cross-entropy loss
Increased user engagement and satisfaction
Abstract
With the rapid growth of AI-generated content (AIGC) across domains such as music, video, and literature, the demand for emotionally aware recommendation systems has become increasingly important. Traditional recommender systems primarily rely on user behavioral data such as clicks, views, or ratings, while neglecting users' real-time emotional and intentional states during content interaction. To address this limitation, this study proposes a Multi-Modal Emotion and Intent Recognition Model (MMEI) based on a BERT-based Cross-Modal Transformer with Attention-Based Fusion, integrated into a cloud-native personalized AIGC recommendation framework. The proposed system jointly processes visual (facial expression), auditory (speech tone), and textual (comments or utterances) modalities through pretrained encoders ViT, Wav2Vec2, and BERT, followed by an attention-based fusion module to learn…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Recommender Systems and Techniques · Sentiment Analysis and Opinion Mining
