Multimodal Recommendation via Self-Corrective Preference Alignmen
Yalong Guan, Xiang Chen, Mingyang Wang, Xiangyu Wu, Lihao Liu, Chao Qi, Shuang Yang, Tingting Gao, Guorui Zhou, Changjian Chen

TL;DR
This paper introduces MSPA, a novel framework for personalized multimodal author recommendation in live streaming, which effectively aligns user preferences with author features to enhance accuracy and interpretability.
Contribution
The paper presents MSPA, a new multimodal recommendation framework that uses structured preference generation and self-corrective alignment to improve live streaming recommendations.
Findings
MSPA outperforms baseline models in accuracy and recall.
MSPA enhances interpretability of recommendations.
Visualizations confirm the effectiveness of preference alignment.
Abstract
With the rapid growth of live streaming platforms, personalized recommendation systems have become pivotal in improving user experience and driving platform revenue. The dynamic and multimodal nature of live streaming content (e.g., visual, audio, textual data) requires joint modeling of user behavior and multimodal features to capture evolving author characteristics. However, traditional methods relying on single-modal features or treating multimodal ones as supplementary struggle to align users' dynamic preferences with authors' multimodal attributes, limiting accuracy and interpretability. To address this, we propose MSPA (Multimodal Self-Corrective Preference Alignment), a personalized author recommendation framework with two components: (1) a Multimodal Preference Composer that uses MLLMs to generate structured preference text and embeddings from users' tipping history; and (2) a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Topic Modeling · Sentiment Analysis and Opinion Mining
