DiffCL: A Diffusion-Based Contrastive Learning Framework with Semantic Alignment for Multimodal Recommendations
Qiya Song, Jiajun Hu, Lin Xiao, Bin Sun, Xieping Gao, Shutao Li

TL;DR
DiffCL is a novel diffusion-based contrastive learning framework that enhances multimodal recommendation accuracy by reducing noise, aligning semantic information across modalities, and addressing data sparsity through graph-based features.
Contribution
The paper introduces a diffusion model for contrastive views, semantic alignment via stable ID embeddings, and an item-item graph to improve multimodal recommendation systems.
Findings
DiffCL outperforms existing methods on three public datasets.
The framework effectively reduces noise impact during contrastive learning.
Semantic alignment improves cross-modal consistency and recommendation accuracy.
Abstract
Multimodal recommendation systems integrate diverse multimodal information into the feature representations of both items and users, thereby enabling a more comprehensive modeling of user preferences. However, existing methods are hindered by data sparsity and the inherent noise within multimodal data, which impedes the accurate capture of users' interest preferences. Additionally, discrepancies in the semantic representations of items across different modalities can adversely impact the prediction accuracy of recommendation models. To address these challenges, we introduce a novel diffusion-based contrastive learning framework (DiffCL) for multimodal recommendation. DiffCL employs a diffusion model to generate contrastive views that effectively mitigate the impact of noise during the contrastive learning phase. Furthermore, it improves semantic consistency across modalities by aligning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
