TL;DR
This paper introduces JBM-Diff, a diffusion model for multimodal recommendation that denoises features and feedback to improve ranking accuracy, addressing redundant information and feedback bias.
Contribution
It proposes a joint diffusion approach conditioned on collaborative features and modal consistency, enhancing multimodal feature alignment and data credibility detection.
Findings
Effective removal of irrelevant multimodal information.
Improved recommendation accuracy demonstrated on three datasets.
Enhanced data augmentation through sample pair credibility assessment.
Abstract
In recent years, multimodal recommendation has received significant attention and achieved remarkable success in GCN-based recommendation methods. However, there are two key challenges here: (1) There is a significant amount of redundant information in multimodal features that is unrelated to user preferences. Directly injecting multimodal features into the interaction graph can affect the collaborative feature learning between users and items. (2) There are false negative and false positive behaviors caused by system errors such as accidental clicks and non-exposure. This feedback bias can affect the ranking accuracy of training sample pairs, thereby reducing the recommendation accuracy of the model. To address these challenges, this work proposes a Joint Behavior-guided and Modal-consistent Conditional Graph Diffusion Model (JBM-Diff) for joint denoising of multimodal features and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
