TL;DR
This paper introduces SMORE, a spectral graph-based multimodal recommendation model that effectively suppresses modality noise and captures universal and modality-specific preferences for improved recommendation accuracy.
Contribution
The paper proposes a novel spectrum-based fusion approach with adaptive noise filtering and a multi-modal graph learning module for enhanced multimodal recommendation.
Findings
SMORE outperforms existing models on three real-world datasets.
Spectral domain fusion effectively reduces modality noise.
The model accurately captures both universal and modality-specific preferences.
Abstract
Incorporating multi-modal features as side information has recently become a trend in recommender systems. To elucidate user-item preferences, recent studies focus on fusing modalities via concatenation, element-wise sum, or attention mechanisms. Despite having notable success, existing approaches do not account for the modality-specific noise encapsulated within each modality. As a result, direct fusion of modalities will lead to the amplification of cross-modality noise. Moreover, the variation of noise that is unique within each modality results in noise alleviation and fusion being more challenging. In this work, we propose a new Spectrum-based Modality Representation (SMORE) fusion graph recommender that aims to capture both uni-modal and fusion preferences while simultaneously suppressing modality noise. Specifically, SMORE projects the multi-modal features into the frequency…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSoftmax · Attention Is All You Need · Focus
