Adaptive Multi-Modalities Fusion in Sequential Recommendation Systems
Hengchang Hu, Wei Guo, Yong Liu, Min-Yen Kan

TL;DR
This paper introduces MMSR, a graph-based adaptive fusion method for sequential recommendation that dynamically balances intra- and inter-modality relationships, improving performance across multiple datasets.
Contribution
It proposes a novel graph-based approach with adaptive fusion order and dual attention, effectively handling missing modalities and outperforming existing models.
Findings
MMSR outperforms state-of-the-art models on six datasets.
Graph propagation methods surpass other graph neural networks.
The adaptive fusion approach effectively manages missing modalities.
Abstract
In sequential recommendation, multi-modal information (e.g., text or image) can provide a more comprehensive view of an item's profile. The optimal stage (early or late) to fuse modality features into item representations is still debated. We propose a graph-based approach (named MMSR) to fuse modality features in an adaptive order, enabling each modality to prioritize either its inherent sequential nature or its interplay with other modalities. MMSR represents each user's history as a graph, where the modality features of each item in a user's history sequence are denoted by cross-linked nodes. The edges between homogeneous nodes represent intra-modality sequential relationships, and the ones between heterogeneous nodes represent inter-modality interdependence relationships. During graph propagation, MMSR incorporates dual attention, differentiating homogeneous and heterogeneous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Advanced Graph Neural Networks · Topic Modeling
