Beyond Co-occurrence: Multi-modal Session-based Recommendation
Xiaokun Zhang, Bo Xu, Fenglong Ma, Chenliang Li, Liang Yang, Hongfei, Lin

TL;DR
This paper introduces MMSBR, a multi-modal session-based recommendation model that integrates descriptive and numerical information using contrastive learning, hierarchical transformers, and probabilistic modeling to improve recommendation accuracy and cold-start handling.
Contribution
It proposes a novel unified framework for modeling multi-modal information in session-based recommendation, addressing heterogeneity and probabilistic influences.
Findings
MMSBR outperforms existing methods on three real-world datasets.
The model effectively alleviates the cold-start problem.
Extensive experiments validate the approach's superiority.
Abstract
Session-based recommendation is devoted to characterizing preferences of anonymous users based on short sessions. Existing methods mostly focus on mining limited item co-occurrence patterns exposed by item ID within sessions, while ignoring what attracts users to engage with certain items is rich multi-modal information displayed on pages. Generally, the multi-modal information can be classified into two categories: descriptive information (e.g., item images and description text) and numerical information (e.g., price). In this paper, we aim to improve session-based recommendation by modeling the above multi-modal information holistically. There are mainly three issues to reveal user intent from multi-modal information: (1) How to extract relevant semantics from heterogeneous descriptive information with different noise? (2) How to fuse these heterogeneous descriptive information to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Expert finding and Q&A systems
MethodsContrastive Learning · Focus
