TL;DR
This paper introduces DIMO, a novel framework that disentangles ID and modality effects in session-based recommendation to improve accuracy and explainability, using causal inference and self-supervised learning.
Contribution
DIMO explicitly separates ID and modality effects at item and session levels, enabling more accurate and explainable recommendations without supervised signals.
Findings
DIMO outperforms existing methods on multiple datasets.
DIMO effectively generates explanations for recommendations.
Disentanglement improves recommendation accuracy and interpretability.
Abstract
Session-based recommendation aims to predict intents of anonymous users based on their limited behaviors. Modeling user behaviors involves two distinct rationales: co-occurrence patterns reflected by item IDs, and fine-grained preferences represented by item modalities (e.g., text and images). However, existing methods typically entangle these causes, leading to their failure in achieving accurate and explainable recommendations. To this end, we propose a novel framework DIMO to disentangle the effects of ID and modality in the task. At the item level, we introduce a co-occurrence representation schema to explicitly incorporate cooccurrence patterns into ID representations. Simultaneously, DIMO aligns different modalities into a unified semantic space to represent them uniformly. At the session level, we present a multi-view self-supervised disentanglement, including proxy mechanism and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsCausal inference
