STAIR: Manipulating Collaborative and Multimodal Information for E-Commerce Recommendation
Cong Xu, Yunhang He, Jun Wang, Wei Zhang

TL;DR
This paper introduces STAIR, a novel graph convolution method that effectively combines collaborative and multimodal information for e-commerce recommendations, addressing modality erasure and forgetting issues to improve performance.
Contribution
The paper proposes STAIR, a new stepwise graph convolution approach that preserves multimodal features and mitigates forgetting, achieving state-of-the-art results in e-commerce recommendation tasks.
Findings
Achieves state-of-the-art performance on three datasets.
Effectively preserves multimodal information during training.
Requires minimal computational and memory resources.
Abstract
While the mining of modalities is the focus of most multimodal recommendation methods, we believe that how to fully utilize both collaborative and multimodal information is pivotal in e-commerce scenarios where, as clarified in this work, the user behaviors are rarely determined entirely by multimodal features. In order to combine the two distinct types of information, some additional challenges are encountered: 1) Modality erasure: Vanilla graph convolution, which proves rather useful in collaborative filtering, however erases multimodal information; 2) Modality forgetting: Multimodal information tends to be gradually forgotten as the recommendation loss essentially facilitates the learning of collaborative information. To this end, we propose a novel approach named STAIR, which employs a novel STepwise grAph convolution to enable a co-existence of collaborative and multimodal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsRecommender Systems and Techniques · Semantic Web and Ontologies · Advanced Text Analysis Techniques
MethodsFocus · Convolution
