Robust Multimodal Recommendation via Graph Retrieval-Enhanced Modality Completion

Yuan Li; Jun Hu; Jiaxin Jiang; Bryan Hooi; Bingsheng He

arXiv:2605.00670·cs.IR·May 4, 2026

Robust Multimodal Recommendation via Graph Retrieval-Enhanced Modality Completion

Yuan Li, Jun Hu, Jiaxin Jiang, Bryan Hooi, Bingsheng He

PDF

TL;DR

This paper introduces GRE-MC, a novel framework that enhances multimodal recommendation by retrieving relevant subgraphs and using graph transformers to improve modality completion, especially under incomplete data conditions.

Contribution

GRE-MC leverages a modality-aware subgraph retrieval and joint graph transformer encoding to improve robustness and accuracy in multimodal recommendation with incomplete data.

Findings

01

GRE-MC outperforms existing methods on benchmark datasets.

02

The subgraph retrieval mechanism provides richer context for modality completion.

03

Joint encoding with graph transformers enhances robustness against missing modalities.

Abstract

Multimodal data plays a critical role in web-based recommendation systems, where information from diverse modalities such as vision and text enhances representation learning. However, real-world multimodal datasets often suffer from modality incompleteness due to sensor failures, annotation scarcity, or privacy constraints, which substantially degrade model performance and reliability. One effective solution to address this issue is modality completion, which reconstructs missing features to provide modality-complete graphs for downstream tasks. Given a query node with missing multimodal features, existing modality completion methods typically infer information from the node itself or its neighbors to reconstruct the missing modality. However, these methods may overlook semantically relevant context in the graph, which contains valuable cues that are non-trivial to capture through…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.