Refining Contrastive Learning and Homography Relations for Multi-Modal Recommendation

Shouxing Ma; Yawen Zeng; Shiqing Wu; Guandong Xu

arXiv:2508.13745·cs.IR·August 20, 2025

Refining Contrastive Learning and Homography Relations for Multi-Modal Recommendation

Shouxing Ma, Yawen Zeng, Shiqing Wu, Guandong Xu

PDF

1 Models 3 Datasets

TL;DR

This paper introduces REARM, a novel framework that enhances multi-modal recommendation by refining contrastive learning with meta-networks and orthogonal constraints, and by exploring homography relations through multiple graph structures.

Contribution

The paper proposes a new framework that improves multi-modal recommendation by refining contrastive learning and exploring homography relations with multiple graphs, addressing data sparsity and feature noise.

Findings

01

REARM outperforms state-of-the-art baselines on three real-world datasets.

02

The framework effectively filters noise and retains valuable modal-specific information.

03

Visualization shows improved distinction between shared and unique modal features.

Abstract

Multi-modal recommender system focuses on utilizing rich modal information ( i.e., images and textual descriptions) of items to improve recommendation performance. The current methods have achieved remarkable success with the powerful structure modeling capability of graph neural networks. However, these methods are often hindered by sparse data in real-world scenarios. Although contrastive learning and homography ( i.e., homogeneous graphs) are employed to address the data sparsity challenge, existing methods still suffer two main limitations: 1) Simple multi-modal feature contrasts fail to produce effective representations, causing noisy modal-shared features and loss of valuable information in modal-unique features; 2) The lack of exploration of the homograph relations between user interests and item co-occurrence results in incomplete mining of user-item interplay. To address the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
MrShouxingMa/REARM
model

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.