A Tale of Two Graphs: Freezing and Denoising Graph Structures for   Multimodal Recommendation

Xin Zhou; Zhiqi Shen

arXiv:2211.06924·cs.IR·August 24, 2023

A Tale of Two Graphs: Freezing and Denoising Graph Structures for Multimodal Recommendation

Xin Zhou, Zhiqi Shen

PDF

2 Repos

TL;DR

FREEDOM is a simple, efficient multimodal recommendation model that freezes and denoises graph structures, achieving state-of-the-art results with lower memory costs by leveraging spectral graph theory and edge pruning.

Contribution

The paper introduces FREEDOM, a novel approach that freezes item-item graphs and denoises user-item interactions, simplifying and improving upon prior latent structure learning methods.

Findings

01

FREEDOM outperforms baselines by 19.07% in accuracy.

02

It reduces memory cost by up to 6 times compared to LATTICE.

03

Freezing the graph structure is competitive with learned latent structures.

Abstract

Multimodal recommender systems utilizing multimodal features (e.g., images and textual descriptions) typically show better recommendation accuracy than general recommendation models based solely on user-item interactions. Generally, prior work fuses multimodal features into item ID embeddings to enrich item representations, thus failing to capture the latent semantic item-item structures. In this context, LATTICE proposes to learn the latent structure between items explicitly and achieves state-of-the-art performance for multimodal recommendations. However, we argue the latent graph structure learning of LATTICE is both inefficient and unnecessary. Experimentally, we demonstrate that freezing its item-item structure before training can also achieve competitive performance. Based on this finding, we propose a simple yet effective model, dubbed as FREEDOM, that FREEzes the item-item graph…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsPruning