It is Never Too Late to Mend: Separate Learning for Multimedia Recommendation

Zhuangzhuang He; Zihan Wang; Yonghui Yang; Haoyue Bai; Le Wu

arXiv:2406.08270·cs.IR·December 18, 2024

TL;DR

This paper introduces Separate Learning (SEA), a novel framework for multimedia recommendation that effectively learns modal-unique and modal-generic features by leveraging mutual information techniques, outperforming previous alignment-based methods.

Contribution

The paper proposes a new Separate Learning framework that addresses limitations of existing methods by explicitly learning modal-unique and modal-generic features using mutual information, with extensive experimental validation.

Findings

01. SEA outperforms existing methods on three datasets.

02. Mutual information-based learning improves feature quality.

03. The framework demonstrates strong generalization capabilities.

Abstract

Multimedia recommendation incorporates various modalities (e.g., images and text) into user or item representations to improve recommendation quality. Self-supervised learning has carried multimedia recommendation to a performance plateau, owing to its strength in aligning different modalities. However, a growing body of research finds that aligning all modal representations is suboptimal because it damages the unique attributes of each modality. These studies use subtraction and orthogonal constraints in geometric space to learn the modal-unique parts. Our rigorous analysis reveals flaws in this approach: subtraction does not necessarily yield the desired modal-unique features, and orthogonal constraints are ineffective in the high-dimensional representation spaces of users and items. To address these weaknesses, we propose Separate Learning (SEA) for…
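The abstract's claim about orthogonal constraints in high-dimensional spaces can be made more concrete with a general geometric fact: independently drawn random vectors become nearly orthogonal as the dimension grows, so a penalty that pushes two representations toward orthogonality has little left to enforce. The sketch below illustrates only that fact. It is an assumption that this is related to the authors' analysis, which is not reproduced on this page, and the function name `mean_abs_cosine` and the chosen dimensions are hypothetical.

```python
import math
import random


def mean_abs_cosine(dim, pairs=200, seed=0):
    """Average |cosine similarity| over pairs of random Gaussian vectors.

    Illustrative only: these are random vectors, not learned user or
    item representations from the paper.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(pairs):
        u = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        v = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        dot = sum(a * b for a, b in zip(u, v))
        norm_u = math.sqrt(sum(a * a for a in u))
        norm_v = math.sqrt(sum(b * b for b in v))
        total += abs(dot / (norm_u * norm_v))
    return total / pairs


# The average |cosine| shrinks as the dimension grows.
for dim in (2, 16, 64, 512):
    print(dim, round(mean_abs_cosine(dim), 3))
```

In this toy setting the average absolute cosine falls roughly in proportion to one over the square root of the dimension, so near-zero cosine similarity alone says little about whether two high-dimensional vectors carry distinct information.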

Code & Models

Repositories

bruno686/SEA (PyTorch, official)

Taxonomy

Topics: Image Retrieval and Classification Techniques · Recommender Systems and Techniques · Music and Audio Processing

Methods: ALIGN · Focus