It is Never Too Late to Mend: Separate Learning for Multimedia Recommendation
Zhuangzhuang He, Zihan Wang, Yonghui Yang, Haoyue Bai, Le Wu

TL;DR
This paper introduces Separate Learning (SEA), a novel framework for multimedia recommendation that effectively learns modal-unique and modal-generic features by leveraging mutual information techniques, outperforming previous alignment-based methods.
Contribution
The paper proposes a new Separate Learning framework that addresses limitations of existing methods by explicitly learning modal-unique and modal-generic features using mutual information, with extensive experimental validation.
Findings
SEA outperforms existing methods on three datasets.
Mutual information-based learning improves feature quality.
The framework demonstrates strong generalization capabilities.
Abstract
Multimedia recommendation, which incorporates various modalities (e.g., images, texts, etc.) into user or item representation to improve recommendation quality, and self-supervised learning carries multimedia recommendation to a plateau of performance, because of its superior performance in aligning different modalities. However, more and more research finds that aligning all modal representations is suboptimal because it damages the unique attributes of each modal. These studies use subtraction and orthogonal constraints in geometric space to learn unique parts. However, our rigorous analysis reveals the flaws in this approach, such as that subtraction does not necessarily yield the desired modal-unique and that orthogonal constraints are ineffective in user and item high-dimensional representation spaces. To make up for the previous weaknesses, we propose Separate Learning (SEA) for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Recommender Systems and Techniques · Music and Audio Processing
MethodsALIGN · Focus
