Pareto Invariant Representation Learning for Multimedia Recommendation

Shanshan Huang; Haoxuan Li; Qingsong Li; Chunyuan Zheng; Li Liu

arXiv:2308.04706·cs.IR·August 25, 2023

Pareto Invariant Representation Learning for Multimedia Recommendation

Shanshan Huang, Haoxuan Li, Qingsong Li, Chunyuan Zheng, Li Liu

PDF

TL;DR

This paper introduces PaInvRL, a novel framework for multimedia recommendation that learns invariant and variant representations to improve generalization across different data distributions, addressing spurious correlations.

Contribution

It proposes a Pareto multi-objective optimization approach to balance IID and OOD generalization in multimedia recommendation models.

Findings

01

PaInvRL outperforms state-of-the-art models on multiple datasets.

02

It effectively mitigates spurious correlations in multimedia data.

03

The framework improves both within- and cross-environment recommendation accuracy.

Abstract

Multimedia recommendation involves personalized ranking tasks, where multimedia content is usually represented using a generic encoder. However, these generic representations introduce spurious correlations that fail to reveal users' true preferences. Existing works attempt to alleviate this problem by learning invariant representations, but overlook the balance between independent and identically distributed (IID) and out-of-distribution (OOD) generalization. In this paper, we propose a framework called Pareto Invariant Representation Learning (PaInvRL) to mitigate the impact of spurious correlations from an IID-OOD multi-objective optimization perspective, by learning invariant representations (intrinsic factors that attract user attention) and variant representations (other factors) simultaneously. Specifically, PaInvRL includes three iteratively executed modules: (i) heterogeneous…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Methodsfail