Pareto Invariant Representation Learning for Multimedia Recommendation
Shanshan Huang, Haoxuan Li, Qingsong Li, Chunyuan Zheng, Li Liu

TL;DR
This paper introduces PaInvRL, a novel framework for multimedia recommendation that learns invariant and variant representations to improve generalization across different data distributions, addressing spurious correlations.
Contribution
It proposes a Pareto multi-objective optimization approach to balance IID and OOD generalization in multimedia recommendation models.
Findings
PaInvRL outperforms state-of-the-art models on multiple datasets.
It effectively mitigates spurious correlations in multimedia data.
The framework improves both within- and cross-environment recommendation accuracy.
Abstract
Multimedia recommendation involves personalized ranking tasks, where multimedia content is usually represented using a generic encoder. However, these generic representations introduce spurious correlations that fail to reveal users' true preferences. Existing works attempt to alleviate this problem by learning invariant representations, but overlook the balance between independent and identically distributed (IID) and out-of-distribution (OOD) generalization. In this paper, we propose a framework called Pareto Invariant Representation Learning (PaInvRL) to mitigate the impact of spurious correlations from an IID-OOD multi-objective optimization perspective, by learning invariant representations (intrinsic factors that attract user attention) and variant representations (other factors) simultaneously. Specifically, PaInvRL includes three iteratively executed modules: (i) heterogeneous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methodsfail
