CM$^3$: Calibrating Multimodal Recommendation
Xin Zhou, Yongjie Wang, Zhiqi Shen

TL;DR
This paper introduces CM$^3$, a novel calibration method for multimodal recommender systems that leverages item similarity to improve embedding alignment and uniformity, resulting in enhanced recommendation performance.
Contribution
The paper proposes a calibrated uniformity loss based on multimodal item similarity and a Spherical Bézier fusion method for better feature integration in multimodal recommenders.
Findings
Achieved up to 5.4% NDCG@20 improvement on real datasets.
Demonstrated the effectiveness of calibrated uniformity in balancing alignment and uniformity.
Validated the approach across five real-world datasets.
Abstract
Alignment and uniformity are fundamental principles within the domain of contrastive learning. In recommender systems, prior work has established that optimizing the Bayesian Personalized Ranking (BPR) loss contributes to the objectives of alignment and uniformity. Specifically, alignment aims to draw together the representations of interacting users and items, while uniformity mandates a uniform distribution of user and item embeddings across a unit hypersphere. This study revisits the alignment and uniformity properties within the context of multimodal recommender systems, revealing a proclivity among extant models to prioritize uniformity to the detriment of alignment. Our hypothesis challenges the conventional assumption of equitable item treatment through a uniformity loss, proposing a more nuanced approach wherein items with similar multimodal attributes converge toward proximal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Sentiment Analysis and Opinion Mining · Emotion and Mood Recognition
