CM$^3$: Calibrating Multimodal Recommendation

Xin Zhou; Yongjie Wang; Zhiqi Shen

arXiv:2508.01226 · cs.IR · August 5, 2025

TL;DR

This paper introduces CM$^3$, a novel calibration method for multimodal recommender systems that leverages item similarity to improve embedding alignment and uniformity, resulting in enhanced recommendation performance.

Contribution

The paper proposes a calibrated uniformity loss based on multimodal item similarity and a Spherical Bézier fusion method for better feature integration in multimodal recommenders.
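To give a feel for what a Bézier-style fusion on the sphere can look like, the following is a minimal sketch of a generic spherical Bézier curve, built by running De Casteljau's algorithm with spherical linear interpolation (slerp) in place of straight-line interpolation. It is not taken from the paper and may differ from the authors' formulation; the function names `slerp` and `spherical_bezier`, the treatment of each modality's feature vector as a control point, and the mixing parameter `t` are all assumptions made for illustration.

```python
import numpy as np


def slerp(p, q, t, eps=1e-8):
    """Spherical linear interpolation between two vectors, after normalizing them."""
    p = p / np.linalg.norm(p)
    q = q / np.linalg.norm(q)
    omega = np.arccos(np.clip(np.dot(p, q), -1.0, 1.0))
    s = np.sin(omega)
    if s < eps:
        # Degenerate case: the points coincide or are antipodal, so the
        # great-circle path is trivial or not unique. Returning p is an
        # arbitrary choice made for this sketch.
        return p
    return (np.sin((1.0 - t) * omega) / s) * p + (np.sin(t * omega) / s) * q


def spherical_bezier(control_points, t):
    """De Casteljau's algorithm with slerp, so the result stays on the unit sphere."""
    pts = [np.asarray(p, dtype=float) for p in control_points]
    if not pts:
        raise ValueError("at least one control point is required")
    # Repeatedly interpolate neighbouring points until one point remains.
    while len(pts) > 1:
        pts = [slerp(a, b, t) for a, b in zip(pts[:-1], pts[1:])]
    return pts[0] / np.linalg.norm(pts[0])


if __name__ == "__main__":
    # Hypothetical 3-d "modality" features used as control points.
    visual = [1.0, 0.0, 0.0]
    textual = [0.0, 1.0, 0.0]
    other = [0.0, 0.0, 1.0]
    fused = spherical_bezier([visual, textual, other], t=0.5)
    print(fused, np.linalg.norm(fused))
```

The point of the construction is that every intermediate result has unit norm, so the fused vector lives on the same hypersphere on which alignment and uniformity are measured, which a plain weighted sum does not guarantee.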

Findings

1. Achieved up to 5.4% NDCG@20 improvement on real datasets.
2. Demonstrated the effectiveness of calibrated uniformity in balancing alignment and uniformity.
3. Validated the approach across five real-world datasets.

Abstract

Alignment and uniformity are fundamental principles within the domain of contrastive learning. In recommender systems, prior work has established that optimizing the Bayesian Personalized Ranking (BPR) loss contributes to the objectives of alignment and uniformity. Specifically, alignment aims to draw together the representations of interacting users and items, while uniformity mandates a uniform distribution of user and item embeddings across a unit hypersphere. This study revisits the alignment and uniformity properties within the context of multimodal recommender systems, revealing a proclivity among extant models to prioritize uniformity to the detriment of alignment. Our hypothesis challenges the conventional assumption of equitable item treatment through a uniformity loss, proposing a more nuanced approach wherein items with similar multimodal attributes converge toward proximal…
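The two properties named in the abstract can be made concrete with the standard alignment and uniformity losses from the contrastive learning literature. The sketch below shows those standard, uncalibrated forms only; it does not implement the calibrated uniformity loss this paper proposes. The function names, the exponent `alpha`, and the temperature `t` are assumptions chosen for illustration.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Project each row onto the unit hypersphere."""
    norms = np.linalg.norm(x, axis=1, keepdims=True)
    return x / np.maximum(norms, eps)


def alignment_loss(user_emb, item_emb, alpha=2):
    """Mean distance (raised to alpha) between paired user and item embeddings.

    Row i of user_emb and row i of item_emb are assumed to be an observed
    interaction. Lower values mean interacting pairs sit closer together.
    """
    u = l2_normalize(np.asarray(user_emb, dtype=float))
    v = l2_normalize(np.asarray(item_emb, dtype=float))
    return float(np.mean(np.linalg.norm(u - v, axis=1) ** alpha))


def uniformity_loss(emb, t=2.0):
    """Log of the mean Gaussian potential over all distinct pairs.

    Lower (more negative) values mean the embeddings are spread more
    evenly over the hypersphere. Every pair is treated identically.
    """
    z = l2_normalize(np.asarray(emb, dtype=float))
    # For unit vectors, squared Euclidean distance equals 2 - 2 * cosine.
    sq_dist = np.clip(2.0 - 2.0 * (z @ z.T), 0.0, None)
    upper = np.triu_indices(len(z), k=1)  # each unordered pair once
    return float(np.log(np.mean(np.exp(-t * sq_dist[upper]))))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    users = rng.normal(size=(8, 16))
    items = users + 0.1 * rng.normal(size=(8, 16))  # near-aligned pairs
    print("alignment:", alignment_loss(users, items))
    print("uniformity (items):", uniformity_loss(items))
```

Note that `uniformity_loss` pushes every pair of items apart with the same strength, regardless of how similar their multimodal content is. That equal treatment of items is the assumption the abstract says the paper challenges.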

Taxonomy

Topics: Recommender Systems and Techniques · Sentiment Analysis and Opinion Mining · Emotion and Mood Recognition