Towards Trustworthy Multimodal Recommendation
Zixuan Li

TL;DR
This paper introduces a plug-and-play modality rectification method to improve trustworthiness and robustness of multimodal recommendation systems against unreliable content and noisy signals, validated through extensive experiments.
Contribution
It proposes a novel soft matching rectification component for mitigating untrustworthy modality features in multimodal recommenders, without requiring architectural changes.
Findings
Rectification improves robustness against modality corruption.
Pseudo interactions can either help or hurt performance depending on context.
Propagation-graph pseudo edges influence robustness through message passing.
Abstract
Recent advances in multimodal recommendation have demonstrated the effectiveness of incorporating visual and textual content into collaborative filtering. However, real-world deployments raise an increasingly important yet underexplored issue: trustworthiness. On modern e-commerce platforms, multimodal content can be misleading or unreliable (e.g., visually inconsistent product images or click-bait titles), injecting untrustworthy signals into multimodal representations and making existing recommenders brittle under modality corruption. In this work, we take a step towards trustworthy multimodal recommendation from both a method and an analysis perspective. First, we propose a plug-and-play modality-level rectification component that mitigates untrustworthy modality features by learning soft correspondences between items and multimodal features. Using lightweight projections and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Multimodal Machine Learning Applications · Advanced Graph Neural Networks
