Multi-modal Food Recommendation using Clustering and Self-supervised Learning
Yixin Zhang, Xin Zhou, Qianwen Meng, Fanglin Zhu, Yonghui, Xu, Zhiqi Shen, Lizhen Cui

TL;DR
This paper introduces CLUSSL, a novel multi-modal food recommendation framework that uses clustering and self-supervised learning to improve recommendation accuracy by capturing semantic relations between recipes.
Contribution
The paper proposes CLUSSL, which employs modality-specific graphs and self-supervised learning to better model semantic relations in multi-modal food recommendation systems.
Findings
CLUSSL outperforms state-of-the-art benchmarks in experiments.
Graph-based representations enhance semantic understanding of recipes.
Self-supervised learning fosters independence between modality-specific features.
Abstract
Food recommendation systems serve as pivotal components in the realm of digital lifestyle services, designed to assist users in discovering recipes and food items that resonate with their unique dietary predilections. Typically, multi-modal descriptions offer an exhaustive profile for each recipe, thereby ensuring recommendations that are both personalized and accurate. Our preliminary investigation of two datasets indicates that pre-trained multi-modal dense representations might precipitate a deterioration in performance compared to ID features when encapsulating interactive relationships. This observation implies that ID features possess a relative superiority in modeling interactive collaborative signals. Consequently, contemporary cutting-edge methodologies augment ID features with multi-modal information as supplementary features, overlooking the latent semantic relations between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Advanced Text Analysis Techniques · Technology and Data Analysis
