MMREC: LLM Based Multi-Modal Recommender System
Jiahao Tian, Jinman Zhao, Zhenkai Wang, Zhicheng Ding

TL;DR
This paper introduces MMREC, a multi-modal recommender system leveraging Large Language Models to integrate text and image data, significantly improving recommendation accuracy and personalization.
Contribution
It presents a novel framework that unifies multi-modal data in a latent space using LLMs, enhancing recommendation relevance beyond previous methods.
Findings
Improved recommendation accuracy with multi-modal data
Effective integration of text and image information
Enhanced discriminative power of the model
Abstract
The importance of recommender systems is growing rapidly due to the exponential increase in the volume of content generated daily. This surge in content presents unique challenges for designing effective recommender systems. Key among these challenges is the need to effectively leverage the vast amounts of natural language data and images that represent user preferences. This paper presents a novel approach to enhancing recommender systems by leveraging Large Language Models (LLMs) and deep learning techniques. The proposed framework aims to improve the accuracy and relevance of recommendations by incorporating multi-modal information processing and by the use of unified latent space representation. The study explores the potential of LLMs to better understand and utilize natural language data in recommendation contexts, addressing the limitations of previous methods. The framework…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Web Data Mining and Analysis · Semantic Web and Ontologies
