MMREC: LLM Based Multi-Modal Recommender System

Jiahao Tian; Jinman Zhao; Zhenkai Wang; Zhicheng Ding

arXiv:2408.04211·cs.CL·June 12, 2025

MMREC: LLM Based Multi-Modal Recommender System

Jiahao Tian, Jinman Zhao, Zhenkai Wang, Zhicheng Ding

PDF

Open Access

TL;DR

This paper introduces MMREC, a multi-modal recommender system leveraging Large Language Models to integrate text and image data, significantly improving recommendation accuracy and personalization.

Contribution

It presents a novel framework that unifies multi-modal data in a latent space using LLMs, enhancing recommendation relevance beyond previous methods.

Findings

01

Improved recommendation accuracy with multi-modal data

02

Effective integration of text and image information

03

Enhanced discriminative power of the model

Abstract

The importance of recommender systems is growing rapidly due to the exponential increase in the volume of content generated daily. This surge in content presents unique challenges for designing effective recommender systems. Key among these challenges is the need to effectively leverage the vast amounts of natural language data and images that represent user preferences. This paper presents a novel approach to enhancing recommender systems by leveraging Large Language Models (LLMs) and deep learning techniques. The proposed framework aims to improve the accuracy and relevance of recommendations by incorporating multi-modal information processing and by the use of unified latent space representation. The study explores the potential of LLMs to better understand and utilize natural language data in recommendation contexts, addressing the limitations of previous methods. The framework…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Web Data Mining and Analysis · Semantic Web and Ontologies