Multimodal Point-of-Interest Recommendation

Yuta Kanzawa; Toyotaro Suzumura; Hiroki Kanezashi; Jiawei Yong,; Shintaro Fukushima

arXiv:2410.03265·cs.IR·October 8, 2024

Multimodal Point-of-Interest Recommendation

Yuta Kanzawa, Toyotaro Suzumura, Hiroki Kanezashi, Jiawei Yong,, Shintaro Fukushima

PDF

Open Access

TL;DR

This paper explores multimodal data for restaurant point-of-interest recommendation, demonstrating that incorporating image descriptions improves model performance and better reflects human decision-making behaviors.

Contribution

It introduces a semi-multimodal recommendation framework combining text and image data, advancing point-of-interest recommendation research.

Findings

01

Semi-multimodal model outperforms text-only models

02

Image descriptions enhance recommendation accuracy

03

Model better reflects human decision processes

Abstract

Large Language Models are applied to recommendation tasks such as items to buy and news articles to read. Point of Interest is quite a new area to sequential recommendation based on language representations of multimodal datasets. As a first step to prove our concepts, we focused on restaurant recommendation based on each user's past visit history. When choosing a next restaurant to visit, a user would consider genre and location of the venue and, if available, pictures of dishes served there. We created a pseudo restaurant check-in history dataset from the Foursquare dataset and the FoodX-251 dataset by converting pictures into text descriptions with a multimodal model called LLaVA, and used a language-based sequential recommendation framework named Recformer proposed in 2023. A model trained on this semi-multimodal dataset has outperformed another model trained on the same dataset…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Topic Modeling · Image Retrieval and Classification Techniques