Large Language Models as Narrative-Driven Recommenders
Lukas Eberhard, Thorsten Ruprechter, Denis Helic

TL;DR
This paper evaluates the effectiveness of large language models in providing personalized, narrative-driven movie recommendations, showing they outperform traditional methods and that simple prompting strategies are often sufficient.
Contribution
It systematically compares 38 LLMs in a movie recommendation task, highlighting their strengths, limitations, and the impact of different prompting strategies.
Findings
LLMs generate contextually relevant recommendations.
Large models outperform traditional approaches like doc2vec.
Simple zero-shot prompting is often as effective as more complex methods.
Abstract
Narrative-driven recommenders aim to provide personalized suggestions for user requests expressed in free-form text such as "I want to watch a thriller with a mind-bending story, like Shutter Island." Although large language models (LLMs) have been shown to excel in processing general natural language queries, their effectiveness for handling such recommendation requests remains relatively unexplored. To close this gap, we compare the performance of 38 open- and closed-source LLMs of various sizes, such as LLama 3.2 and GPT-4o, in a movie recommendation setting. For this, we utilize a gold-standard, crowdworker-annotated dataset of posts from reddit's movie suggestion community and employ various prompting strategies, including zero-shot, identity, and few-shot prompting. Our findings demonstrate the ability of LLMs to generate contextually relevant movie recommendations, significantly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
MethodsLLaMA
