Exploring LLM-Based Generative Recommender Systems: Corpora, Customization, and Evaluation Insights

Shuqi Yang; Mingrui Jing; Shuai Wang; Jiaqing Wang; Weijie Xing; Yan Hu; Zheng Zhu

PMC · DOI:10.1093/geroni/igaf122.2248·December 31, 2025

Exploring LLM-Based Generative Recommender Systems: Corpora, Customization, and Evaluation Insights

Shuqi Yang, Mingrui Jing, Shuai Wang, Jiaqing Wang, Weijie Xing, Yan Hu, Zheng Zhu

PDF

Open Access

TL;DR

This paper reviews how large language models are used in healthcare recommendation systems, highlighting data sources, customization methods, and evaluation gaps.

Contribution

The study systematically categorizes corpus sources, customization techniques, and evaluation metrics for LLM-based medical recommendation systems.

Findings

01

Most LLM-GRS studies use mixed data sources like clinical resources and open datasets.

02

Customization methods include pre-training, RAG, and fine-tuning, often combined.

03

Evaluation lacks standardization, with gaps in fairness and real-world validation.

Abstract

Large Language Model-Driven Generative Recommender Systems (LLM-GRSs) are increasingly transforming healthcare, particularly in question-answering systems. This study systematically reviewed their corpora sources, customization techniques, and evaluation metrics. A search of PubMed/MEDLINE, Embase, Scopus, and Web of Science identified 61 studies (2021–2024) using LLM-GRSs for medical information delivery. Corpus sources were categorized into real-world clinical resources (n = 24), literature materials (n = 34), open-source datasets (n = 33), and web-crawled data (n = 11), with 44 studies integrating multiple sources. Key model customization strategies included pre-training, prompt engineering, retrieval-augmented generation (RAG), fine-tuning, in-context learning, and offline learning. Fourteen studies used a single customization technique, while 41 studies combined these methods…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Machine Learning in Healthcare · Topic Modeling