Exploring LLM-Based Generative Recommender Systems: Corpora, Customization, and Evaluation Insights
Shuqi Yang, Mingrui Jing, Shuai Wang, Jiaqing Wang, Weijie Xing, Yan Hu, Zheng Zhu

TL;DR
This paper reviews how large language models are used in healthcare recommendation systems, highlighting data sources, customization methods, and evaluation gaps.
Contribution
The study systematically categorizes corpus sources, customization techniques, and evaluation metrics for LLM-based medical recommendation systems.
Findings
Most LLM-GRS studies use mixed data sources like clinical resources and open datasets.
Customization methods include pre-training, RAG, and fine-tuning, often combined.
Evaluation lacks standardization, with gaps in fairness and real-world validation.
Abstract
Large Language Model-Driven Generative Recommender Systems (LLM-GRSs) are increasingly transforming healthcare, particularly in question-answering systems. This study systematically reviewed their corpora sources, customization techniques, and evaluation metrics. A search of PubMed/MEDLINE, Embase, Scopus, and Web of Science identified 61 studies (2021–2024) using LLM-GRSs for medical information delivery. Corpus sources were categorized into real-world clinical resources (n = 24), literature materials (n = 34), open-source datasets (n = 33), and web-crawled data (n = 11), with 44 studies integrating multiple sources. Key model customization strategies included pre-training, prompt engineering, retrieval-augmented generation (RAG), fine-tuning, in-context learning, and offline learning. Fourteen studies used a single customization technique, while 41 studies combined these methods…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Machine Learning in Healthcare · Topic Modeling
