A Review on Scientific Knowledge Extraction using Large Language Models in Biomedical Sciences
Gabriel Lino Garcia, Jo\~ao Renato Ribeiro Manesco, Pedro Henrique, Paiola, Lucas Miranda, Maria Paola de Salvo, Jo\~ao Paulo Papa

TL;DR
This review discusses how large language models are transforming biomedical knowledge extraction, highlighting their potential, current challenges, and future research directions to improve healthcare information access.
Contribution
It provides a comprehensive overview of LLM applications in biomedical knowledge extraction, identifying gaps and proposing integration of advanced techniques like RAG.
Findings
LLMs show promise in automating biomedical data extraction.
Challenges include hallucinations and limited generalization.
Future directions involve standardizing benchmarks and improving model reliability.
Abstract
The rapid advancement of large language models (LLMs) has opened new boundaries in the extraction and synthesis of medical knowledge, particularly within evidence synthesis. This paper reviews the state-of-the-art applications of LLMs in the biomedical domain, exploring their effectiveness in automating complex tasks such as evidence synthesis and data extraction from a biomedical corpus of documents. While LLMs demonstrate remarkable potential, significant challenges remain, including issues related to hallucinations, contextual understanding, and the ability to generalize across diverse medical tasks. We highlight critical gaps in the current research literature, particularly the need for unified benchmarks to standardize evaluations and ensure reliability in real-world applications. In addition, we propose directions for future research, emphasizing the integration of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Biomedical Text Mining and Ontologies
