MedBioLM: Optimizing Medical and Biological QA with Fine-Tuned Large   Language Models and Retrieval-Augmented Generation

Seonok Kim

arXiv:2502.03004·cs.CL·February 6, 2025·5 cites

MedBioLM: Optimizing Medical and Biological QA with Fine-Tuned Large Language Models and Retrieval-Augmented Generation

Seonok Kim

PDF

Open Access

TL;DR

MedBioLM is a specialized biomedical question-answering model that combines fine-tuning and retrieval-augmented generation to improve accuracy, factuality, and reasoning in medical and biological domains.

Contribution

The paper introduces MedBioLM, a novel domain-adapted biomedical LLM that integrates fine-tuning with retrieval-augmented generation for enhanced QA performance.

Findings

01

Fine-tuning improves accuracy on biomedical QA datasets.

02

Retrieval-augmented generation enhances factual consistency.

03

Model benefits medical research, education, and clinical decision support.

Abstract

Large Language Models (LLMs) have demonstrated impressive capabilities across natural language processing tasks. However, their application to specialized domains such as medicine and biology requires further optimization to ensure factual accuracy, reliability, and contextual depth. We introduce MedBioLM, a domain-adapted biomedical question-answering model designed to enhance both short-form and long-form queries. By integrating fine-tuning and retrieval-augmented generation (RAG), MedBioLM dynamically incorporates domain-specific knowledge, improving reasoning abilities and factual accuracy. To evaluate its effectiveness, we fine-tuned the model on diverse biomedical QA datasets, covering structured multiple-choice assessments and complex clinical reasoning tasks. Fine-tuning significantly improves accuracy on benchmark datasets, while RAG enhances factual consistency. These results…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Machine Learning in Bioinformatics