MedLM: Exploring Language Models for Medical Question Answering Systems

Niraj Yagnik; Jay Jhaveri; Vivek Sharma; Gabriel Pila

arXiv:2401.11389·cs.CL·March 7, 2024·2 cites

MedLM: Exploring Language Models for Medical Question Answering Systems

Niraj Yagnik, Jay Jhaveri, Vivek Sharma, Gabriel Pila

PDF

Open Access 1 Repo

TL;DR

This paper investigates the effectiveness of general and medical-specific language models in answering medical questions, aiming to identify the most reliable and suitable models for medical Q&A tasks.

Contribution

It compares various general and domain-specific language models for medical question answering, highlighting the impact of fine-tuning and model selection in this domain.

Findings

01

Medical-specific models outperform general models in accuracy.

02

Fine-tuning improves model performance significantly.

03

Different model families show varying reliability in medical Q&A.

Abstract

In the face of rapidly expanding online medical literature, automated systems for aggregating and summarizing information are becoming increasingly crucial for healthcare professionals and patients. Large Language Models (LLMs), with their advanced generative capabilities, have shown promise in various NLP tasks, and their potential in the healthcare domain, particularly for Closed-Book Generative QnA, is significant. However, the performance of these models in domain-specific tasks such as medical Q&A remains largely unexplored. This study aims to fill this gap by comparing the performance of general and medical-specific distilled LMs for medical Q&A. We aim to evaluate the effectiveness of fine-tuning domain-specific LMs and compare the performance of different families of Language Models. The study will address critical questions about these models' reliability, comparative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jayjhaveri1906/cse291_medlm
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education