MedLM: Exploring Language Models for Medical Question Answering Systems
Niraj Yagnik, Jay Jhaveri, Vivek Sharma, Gabriel Pila

TL;DR
This paper investigates the effectiveness of general and medical-specific language models in answering medical questions, aiming to identify the most reliable and suitable models for medical Q&A tasks.
Contribution
It compares various general and domain-specific language models for medical question answering, highlighting the impact of fine-tuning and model selection in this domain.
Findings
Medical-specific models outperform general models in accuracy.
Fine-tuning improves model performance significantly.
Different model families show varying reliability in medical Q&A.
Abstract
In the face of rapidly expanding online medical literature, automated systems for aggregating and summarizing information are becoming increasingly crucial for healthcare professionals and patients. Large Language Models (LLMs), with their advanced generative capabilities, have shown promise in various NLP tasks, and their potential in the healthcare domain, particularly for Closed-Book Generative QnA, is significant. However, the performance of these models in domain-specific tasks such as medical Q&A remains largely unexplored. This study aims to fill this gap by comparing the performance of general and medical-specific distilled LMs for medical Q&A. We aim to evaluate the effectiveness of fine-tuning domain-specific LMs and compare the performance of different families of Language Models. The study will address critical questions about these models' reliability, comparative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education
