BioMistral-NLU: Towards More Generalizable Medical Language   Understanding through Instruction Tuning

Yujuan Velvin Fu; Giridhar Kaushik Ramachandran; Namu Park; Kevin; Lybarger; Fei Xia; Ozlem Uzuner; Meliha Yetisgen

arXiv:2410.18955·cs.CL·March 11, 2025

BioMistral-NLU: Towards More Generalizable Medical Language Understanding through Instruction Tuning

Yujuan Velvin Fu, Giridhar Kaushik Ramachandran, Namu Park, Kevin, Lybarger, Fei Xia, Ozlem Uzuner, Meliha Yetisgen

PDF

TL;DR

This paper introduces BioMistral-NLU, a fine-tuned medical language understanding model that improves zero-shot performance on diverse medical NLU tasks by using a unified prompting format and a new instruction-tuning dataset.

Contribution

It proposes a unified prompting format, curates the MNLU-Instruct dataset, and develops BioMistral-NLU, enhancing generalization in medical NLU tasks beyond existing models.

Findings

01

BioMistral-NLU outperforms original BioMistral, ChatGPT, and GPT-4 on medical NLU benchmarks.

02

Instruction tuning on diverse tasks improves zero-shot generalization.

03

Unified prompting and instruction tuning enhance model adaptability across medical NLU tasks.

Abstract

Large language models (LLMs) such as ChatGPT are fine-tuned on large and diverse instruction-following corpora, and can generalize to new tasks. However, those instruction-tuned LLMs often perform poorly in specialized medical natural language understanding (NLU) tasks that require domain knowledge, granular text comprehension, and structured data extraction. To bridge the gap, we: (1) propose a unified prompting format for 7 important NLU tasks, (2) curate an instruction-tuning dataset, MNLU-Instruct, utilizing diverse existing open-source medical NLU corpora, and (3) develop BioMistral-NLU, a generalizable medical NLU model, through fine-tuning BioMistral on MNLU-Instruct. We evaluate BioMistral-NLU in a zero-shot setting, across 6 important NLU tasks, from two widely adopted medical NLU benchmarks: BLUE and BLURB. Our experiments show that our BioMistral-NLU outperforms the original…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAttention Is All You Need · Dense Connections · Label Smoothing · Byte Pair Encoding · Layer Normalization · Residual Connection · Linear Layer · Multi-Head Attention · Softmax · Adam