Towards Democratizing Multilingual Large Language Models For Medicine   Through A Two-Stage Instruction Fine-tuning Approach

Meng Zhou; Surajsinh Parmar; Anubhav Bhatti

arXiv:2409.05732·cs.CL·September 10, 2024

Towards Democratizing Multilingual Large Language Models For Medicine Through A Two-Stage Instruction Fine-tuning Approach

Meng Zhou, Surajsinh Parmar, Anubhav Bhatti

PDF

Open Access

TL;DR

This paper presents a two-stage instruction fine-tuning method for multilingual medical large language models, using high-quality datasets to improve performance across languages while maintaining computational efficiency.

Contribution

It introduces two new multilingual medical datasets and a two-stage training paradigm to enhance LLMs for healthcare across multiple languages.

Findings

01

Achieves competitive results on multilingual benchmarks.

02

Balances performance with computational efficiency.

03

Provides publicly available datasets and model weights.

Abstract

Open-source, multilingual medical large language models (LLMs) have the potential to serve linguistically diverse populations across different regions. Adapting generic LLMs for healthcare often requires continual pretraining, but this approach is computationally expensive and sometimes impractical. Instruction fine-tuning on a specific task may not always guarantee optimal performance due to the lack of broader domain knowledge that the model needs to understand and reason effectively in diverse scenarios. To address these challenges, we introduce two multilingual instruction fine-tuning datasets, MMed-IFT and MMed-IFT-MC, containing over 200k high-quality medical samples in six languages. We propose a two-stage training paradigm: the first stage injects general medical knowledge using MMed-IFT, while the second stage fine-tunes task-specific multiple-choice questions with MMed-IFT-MC.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText Readability and Simplification