Towards Democratizing Multilingual Large Language Models For Medicine Through A Two-Stage Instruction Fine-tuning Approach
Meng Zhou, Surajsinh Parmar, Anubhav Bhatti

TL;DR
This paper presents a two-stage instruction fine-tuning method for multilingual medical large language models, using high-quality datasets to improve performance across languages while maintaining computational efficiency.
Contribution
It introduces two new multilingual medical datasets and a two-stage training paradigm to enhance LLMs for healthcare across multiple languages.
Findings
Achieves competitive results on multilingual benchmarks.
Balances performance with computational efficiency.
Provides publicly available datasets and model weights.
Abstract
Open-source, multilingual medical large language models (LLMs) have the potential to serve linguistically diverse populations across different regions. Adapting generic LLMs for healthcare often requires continual pretraining, but this approach is computationally expensive and sometimes impractical. Instruction fine-tuning on a specific task may not always guarantee optimal performance due to the lack of broader domain knowledge that the model needs to understand and reason effectively in diverse scenarios. To address these challenges, we introduce two multilingual instruction fine-tuning datasets, MMed-IFT and MMed-IFT-MC, containing over 200k high-quality medical samples in six languages. We propose a two-stage training paradigm: the first stage injects general medical knowledge using MMed-IFT, while the second stage fine-tunes task-specific multiple-choice questions with MMed-IFT-MC.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText Readability and Simplification
