JMedLoRA: Medical Domain Adaptation on Japanese Large Language Models using Instruction-tuning

Issey Sukeda; Masahiro Suzuki; Hiroki Sakaji; Satoshi Kodera

arXiv:2310.10083·cs.CL·December 4, 2023·5 cites

TL;DR

This paper demonstrates that LoRA-based instruction-tuning enhances Japanese medical question-answering capabilities of large language models, highlighting the importance of domain-specific adaptation and the potential for local model deployment.

Contribution

The paper introduces a novel application of LoRA-based instruction-tuning to Japanese medical LLMs, and provides a multifaceted evaluation along with insights into domain adaptation challenges.
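As a rough illustration of what "LoRA-based" means here, the sketch below shows the standard LoRA reparameterization: a frozen weight matrix W0 plus a scaled low-rank update (alpha / r) * B A, where only A and B are trained. The dimensions, rank `r`, and scaling `alpha` are hypothetical values chosen for the example; this page does not state the settings used in the paper.

```python
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def lora_effective_weight(w0, a, b, alpha, r):
    """Return W0 + (alpha / r) * B @ A, the weight a LoRA-adapted layer applies."""
    scale = alpha / r
    delta = matmul(b, a)  # (d_out x r) @ (r x d_in) -> d_out x d_in
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(w0, delta)
    ]


# Hypothetical toy sizes; real LLM layers are far larger.
d_out, d_in, r, alpha = 4, 6, 2, 16
rng = random.Random(0)

w0 = [[rng.gauss(0, 1) for _ in range(d_in)] for _ in range(d_out)]  # frozen
a = [[rng.gauss(0, 0.01) for _ in range(d_in)] for _ in range(r)]    # trainable, r x d_in
b = [[0.0] * r for _ in range(d_out)]                                # trainable, d_out x r, zero init

w = lora_effective_weight(w0, a, b, alpha, r)

full_params = d_out * d_in        # parameters touched by full fine-tuning
lora_params = r * (d_in + d_out)  # parameters touched by LoRA
```

Because B starts at zero, the adapted layer initially behaves exactly like the pretrained one. The parameter saving is negligible at these toy sizes but grows quickly as `d_in` and `d_out` become large relative to `r`.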

Findings

01. LoRA-based instruction-tuning improves medical QA performance.
02. Larger models show more significant domain adaptation effects.
03. Japanese-centric models still face limitations in domain adaptation.

Abstract

In the ongoing wave of impact driven by large language models (LLMs) like ChatGPT, the adaptation of LLMs to the medical domain has emerged as a crucial research frontier. Since mainstream LLMs tend to be designed for general-purpose applications, constructing a medical LLM through domain adaptation is a huge challenge. While instruction-tuning is used to fine-tune some LLMs, its precise role in domain adaptation remains unknown. Here we show the contribution of LoRA-based instruction-tuning to performance in Japanese medical question-answering tasks. In doing so, we employ a multifaceted evaluation for multiple-choice questions, including scoring based on "Exact match" and "Gestalt distance" in addition to the conventional accuracy. Our findings suggest that LoRA-based instruction-tuning can partially incorporate domain-specific knowledge into LLMs, with larger models demonstrating more…
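The two string-based scores named in the abstract can be sketched as follows. This is a minimal illustration, not the authors' evaluation code: it assumes "Gestalt distance" refers to Gestalt (Ratcliff/Obershelp) pattern matching, which Python's standard `difflib.SequenceMatcher` approximates, and it reports a similarity in [0, 1] rather than a distance. The function names and the whitespace normalization are hypothetical choices.

```python
from difflib import SequenceMatcher


def exact_match(prediction: str, gold: str) -> bool:
    """True when the model output equals the gold answer after trimming whitespace."""
    return prediction.strip() == gold.strip()


def gestalt_similarity(prediction: str, gold: str) -> float:
    """Gestalt-style similarity in [0, 1]; 1.0 means identical strings."""
    return SequenceMatcher(None, prediction.strip(), gold.strip()).ratio()


def score(predictions, golds):
    """Average both scores over paired lists of predictions and gold answers."""
    if not golds or len(predictions) != len(golds):
        raise ValueError("predictions and golds must be non-empty and equal in length")
    n = len(golds)
    em = sum(exact_match(p, g) for p, g in zip(predictions, golds)) / n
    gs = sum(gestalt_similarity(p, g) for p, g in zip(predictions, golds)) / n
    return {"exact_match": em, "gestalt_similarity": gs}
```

A similarity score of this kind gives partial credit when a generated answer nearly matches the gold choice, which exact match alone would count as a miss.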

Code & Models

Repositories

stardust-coder/japanese-lm-med-harness (PyTorch)

Taxonomy

Topics: Topic Modeling · Artificial Intelligence in Healthcare and Education