Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Validated Dataset

Mohammed Nowshad Ruhani Chowdhury; Mohammed Nowaz Rabbani Chowdhury; Sakari Lukkarinen

arXiv:2603.24772·cs.CL·March 27, 2026

Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Validated Dataset

Mohammed Nowshad Ruhani Chowdhury, Mohammed Nowaz Rabbani Chowdhury, Sakari Lukkarinen

PDF

Open Access

TL;DR

This study demonstrates that fine-tuning a large language model on a small Finnish medical transcription dataset can achieve meaningful semantic accuracy, supporting its use in low-resource clinical NLP tasks.

Contribution

It introduces a domain-specific fine-tuning approach for LLaMA 3.1-8B on Finnish medical transcription data, showing feasibility in low-resource settings.

Findings

01

Semantic similarity was high despite low n-gram overlap.

02

Fine-tuning improved model performance for Finnish clinical transcription.

03

Supports privacy-preserving NLP applications in healthcare.

Abstract

Clinical documentation is a critical factor for patient safety, diagnosis, and continuity of care. The administrative burden of EHRs is a significant factor in physician burnout. This is a critical issue for low-resource languages, including Finnish. This study aims to investigate the effectiveness of a domain-aligned natural language processing (NLP); large language model for medical transcription in Finnish by fine-tuning LLaMA 3.1-8B on a small validated corpus of simulated clinical conversations by students at Metropolia University of Applied Sciences. The fine-tuning process for medical transcription used a controlled preprocessing and optimization approach. The fine-tuning effectiveness was evaluated by sevenfold cross-validation. The evaluation metrics for fine-tuned LLaMA 3.1-8B were BLEU = 0.1214, ROUGE-L = 0.4982, and BERTScore F1 = 0.8230. The results showed a low n-gram…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Topic Modeling · Machine Learning in Healthcare