# High-fidelity parameter-efficient fine-tuning for joint recognition and linking of diagnoses to ICD-10 in non-standard primary care notes

**Authors:** Cristian Estupiñán-Ojeda, Raúl J Sandomingo-Freire, Lluís Padró, Jordi Turmo

PMC · DOI: 10.1093/jamiaopen/ooaf120 · JAMIA Open · 2025-10-16

## TL;DR

This paper evaluates efficient training methods for identifying and linking diagnoses to ICD-10 codes in non-standard Spanish and Catalan primary care notes.

## Contribution

It introduces parameter-efficient fine-tuning (PEFT) as a memory-efficient alternative to full fine-tuning for multilingual clinical text classification.

## Key findings

- BERT-QLoRA achieved 62.2 strict Micro-F1 with 67.5% fewer trainable parameters than full fine-tuning.
- Training on combined bilingual data improved generalization across languages.
- QLoRA offered the best balance of accuracy and efficiency among PEFT techniques.

## Abstract

Joint recognition and ICD-10 linking of diagnoses in bilingual, non-standard Spanish and Catalan primary care notes is challenging. We evaluate parameter-efficient fine-tuning (PEFT) techniques as a resource-conscious alternative to full fine-tuning (FFT) for multi-label clinical text classification.

On a corpus of 21 812 Catalan and Spanish clinical notes from Catalonia, we compared the PEFT techniques LoRA, DoRA, LoHA, LoKR, and QLoRA applied to multilingual transformers (BERT, RoBERTa, DistilBERT, and mDeBERTa).

FFT delivered the best strict Micro-F1 (63.0), but BERT-QLoRA scored 62.2, only 0.8 points lower, while reducing trainable parameters by 67.5% and memory by 33.7%. Training on combined bilingual data consistently improved generalization across individual languages.

The small FFT margin was confined to rare labels, indicating limited benefit from updating all parameters. Among PEFT techniques, QLoRA offered the strongest accuracy-efficiency balance; LoRA and DoRA were competitive, whereas LoHA and LoKR incurred larger losses. Adapter rank mattered: ranks below 128 sharply degraded Micro-F1. The substantial memory savings enable deployment on commodity GPUs while delivering performance very close to FFT.

PEFT, particularly QLoRA, supports accurate and memory-efficient joint entity recognition and ICD-10 linking in multilingual, low-resource clinical settings.

## Full-text entities

- **Diseases:** fracture (MESH:D050723), cardiomegalia aprox (MESH:D006009), ICD (OMIM:252500), stroke (MESH:D020521), JERL (MESH:D020238), Fracture of unspecified body region (MESH:C562424), heart disease (MESH:D006331), cardiomegaly (MESH:D006332), FFT (MESH:C566019)
- **Chemicals:** Pt (MESH:D010984), QLoRA (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12530322/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12530322/full.md

## References

25 references — full list in the complete paper: https://tomesphere.com/paper/PMC12530322/full.md

---
Source: https://tomesphere.com/paper/PMC12530322