Severity-Aware Weighted Loss for Arabic Medical Text Generation
Ahmed Alansary, Molham Mohamed, Ali Hamdi

TL;DR
This paper introduces a severity-aware weighted loss for fine-tuning Arabic medical language models, prioritizing clinically critical cases to improve medical text generation accuracy.
Contribution
It proposes a novel loss function that dynamically weights token-level errors based on clinical severity, enhancing model performance without altering architecture.
Findings
Severity-aware fine-tuning improves performance significantly across models.
Performance gains range from 12% to 18% over baseline models.
The method effectively prioritizes high-severity medical cases during training.
Abstract
Large language models have shown strong potential for Arabic medical text generation; however, traditional fine-tuning objectives treat all medical cases uniformly, ignoring differences in clinical severity. This limitation is particularly critical in healthcare settings, where errors in severe cases contain higher clinical risk. In this work, we propose a severity-aware weighted loss for fine-tuning Arabic language models on medical complaint-response data. The method depends on soft severity probabilities to dynamically scale token-level loss contributions during optimization, thereby prioritizing clinically critical interactions without modifying model architectures. Experiments are conducted using the MAQA dataset, which provides Arabic medical complaints and trusted human responses. Severity labels and probabilistic scores are automatically derived using a fine-tuned AraBERT-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
