Reliability-Oriented Multilingual Orthopedic Diagnosis: A Domain-Adaptive Modeling and a Conceptual Validation Framework
Danish Ali, Li Xiaojian, Sundas Iqbal, and Farrukh Zaidi

TL;DR
This study evaluates multilingual orthopedic diagnosis models, highlighting the importance of domain adaptation and structured validation for reliable clinical decision support in low-resource languages.
Contribution
It introduces a domain-adaptive model architecture and a conceptual validation framework to improve reliability and safety in multilingual clinical diagnosis systems.
Findings
LLMs show strong linguistic fluency but unstable calibration in structured multilingual tasks.
Domain-adaptive models like IndicBERT-HPA outperform task-only models across diagnostic categories.
Structured validation and human-in-the-loop are essential for safe deployment of clinical decision support systems.
Abstract
Large Language Models (LLMs) are increasingly proposed for clinical decision support including multilingual diagnosis in low-resource settings. However, their reliability, calibration and safety characteristics remain insufficiently understood for structured, high-risk tasks. We present a system-level analysis of multilingual orthopedic diagnosis from free-text clinical notes in English, Hindi and Punjabi. We evaluate three modeling regimes: (i) task-aligned multilingual transformer encoders, (ii) a task-fine-tuned baseline (DistilBERT), and (iii) a domain-adaptive architecture tailored to orthopedic text (IndicBERT-HPA). These models are compared with zero-shot, instruction-tuned LLMs to assess suitability for structured diagnostic classification. Results indicate that while LLMs exhibit strong linguistic fluency, they show unstable calibration and reduced reliability under structured…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
