CLIN-LLM: A Safety-Constrained Hybrid Framework for Clinical Diagnosis and Treatment Generation
Md. Mehedi Hasan, Md. Abir Hossain, Farman Hossain Sayem, Bikash Kumar Paul, Ziaur Rahman, Mohammad Shorif Uddin, and Rafid Mostafiz

TL;DR
CLIN-LLM is a safety-aware hybrid framework that combines multimodal patient data encoding, uncertainty calibration, and retrieval-augmented generation to improve clinical diagnosis and treatment recommendations.
Contribution
It introduces a novel safety-constrained pipeline integrating uncertainty estimation and retrieval-augmented generation for clinical decision support.
Findings
Achieves 98% accuracy and F1 score, outperforming ClinicalBERT by 7.1%.
Reduces unsafe antibiotic suggestions by 67% compared to GPT-5.
Flags 18% of cases for expert review based on confidence levels.
Abstract
Accurate symptom-to-disease classification and clinically grounded treatment recommendations remain challenging, particularly in heterogeneous patient settings with high diagnostic risk. Existing large language model (LLM)-based systems often lack medical grounding and fail to quantify uncertainty, resulting in unsafe outputs. We propose CLIN-LLM, a safety-constrained hybrid pipeline that integrates multimodal patient encoding, uncertainty-calibrated disease classification, and retrieval-augmented treatment generation. The framework fine-tunes BioBERT on 1,200 clinical cases from the Symptom2Disease dataset and incorporates Focal Loss with Monte Carlo Dropout to enable confidence-aware predictions from free-text symptoms and structured vitals. Low-certainty cases (18%) are automatically flagged for expert review, ensuring human oversight. For treatment generation, CLIN-LLM employs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
