CLIN-LLM: A Safety-Constrained Hybrid Framework for Clinical Diagnosis and Treatment Generation

Md. Mehedi Hasan; Md. Abir Hossain; Farman Hossain Sayem; Bikash Kumar Paul; Ziaur Rahman; Mohammad Shorif Uddin; and Rafid Mostafiz

arXiv:2510.22609·cs.AI·April 28, 2026

CLIN-LLM: A Safety-Constrained Hybrid Framework for Clinical Diagnosis and Treatment Generation

Md. Mehedi Hasan, Md. Abir Hossain, Farman Hossain Sayem, Bikash Kumar Paul, Ziaur Rahman, Mohammad Shorif Uddin, and Rafid Mostafiz

PDF

TL;DR

CLIN-LLM is a safety-aware hybrid framework that combines multimodal patient data encoding, uncertainty calibration, and retrieval-augmented generation to improve clinical diagnosis and treatment recommendations.

Contribution

It introduces a novel safety-constrained pipeline integrating uncertainty estimation and retrieval-augmented generation for clinical decision support.

Findings

01

Achieves 98% accuracy and F1 score, outperforming ClinicalBERT by 7.1%.

02

Reduces unsafe antibiotic suggestions by 67% compared to GPT-5.

03

Flags 18% of cases for expert review based on confidence levels.

Abstract

Accurate symptom-to-disease classification and clinically grounded treatment recommendations remain challenging, particularly in heterogeneous patient settings with high diagnostic risk. Existing large language model (LLM)-based systems often lack medical grounding and fail to quantify uncertainty, resulting in unsafe outputs. We propose CLIN-LLM, a safety-constrained hybrid pipeline that integrates multimodal patient encoding, uncertainty-calibrated disease classification, and retrieval-augmented treatment generation. The framework fine-tunes BioBERT on 1,200 clinical cases from the Symptom2Disease dataset and incorporates Focal Loss with Monte Carlo Dropout to enable confidence-aware predictions from free-text symptoms and structured vitals. Low-certainty cases (18%) are automatically flagged for expert review, ensuring human oversight. For treatment generation, CLIN-LLM employs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.