Development and Preliminary Evaluation of a Domain-Specific Large Language Model for Tuberculosis Care in South Africa

Thokozile Khosa; Olawande Daramola

arXiv:2604.19776·cs.CL·April 23, 2026

Development and Preliminary Evaluation of a Domain-Specific Large Language Model for Tuberculosis Care in South Africa

Thokozile Khosa, Olawande Daramola

PDF

TL;DR

This paper describes the development and initial testing of a specialized large language model tailored for tuberculosis care in South Africa, utilizing domain-specific data and advanced fine-tuning techniques.

Contribution

It introduces a novel domain-specific LLM for TB care, fine-tuned with QLoRA and retrieval augmentation, showing improved performance over general models.

Findings

01

The DS-LLM outperformed the base BioMistral-7B in contextual alignment.

02

Fine-tuning with QLoRA enhanced the model's medical knowledge.

03

Retrieval-augmented generation contributed to better contextual understanding.

Abstract

Tuberculosis (TB) is one of the world's deadliest infectious diseases, and in South Africa, it contributes a significant burden to the country's health care system. This paper presents an experimental study on the development of a domain-specific Large Language Model (DS-LLM) for TB care that can help to alleviate the burden on patients and healthcare providers. To achieve this, a literature review was conducted to understand current LLM development strategies, specifically in the medical domain. Thereafter, data were collected from South African TB guidelines, selected TB literature, and existing benchmark medical datasets. We performed LLM fine-tuning by using the Quantised Low-Rank Adaptation (QLoRA) algorithm on a medical LLM (BioMistral-7B), and also implemented Retrieval-Augmented Generation using GraphRAG. The developed DS-LLM was evaluated against the base BioMistral-7B model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.