# Unsupervised machine learning identifies biomarkers of disease progression in post-kala-azar dermal leishmaniasis in Sudan

**Authors:** Ana Torres, Brima Musa Younis, Samuel Tesema, Jose Carlos Solana, Javier Moreno, Antonio J. Martín-Galiano, Ahmed Mudawi Musa, Fabiana Alves, Eugenia Carrillo

PMC · DOI: 10.1371/journal.pntd.0012924 · PLOS Neglected Tropical Diseases · 2025-03-11

## TL;DR

Researchers used machine learning to find biomarkers that predict the progression of a skin disease in Sudanese patients recovering from a parasitic infection.

## Contribution

This study is the first to use unsupervised machine learning to identify biomarkers for predicting disease progression in post-kala-azar dermal leishmaniasis.

## Key findings

- A combination of 10 non-redundant variables was linked to a 3.1 times higher risk of progressive PKDL.
- Low albumin, low haematocrit, and low IFN-γ production were key factors in progressive disease.
- 14 biochemical variables identified by ML can detect worsening PKDL states in Sudanese patients.

## Abstract

Post-kala-azar dermal leishmaniasis (PKDL) appears as a rash in some individuals who have recovered from visceral leishmaniasis caused by Leishmania donovani. Today, basic knowledge of this neglected disease and how to predict its progression remain largely unknown.

This study addresses the use of several biochemical, haematological and immunological variables, independently or through unsupervised machine learning (ML), to predict PKDL progression risk. In 110 patients from Sudan, 31 such factors were assessed in relation to PKDL disease state at the time of diagnosis: progressive (worsening) versus stable. To identify key factors associated with PKDL worsening, we used both a conventional statistical approach and multivariate analysis through unsupervised ML. The independent use of these variables had limited power to predict skin lesion severity in a baseline examination. In contrast, the unsupervised ML approach identified a set of 10 non-redundant variables that was linked to a 3.1 times higher risk of developing progressive PKDL. Three of these clustering factors (low albumin level, low haematocrit and low IFN-γ production in PBMCs after Leishmania antigen stimulation) were remarkable in patients with progressive disease. Dimensionality re-establishment identified 11 further significantly modified factors that are also important to understand the worsening phenotype. Our results indicate that the combination of anaemia and a weak Th1 immunological response is likely the main physiological mechanism that leads to progressive PKDL.

A combination of 14 biochemical variables identified by unsupervised ML was able to detect a worsening PKDL state in Sudanese patients. This approach could prove instrumental to train future supervised algorithms based on larger patient cohorts both for a more precise diagnosis and to gain insight into fundamental aspects of this complication of visceral leishmaniasis.

Post-kala-azar dermal leishmaniasis (PKDL) is a neglected disease that causes stigmatization and carries a significant socioeconomic and personal burden in South Asia and East Africa. Currently, predicting its progression to a severe form remains largely unknown. In this study, we used machine learning (ML) methods to identify biomarkers of disease progression in PKDL using clinical, biochemical, haematological, and immunological data from a large cohort of Sudanese patients with either progressive (worsening) or stable conditions at diagnosis. For the first time, this study identified a combination of patient factors that may help provide a more accurate diagnosis of difficult-to-treat PKDL cases, with potential implications for improving patient management and quality of life.

## Linked entities

- **Proteins:** IFNG (interferon gamma)
- **Diseases:** visceral leishmaniasis (MONDO:0005445)
- **Species:** Leishmania donovani (taxon 5661)

## Full-text entities

- **Genes:** ALB (albumin) [NCBI Gene 213] {aka FDAHT, HSA, PRO0883, PRO0903, PRO1341}, IFNG (interferon gamma) [NCBI Gene 3458] {aka IFG, IFI, IMD69}
- **Diseases:** anaemia (MESH:D000743), PKDL (MESH:D007898), rash (MESH:D005076), skin lesion (MESH:D012871)
- **Species:** Leishmania donovani (species) [taxon 5661], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11932480/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11932480/full.md

## References

60 references — full list in the complete paper: https://tomesphere.com/paper/PMC11932480/full.md

---
Source: https://tomesphere.com/paper/PMC11932480