# AI-Assisted Differentiation of Dengue and Chikungunya Using Big, Imbalanced Epidemiological Data

**Authors:** Thanh Huy Nguyen, Nguyen Quoc Khanh Le

PMC · DOI: 10.3390/tropicalmed11020040 · Tropical Medicine and Infectious Disease · 2026-01-30

## TL;DR

This study uses AI to accurately differentiate between dengue and chikungunya using large-scale epidemiological data, aiding diagnosis in resource-limited areas.

## Contribution

The novel use of AI models, particularly ANN, to classify dengue, chikungunya, and discarded cases with high accuracy in a real-world, imbalanced dataset.

## Key findings

- Random Forest achieved high multi-class classification performance (Recall: 0.9288, AUC: 0.9865).
- ANN excelled in identifying chikungunya cases (Recall: 0.9986, AUC: 0.9283).
- Models showed generalizability, especially for distinguishing discarded cases.

## Abstract

Dengue and chikungunya are endemic arboviral diseases in many low- and middle-income countries, often co-circulating and presenting with overlapping symptoms that hinder early diagnosis. Timely differentiation is critical, especially in resource-limited settings where laboratory testing is unavailable. We developed and evaluated machine-learning (ML)- and deep-learning (DL) models to classify dengue, chikungunya, and discarded cases using a large-scale, real-world dataset of over 6.7 million entries from Brazil (2013–2020). After applying the Synthetic Minority Oversampling Technique (SMOTE) to address class imbalance, we trained six ML models and one artificial neural network (ANN) using only demographic, clinical, and comorbidity features. The Random Forest model achieved strong multi-class classification performance (Recall: 0.9288, the Area Under the Curve (AUC): 0.9865). The ANN model excelled in identifying chikungunya cases (Recall: 0.9986, AUC: 0.9283), suggesting its suitability for rapid screening. External validation confirmed the generalizability of our models, particularly for distinguishing discarded cases. Our models demonstrate high-accuracy in differentiating dengue and chikungunya using routinely collected clinical and epidemiological data. This work supports the development of Artificial Intelligence-powered decision-support tools to assist frontline healthcare workers in under-resourced settings and aligns with the One Health approach to improving surveillance and diagnosis of neglected tropical diseases.

## Linked entities

- **Diseases:** dengue (MONDO:0005502), chikungunya (MONDO:0017941)

## Full-text entities

- **Genes:** IVNS1ABP (influenza virus NS1A binding protein) [NCBI Gene 10625] {aka ARA3, FLARA3, HSPC068, IMD70, KLHL39, ND1}
- **Diseases:** Conjunctivitis (MESH:D003231), Back Pain (MESH:D001416), arboviral diseases (MESH:D004671), virus infection (MESH:D014777), muscle, bone, or back pain (MESH:D063806), deaths (MESH:D003643), DIS (MESH:C567010), DL (MESH:D007859), Hypertension (MESH:D006973), Hematological disease (MESH:D006402), NTDs (MESH:D058069), Retro-orbital pain (MESH:D010146), jaundice (MESH:D007565), flu (MESH:D007251), bleeding (MESH:D006470), acute febrile infections (MESH:D000071072), renal disease (MESH:D007674), Chikungunya (MESH:D065632), autoimmune disease (MESH:D001327), DF (MESH:D003715), NAUSEA (MESH:D009325), DHF (MESH:D019595), Headache (MESH:D006261), Arthritis (MESH:D001168), injury to (MESH:D014947), febrile diseases (MESH:D004194), arbovirus diseases (MESH:D001102), exanthema (MESH:D005076), scrub typhus (MESH:D012612), leptospirosis (MESH:D007922), vomiting (MESH:D014839), yellow fever (MESH:D015004), chronic disability (MESH:D002908), joint edema (MESH:D004487), anuria (MESH:D001002), eye pain (MESH:D058447), abdominal pain (MESH:D015746), borne (MESH:D017282), Coma (MESH:D003128), Arthralgia (MESH:D018771), mosquito-borne diseases (MESH:D000079426), hematological, liver, kidney, peptic acid, (MESH:D010437), Diabetes (MESH:D003920), West Nile fever (MESH:D014901), weakness (MESH:D018908), Fever (MESH:D005334), infected (MESH:D007239), Zika (MESH:D000071243), irritability (MESH:D001523), malaria (MESH:D008288)
- **Chemicals:** creatinine (MESH:D003404)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12944897/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12944897/full.md

## References

47 references — full list in the complete paper: https://tomesphere.com/paper/PMC12944897/full.md

---
Source: https://tomesphere.com/paper/PMC12944897