# Machine learning-driven Diabetes Health Tracer (DHT): Optimizing prognosis using RaSK_GraDe and RaSK_GraDeL models

**Authors:** Muhammad Noman, Maria Hanif, Abdul Hameed, Muhammad Babar, Basit Qureshi

PMC · DOI: 10.1371/journal.pone.0327661 · PLOS One · 2025-10-21

## TL;DR

This paper introduces ML models RaSK_GraDe and RaSK_GraDeL for diabetes prediction, achieving high accuracy using a new dataset and preprocessing techniques.

## Contribution

The novel contribution is the development and evaluation of RaSK_GraDe and RaSK_GraDeL ensemble models for diabetes prognosis.

## Key findings

- RaSK_GraDe achieved 98.03% accuracy using a Voting Classifier.
- RaSK_GraDeL achieved 98.55% accuracy using a Stacking Model with Logistic Regression.
- The DHT dataset with 2877 observations was used for robust model evaluation.

## Abstract

Diabetes mellitus presents a significant global health challenge, particularly in regions like Pakistan, India, and Bangladesh. Machine learning (ML) techniques offer promising solutions for diabetes prediction, surpassing traditional methods in reliability and efficiency. This research conducts a comparative analysis of ML algorithms including Random Forest (RF), Decision Tree (DT), Support Vector Machine (SVM), K-nearest neighbors (KNN), Gradient Boosting (GB), RaSK_GraDe (Proposed Voting), and RaSK_GraDeL (Proposed Stacking). Evaluation is performed using datasets, such as PIMA Indian, Frankfurt Hospitals Diabetes, RTML with Insulin, and the proposed Diabetes Health Tracer (DHT) dataset comprising 2877 observations with nine features. Data pre-processing techniques address missing values, outliers, normalization, and class balancing (SMOTE), enhancing model robustness. Hyperparameter tuning via cross-validation and Random Search optimizes model performance. Additionally, ensemble methods—Voting Classifier (RaSK GraDe) and Stacking Model (RaSK GraDeL with Logistic Regression) are applied, achieving notable accuracies of 98.03% and 98.55%, respectively, on the DHT dataset. The study underscores ML’s potential in diabetes prediction, advocating for personalized treatment and healthcare management advancements.

## Linked entities

- **Diseases:** Diabetes mellitus (MONDO:0005015)

## Full-text entities

- **Genes:** INS (insulin) [NCBI Gene 3630] {aka IDDM, IDDM1, IDDM2, ILPR, IRDN, MODY10}
- **Diseases:** Diabetes (MESH:D003920)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12539721/full.md

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12539721/full.md

## References

44 references — full list in the complete paper: https://tomesphere.com/paper/PMC12539721/full.md

---
Source: https://tomesphere.com/paper/PMC12539721