# Developing a model to predict neonatal respiratory distress syndrome and affecting factors using data mining: A cross-sectional study

**Authors:** Parisa Farshid, Kayvan Mirnia, Peyman Rezaei-Hachesu, Elham Maserat, Taha Samad-Soltani

PMC · DOI: 10.18502/ijrm.v21i11.14654 · International Journal of Reproductive Biomedicine · 2023-12-19

## TL;DR

This study uses data mining to develop a model predicting neonatal respiratory distress syndrome, showing promising accuracy with random forest techniques.

## Contribution

The novel contribution is applying data mining techniques, particularly random forest, to predict neonatal RDS with high accuracy.

## Key findings

- Random forest achieved the highest accuracy (0.815) in predicting neonatal RDS.
- The model demonstrated good sensitivity (0.802) and specificity (0.812).
- Data mining is shown to be a feasible tool for improving neonatal RDS diagnosis.

## Abstract

Background: One of the major challenges that hospitals and clinicians face is the early identification of newborns at risk for adverse events. One of them is neonatal respiratory distress syndrome (RDS). RDS is the widest spared respiratory disorder in immature newborns and the main source of death among them. Machine learning has been broadly accepted and used in various scopes to analyze medical information and is very useful in the early detection of RDS.
Objective: This study aimed to develop a model to predict neonatal RDS and affecting factors using data mining.
Materials and Methods: The original dataset in this cross-sectional study was extracted from the medical records of newborns diagnosed with RDS from July 2017-July 2018 in Alzahra hospital, Tabriz, Iran. This data includes information about 1469 neonates, and their mothers information. The data were preprocessed and applied to expand the classification model using machine learning techniques such as support vector machine, Naïve Bayes, classification tree, random forest, CN2 rule induction, and neural network, for prediction of RDS episodes. The study compares models according to their accuracy.
Results: Among the obtained results, an accuracy of 0.815, sensitivity of 0.802, specificity of 0.812, and area under the curve of 0.843 was the best output using random forest.
Conclusion: The findings of our study proved that new approaches, such as data mining, may support medical decisions, improving diagnosis in neonatal RDS. The feasibility of using a random forest in neonatal RDS prediction would offer the possibility to decrease postpartum complications of neonatal care.

## Linked entities

- **Diseases:** neonatal respiratory distress syndrome (MONDO:0700081)

## Full-text entities

- **Diseases:** apnea (MESH:D001049), hypoxic organ harm (MESH:D002534), RDS (MESH:D012128), weight (MESH:D015431), diabetes (MESH:D003920), metabolic disorder (MESH:D008659), preeclampsia (MESH:D011225), hypoglycemia (MESH:D007003), premature rupture of the membranes (MESH:D005322), jaundice (MESH:D007565), low (MESH:D009800), hypertension (MESH:D006973), death (MESH:D003643), neonatal diseases (MESH:D007232), meconium aspiration syndrome (MESH:D008471), circumference (MESH:C535556), neonatal death (MESH:D066087), neonatal RDS (MESH:D012127), brain harm (MESH:D001927), respiratory disorder (MESH:D012131), prematurity (MESH:C536271), TN (MESH:C579935)
- **Chemicals:** steroid (MESH:D013256), magnesium sulfide (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC10823121/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC10823121/full.md

## References

34 references — full list in the complete paper: https://tomesphere.com/paper/PMC10823121/full.md

---
Source: https://tomesphere.com/paper/PMC10823121