# Personalized Hearing Loss Care Using SNOMED CT-Aligned Ontology and Random Forest Machine Learning: A Hybrid Decision-Support Framework

**Authors:** Darine Kebsi, Chamseddine Barki, Ismail Dergaa, Riadh Gouider, Halil İbrahim Ceylan, Amina Maddouri, Abderrazak Jemai, Mourad Elloumi, Nicola Luigi Bragazzi, Hanene Boussi Rahmouni

PMC · DOI: 10.3390/audiolres16020037 · 2026-03-02

## TL;DR

This study creates a system combining medical knowledge and machine learning to better diagnose and treat hearing loss with personalized care.

## Contribution

A hybrid framework integrating SNOMED CT-aligned ontology with Random Forest ML for personalized hearing loss classification and treatment prediction.

## Key findings

- The ontology-based Random Forest model achieved 92.48% test accuracy for hearing loss classification.
- Semantic enrichment improved cross-validation accuracy to 92.80% with reduced standard deviation.
- Audiometric thresholds and ontology-derived severity labels were top predictors for accurate classification.

## Abstract

Background: Hearing loss affects over 466 million individuals globally and is recognized as a major risk factor for Alzheimer’s disease, yet treatment personalization remains limited due to the complexity and diversity of underlying causes. Current diagnostic and therapeutic approaches lack standardized methods to accurately predict the most appropriate intervention for individual patients. The integration of medical ontologies with machine learning offers a promising solution for enhancing diagnostic accuracy and treatment personalization. Aim: Our study aimed to (i) develop a Systematized Nomenclature of Medicine—Clinical Terms (SNOMED CT)-aligned clinical ontology for hearing loss using Semantic Web Rule Language for automated reasoning; (ii) implement a Random Forest classifier trained on ontology-enriched patient data to classify hearing loss types (conductive, sensorineural, mixed, or normal); and (iii) predict optimal personalized treatments based on laterality, severity, audiometric thresholds, and medical history using real-world patient data. Methods: We developed a task ontology using Protégé 5.6.3 with Web Ontology Language (OWL), integrated SNOMED CT terminology alignment, and implemented Semantic Web Rule Language rules executed by the Pellet 2.2.0 reasoner. The framework was trained and evaluated on 3723 adult patients from the 2015–2016 National Health and Nutrition Examination Survey (NHANES) dataset with complete audiometric and clinical data. Random Forest models were developed using an 80–20 train-test split with stratified sampling and five-fold cross-validation. Performance was compared between K-Means clustering-based labeling and ontology-based semantic inference using accuracy, precision, recall, F1-score, and log loss metrics. Results: The ontology successfully generated semantic labels for all 3723 patients, enabling precise classification of hearing loss types, severity levels, and laterality. The Random Forest model with K-Means clustering achieved a test accuracy of 90.2% with a log loss of 0.2766 and a cross-validation mean accuracy of 91.22% (standard deviation 1.2%). Integration of ontology-based semantic enrichment significantly improved performance, achieving a test accuracy of 92.48% with a cross-validation mean accuracy of 92.80% (standard deviation 0.9%). F1-scores improved across all classes, with mixed hearing loss showing a notable increase from 0.86 to 0.92. Feature importance analysis identified audiometric thresholds, ontology-derived severity labels, and medical history as top predictors, enhancing clinical interpretability. Conclusions: This study demonstrates that combining SNOMED CT-aligned ontology with Random Forest classification achieves superior diagnostic accuracy and enables personalized treatment recommendations for hearing loss. The hybrid framework provides clinically interpretable decision support while ensuring semantic interoperability with electronic health records. Multi-institutional validation studies are necessary to assess generalizability across diverse populations before clinical deployment.

## Linked entities

- **Diseases:** hearing loss (MONDO:0005365), Alzheimer’s disease (MONDO:0004975)

## Full-text entities

- **Genes:** GJB2 (gap junction protein beta 2) [NCBI Gene 2706] {aka BAPS, CX26, DFNA3, DFNA3A, DFNB1, DFNB1A}
- **Diseases:** diabetes (MESH:D003920), Hearing Impairment (MESH:D034381), Conductive hearing loss (MESH:D006314), Hearing_Condition (MESH:D006311), cardiovascular disease (MESH:D002318), hypertension (MESH:D006973), high-frequency hearing loss (MESH:D006316), bladder cancer (MESH:D001749), Alzheimer's disease (MESH:D000544), CT (MESH:D000088562), deafness (MESH:D003638), Conductive_HL (MESH:C538324), leukemia (MESH:D007938), Sensorineural (MESH:D006319), sensory deficit (MESH:D012678), Mixed hearing loss (MESH:D046089), injury to (MESH:D014947), dementia (MESH:D003704)
- **Chemicals:** alcohol (MESH:D000438)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13010750/full.md

---
Source: https://tomesphere.com/paper/PMC13010750