# SleepBert: An Intelligent Clinical Encyclopaedia for Sleep Disorders Using Large Language Models

**Authors:** KA Amala Ann, V Vaidhehi

PMC · DOI: 10.21203/rs.3.rs-6605863/v1 · Research Square · 2025-05-08

## TL;DR

SleepBert is a new AI system that improves sleep disorder diagnosis by combining sleep data with clinical knowledge, offering faster and more accurate insights.

## Contribution

SleepBert introduces a hybrid RAG model that integrates PSG data and clinical narratives for sleep disorder analysis.

## Key findings

- SleepBert achieved 93.40% accuracy, outperforming ClinicalBERT and BERT in sleep disorder classification.
- The model retrieved PubMed evidence with 90.1% accuracy and a latency of 5.4 seconds.
- SleepBert enables faster analysis of PSG data and access to rare neuro-cognitive and genetic markers.

## Abstract

Diagnosis of sleep disorders is difficult owing to the nature of sleep microarchitecture and the heterogeneity of symptom presentation. Conventional analysis of Polysomnography (PSG)—the interpretation of EEG bandpower, sleep spindles, and K-complexes—is time-consuming, laborious, and subjective, restricting detection of infrequent co-occurrences of disorders and their link to neuro-cognitive and genetic disorders. To overcome these challenges, we present SleepBert, a hybrid Retrieval-Augmented Generation (RAG) model that combines structured PSG features with unstructured clinical narratives for holistic sleep disorder analysis. Constructed by fine-tuning ClinicalBERT on PSG data from the NCH (paediatric dataset) and ISRUC datasets, SleepBert has a PSG-specific knowledge retrieval layer to retrieve real-time evidence from medical databases such as PubMed. The model delivered 93.40% accuracy, outdoing ClinicalBERT (87.20%) and BERT (80.90%), with 90.1% accuracy in retrieving PubMed and response latency of 5.4 seconds. This system serves as an Encyclopaedia of sleep disorders, delivering evidence-based, correct insights and support for decision making to clinicians and researchers. The system supports the analysis of a large number of PSGs, speeds up data-driven discoveries, and allows access to rare neuro-cognitive and genetic markers. SleepBert is an extensible platform for pushing the frontier of sleep disorder research and enhancing clinical decision-making through quick, accurate interpretations of sophisticated PSG data.

## Linked entities

- **Diseases:** sleep disorders (MONDO:0003406)

## Full-text entities

- **Diseases:** Sleep Disorders (MESH:D012893), neuro-cognitive and genetic disorders (MESH:D003072)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12083639/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12083639/full.md

## References

16 references — full list in the complete paper: https://tomesphere.com/paper/PMC12083639/full.md

---
Source: https://tomesphere.com/paper/PMC12083639