# Identifying health conditions in older adults in textual health records using deep learning-based natural language processing

**Authors:** Jake Lin, Anna Kuukka, Tomi Korpi, Anna Tirkkonen, Antti Kariluoto, Juho Kaijansinkko, Maija Satamo, Hanna Pajulammi, Markus J. Haapanen, Sergei Häyrynen, Eetu Pursiainen, Daniel Ciovica, Mikaela B. von Bonsdorff, Juulia Jylhävä

PMC · DOI: 10.1016/j.csbj.2025.09.009 · 2025-09-10

## TL;DR

This study uses deep learning to find health issues like falls and incontinence in older adults from unstructured medical records, showing better results than traditional methods.

## Contribution

The novel use of deep learning-based NER to identify underreported health conditions in older adults from free-text health records.

## Key findings

- NER models outperformed diagnostic codes in identifying falls and incontinence with F1 scores above 0.80.
- NER-identified conditions showed stronger mortality hazard ratios compared to diagnostic codes.
- The study highlights the potential of NER to improve patient care by uncovering hidden health risks in unstructured data.

## Abstract

Many clinically significant health conditions in older adults are underreported or only recorded in unstructured health records. These records, however, contain valuable information for patient care and prognosis. This study utilized 10.6 million free-text entries from the electronic health records of 102,525 patients aged 50–80 across various care settings in Finland from 2010 to 2022. A deep learning-based natural language processing model was employed to perform named entity recognition (NER) to identify falls, incontinence, loneliness, and mobility limitations from the free-text entries. The performance of the NER models was evaluated by precision, recall and F1 scores. Diagnostic codes for incontinence and falls were collected for comparisons. Cox regression models were used to assess the predictive value of the identified conditions for all-cause mortality. The NER models demonstrated excellent performance with recall, precision and F1 scores greater than 0.80 across the health conditions. Compared to diagnostic codes, NER identified greater numbers of falls (31987 vs 4090) and incontinence (7059 vs 3873) onsets and yielded greater hazard ratios for all-cause mortality: 1.31 vs 1.04 for falls and 1.99 vs 0.65 for incontinence. Deep learning-based NER models present new opportunities to identify vulnerable patients in free text health records.

## Full-text entities

- **Diseases:** falls (MESH:C537863), incontinence (MESH:D014549), mobility limitations (MESH:D051346)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12765990/full.md

---
Source: https://tomesphere.com/paper/PMC12765990