# Detection of Antithrombotic-Related Bleeding in Older Inpatients: Multicenter Retrospective Study Using Structured and Unstructured Electronic Health Record Data

**Authors:** Claire Coumau, Frederic Gaspar, Mehdi Zayene, Elliott Bertrand, Lorenzo Alberio, Christian Lovis, Patrick E Beeler, Fabio Rinaldi, Monika Lutters, Marie-Annick Le Pogam, Chantal Csajka, Bernard Burnand

PMC · DOI: 10.2196/77809 · 2026-01-29

## TL;DR

This study developed and tested algorithms to detect bleeding events in older patients using electronic health records, combining structured data and natural language processing for better accuracy.

## Contribution

The novel contribution is an integrated algorithm combining structured data rules and NLP for improved detection of antithrombotic-related bleeding in older inpatients.

## Key findings

- Structured data algorithms detected 8.26% major and 15.04% clinically relevant nonmajor bleeding cases.
- Combining structured data with NLP improved detection to 12.2% major and 27.4% clinically relevant nonmajor bleeding cases.
- The combined model achieved a sensitivity of 0.84 and F1-score of 0.64, showing strong performance and generalizability.

## Abstract

Bleeding complications are a major contributor to adverse drug events among older inpatients, particularly in those treated with antithrombotic agents. Timely and accurate detection of bleeding events is essential for improving drug safety surveillance and clinical risk management.

The study aimed to develop and validate automated algorithms for detecting major bleeding (MB) and clinically relevant nonmajor bleeding (CRNMB) events from electronic medical records (EMRs) by combining structured data-based rule models and a natural language processing (NLP) approach, and to evaluate their performance and generalizability against a manually reviewed gold standard and an external dataset.

We conducted a multicenter retrospective study using routinely collected EMR data from 3 Swiss university hospitals. Patients 65 years or older who received at least one antithrombotic agent and were hospitalized between January 2015 and December 2016 were included. To detect MB and CRNMB events, rule-based algorithms were developed using structured data (International Statistical Classification of Diseases, 10th Revision, German Modification [ICD-10-GM] codes, laboratory values, transfusion records, and antihemorrhagic prescriptions), with variables and cutoff values defined according to adapted International Society on Thrombosis and Haemostasis definitions and expert consensus. In parallel, a supervised NLP model was applied to discharge summaries from one hospital. A manual review of 754 EMRs served as the reference standard for internal validation, and the algorithm performance of the structured data algorithms (SDA), NLP, and their combination (SDA+NLP) was evaluated against this manually reviewed gold standard using standard performance metrics. External validation was performed on an independent dataset from the Lausanne University Hospital to assess model robustness and generalizability.

Among 36,039 inpatient stays, SDA identified 8.26% (n=2979) as MB and 15.04% (n=5419) as CRNMB cases. ICD-10-GM codes alone detected 28.5% (n=849) of MB and 31.48% (n=1706) of CRNMB cases, while laboratory data contributed most to event detection (n=1994, 66.94% for MB and n=3663, 67.60% for CRNMB). Integrating SDA with NLP improved detection, identifying 12.2% (920/7513) of MB and 27.4% (2062/7513) of CRNMB cases at 1 hospital. The combined model achieved the best performance (sensitivity 0.84, positive predictive value 0.51, F1-score 0.64). External validation on Lausanne University Hospital 2021‐2022 data (n=24,054 stays) confirmed the algorithms’ reproducibility; the prevalence of MB decreased while CRNMB increased, reflecting evolving clinical practices and antithrombotic use patterns.

Our integrated approach, combining SDA with NLP, enhances the detection of hemorrhagic events in older hospitalized patients treated with antithrombotic agents, suggesting its potential usefulness for drug safety monitoring and clinical risk management.

## Full-text entities

- **Diseases:** Bleeding complications (MESH:D008107), Bleeding (MESH:D006470), Thrombosis and Haemostasis (MESH:D020141), MB (MESH:D004830)
- **Chemicals:** Antithrombotic (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12854658/full.md

---
Source: https://tomesphere.com/paper/PMC12854658