# Predicting atrial fibrillation and flutter using BEHRT and identifying multimorbidity patterns using BERTopic

**Authors:** Sookyung Bae, Yeonjae Kim, Samina Park, Hwiyoung Kim, Bomi Park

PMC · DOI: 10.3389/fdgth.2026.1722338 · Frontiers in Digital Health · 2026-02-05

## TL;DR

This study uses AI to predict heart rhythm disorders and identify sex-specific disease patterns, aiming to improve personalized healthcare.

## Contribution

Combines BEHRT and BERTopic to predict atrial fibrillation/flutter and identify sex-specific multimorbidity patterns.

## Key findings

- BEHRT achieved an AUC of 0.80 in predicting atrial fibrillation and flutter.
- BERTopic identified sex-specific multimorbidity patterns, such as aortic aneurysm in males and Alzheimer's in females.

## Abstract

Atrial fibrillation and flutter are heart rhythm disorders frequently associated with multiple other chronic conditions, complicating their management and requiring optimized care. Analyzing pre-atrial fibrillation and flutter comorbidity patterns could enable proactive, preventive, and personalized healthcare.

This population-based nested case-control study analyzed data from the Korean National Health Insurance Corporation (2002–2019). Adults aged ≥19 years with at least three years of recorded claims were included. Cases were individuals newly diagnosed with atrial fibrillation and flutter between 2007 and 2019 following a washout period (2002–2006). Controls were matched 1:4 using stratified random sampling. Using 5-year disease histories, BEHRT, a transformer-based model, predicted atrial fibrillation and flutter, while BERTopic identified sex-specific multimorbidity patterns. Predictive performance was evaluated using the area under the receiver operating characteristic curve (AUC).

BEHRT achieved an AUC of 0.80 for predicting atrial fibrillation and flutter among 600,030 participants (8,661 cases and 591,369 controls). BERTopic analysis revealed sex-specific multimorbidity patterns: aortic aneurysm, hypertensive heart disease, and chronic obstructive pulmonary disease were common in males, while Alzheimer's disease, Parkinson's disease, and rheumatic heart disease were prominent in females.

The combination of BEHRT and BERTopic demonstrated the ability to predict atrial fibrillation and flutter based on multimorbid histories while identifying distinct sex-specific disease patterns. These findings underscore the potential for artificial intelligence to enhance personalized healthcare and optimize prevention and management strategies for chronic conditions.

## Linked entities

- **Diseases:** atrial fibrillation (MONDO:0004981), aortic aneurysm (MONDO:0005160), hypertensive heart disease (MONDO:0001302), chronic obstructive pulmonary disease (MONDO:0005002), Alzheimer's disease (MONDO:0004975), Parkinson's disease (MONDO:0005180), rheumatic heart disease (MONDO:0006955)
- **Species:** Homo sapiens (taxon 9606)

## Full-text entities

- **Genes:** CMM (cutaneous malignant melanoma/dysplastic nevus) [NCBI Gene 1243] {aka CMM1, DNS, FAMMM, MLM}
- **Diseases:** hypercapnia (MESH:D006935), atopic diseases (MESH:D006969), stroke (MESH:D020521), ML (MESH:D007859), COPD (MESH:D029424), cardiomyopathy (MESH:D009202), arrhythmias (MESH:D001145), AI (MESH:C538142), gallbladder cancer (MESH:D005706), tubulointerstitial nephritis (MESH:D009395), hypoxia (MESH:D000860), SLE (MESH:D008180), low back pain (MESH:D017116), pyelonephritis (MESH:D011704), Hemorrhoids (MESH:D006484), neurological deficits (MESH:D009461), skin (MESH:D012871), Parkinson's disease (MESH:D010300), Respiratory diseases (MESH:D012140), thoracic aortic aneurysms (MESH:D017545), abscess (MESH:D000038), inflammatory (MESH:D007249), neurodegenerative conditions (MESH:D019636), disease (MESH:D004194), GERD (MESH:D005764), Aortic aneurysms (MESH:D001014), chronic kidney disease (MESH:D051436), DM (MESH:D009223), asthma (MESH:D001249), (central nervous system) cancer (MESH:D009369), Diabetes mellitus (MESH:D003920), Alzheimer's (MESH:D000544), Heart diseases (MESH:D006331), dementia (MESH:D003704), cardiac remodeling (MESH:D020257), thromboembolic (MESH:D013923), endocarditis (MESH:D004696), rheumatic heart disease (MESH:D012214), Musculoskeletal diseases (MESH:D009140), cataracts (MESH:D002386), bacterial skin diseases (MESH:D017192), chronic diseases (MESH:D002908), brain and nervous system cancers (MESH:D001932), cognitive decline (MESH:D003072), Alopecia areata (MESH:D000506), periodontal disease (MESH:D010510), eye diseases (MESH:D005128), colon/rectal cancers (MESH:D015179), hypertensive heart disease (MESH:D006973), decubitus ulcers (MESH:D003668), gastrointestinal and respiratory diseases (MESH:D012818), AFF (MESH:D001282), benign prostatic hyperplasia (MESH:D011470), urinary tract infections (MESH:D014552), impetigo (MESH:D007169), ischemic stroke (MESH:D002544), peptic ulcer disease (MESH:D010437), Gastrointestinal diseases (MESH:D005767), ischemic heart disease (MESH:D017202), CKD (MESH:D012080)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12917773/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12917773/full.md

## References

34 references — full list in the complete paper: https://tomesphere.com/paper/PMC12917773/full.md

---
Source: https://tomesphere.com/paper/PMC12917773