Development and validation of identification algorithms for five autoimmune diseases using electronic health records: a retrospective cohort study in China
Junting Yang, Yunxiao Wu, Jinxin Guo, Xiaoxuan Wang, Xin Gao, Xin Chen, Mengdi Zhang, Jin Yang, Zuojing Liu, Yan Liu, Zhike Liu, Siyan Zhan

TL;DR
This study developed and validated algorithms to identify five autoimmune diseases using electronic health records in China, showing good accuracy for most conditions.
Contribution
The study presents the first validated algorithms for identifying autoimmune diseases using EHR data in China.
Findings
The algorithm for Hashimoto’s thyroiditis achieved high accuracy with 97.44% sensitivity and 98.28% PPV.
Combining data sources improved performance for IBD and ITP, achieving PPV above 70%.
The T1D algorithm using admission and outpatient records had 84.09% sensitivity and 74.00% PPV.
Abstract
This study aims to assess the identification algorithms for five autoimmune diseases—Hashimoto’s thyroiditis, inflammatory bowel disease (IBD), primary immune thrombocytopenia (ITP), rheumatoid arthritis (RA), and type 1 diabetes (T1D)—using the Yinzhou Regional Health Information Platform (YRHIP) in China. Diagnostic data was extracted from YRHIP’s population registry (2010-2021), combining ICD-10 codes and Chinese medical terminology from outpatient, inpatient, and discharge records. Algorithms were validated through chart reviews, adhering to global clinical guidelines. Cases were adjudicated using electronic case report forms. We evaluated algorithm performance based on sensitivity and positive predictive value (PPV), with a 70% PPV threshold for optimization. Among all reviewed cases, we identified 136 cases for Hashimoto’s thyroiditis, 65 for IBD, 76 for ITP, 130 for RA, and 43…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDiabetes and associated disorders · Hepatitis C virus research · Immunodeficiency and Autoimmune Disorders
