Developing a cardiovascular disease risk factor annotated corpus of Chinese electronic medical records
Jia Su, Bin He, Yi Guan, Jingchi Jiang, Jinfeng Yang

TL;DR
This study created the first annotated Chinese electronic medical record corpus for cardiovascular disease risk factors, enabling future research in risk monitoring and information extraction.
Contribution
It introduces a novel, clinician-guided annotation method and a high-reliability corpus for CVD risk factors in Chinese medical records.
Findings
Developed a corpus from 600 patient records with high inter-annotator agreement.
Proposed guidelines for capturing CVD risk factors in Chinese EMRs.
Enabled future risk factor monitoring and CVD progression studies.
Abstract
Cardiovascular disease (CVD) has become the leading cause of death in China, and most of the cases can be prevented by controlling risk factors. The goal of this study was to build a corpus of CVD risk factor annotations based on Chinese electronic medical records (CEMRs). This corpus is intended to be used to develop a risk factor information extraction system that, in turn, can be applied as a foundation for the further study of the progress of risk factors and CVD. We designed a light annotation task to capture CVD risk factors with indicators, temporal attributes and assertions that were explicitly or implicitly displayed in the records. The task included: 1) preparing data; 2) creating guidelines for capturing annotations (these were created with the help of clinicians); 3) proposing an annotation method including building the guidelines draft, training the annotators and updating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Biomedical Text Mining and Ontologies · Cardiovascular Health and Risk Factors
