An efficient representation of chronological events in medical texts
Andrey Kormilitzin, Nemanja Vaci, Qiang Liu, Hao Ni, Goran Nenadic,, Alejo Nevado-Holgado

TL;DR
This paper introduces a novel path signature framework for capturing sequential information in clinical notes, significantly improving survival risk prediction in Alzheimer's disease patients from electronic health records.
Contribution
The paper presents a systematic, non-parametric hierarchical method for representing chronological events in medical texts, validated on large UK mental health EHR data.
Findings
15.4% increase in risk prediction AUC at 20 months
Outperformed baseline models by over 13%
Validated on large-scale UK mental health data
Abstract
In this work we addressed the problem of capturing sequential information contained in longitudinal electronic health records (EHRs). Clinical notes, which is a particular type of EHR data, are a rich source of information and practitioners often develop clever solutions how to maximise the sequential information contained in free-texts. We proposed a systematic methodology for learning from chronological events available in clinical notes. The proposed methodological {\it path signature} framework creates a non-parametric hierarchical representation of sequential events of any type and can be used as features for downstream statistical learning tasks. The methodology was developed and externally validated using the largest in the UK secondary care mental health EHR data on a specific task of predicting survival risk of patients diagnosed with Alzheimer's disease. The signature-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
