TL;DR
This paper introduces a novel unsupervised representation learning method for electronic medical records that combines structured and unstructured data using a tree-based model to improve patient similarity assessment and mortality prediction.
Contribution
It presents a new data representation approach that models temporal relations in EMRs with a tree structure and relabeling methods, enhancing patient similarity measurement.
Findings
Lower mean squared error in mortality prediction
Higher precision in patient similarity tasks
Improved NDCG scores over baseline methods
Abstract
Patient similarity assessment, which identifies patients similar to a given patient, can help improve medical care. The assessment can be performed using Electronic Medical Records (EMRs). Patient similarity measurement requires converting heterogeneous EMRs into comparable formats to calculate their distance. While versatile document representation learning methods have been developed in recent years, it is still unclear how complex EMR data should be processed to create the most useful patient representations. This study presents a new data representation method for EMRs that takes the information in clinical narratives into account. To address the limitations of previous approaches in handling complex parts of EMR data, an unsupervised method is proposed for building a patient representation, which integrates unstructured data with structured data extracted from patients' EMRs. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
