Patient representation learning and interpretable evaluation using clinical notes
Madhumita Sushil, Simon \v{S}uster, Kim Luyckx, Walter, Daelemans

TL;DR
This paper investigates learning dense, task-independent patient representations from clinical notes using autoencoders and paragraph vectors, evaluating their transferability and interpretability across multiple clinical prediction tasks.
Contribution
It introduces a method for learning transferable patient representations from clinical notes and proposes techniques for their interpretability, demonstrating advantages over traditional sparse representations.
Findings
Dense representations outperform sparse ones with limited data.
Medical concept-based features do not improve classification performance.
Novel interpretability techniques identify influential features in patient representations.
Abstract
We have three contributions in this work: 1. We explore the utility of a stacked denoising autoencoder and a paragraph vector model to learn task-independent dense patient representations directly from clinical notes. To analyze if these representations are transferable across tasks, we evaluate them in multiple supervised setups to predict patient mortality, primary diagnostic and procedural category, and gender. We compare their performance with sparse representations obtained from a bag-of-words model. We observe that the learned generalized representations significantly outperform the sparse representations when we have few positive instances to learn from, and there is an absence of strong lexical features. 2. We compare the model performance of the feature set constructed from a bag of words to that obtained from medical concepts. In the latter case, concepts represent problems,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDenoising Autoencoder · Solana Customer Service Number +1-833-534-1729
