Snomed2Vec: Random Walk and Poincar\'e Embeddings of a Clinical Knowledge Base for Healthcare Analytics
Khushbu Agarwal, Tome Eftimov, Raghavendra Addanki, Sutanay Choudhury,, Suzanne Tamang, and Robert Rallo

TL;DR
This paper introduces a novel approach to embedding medical concepts from SNOMED-CT using graph-based methods, significantly improving biomedical task performance over existing embeddings.
Contribution
It applies graph-based representation learning, including Poincaré embeddings, to SNOMED-CT, demonstrating superior performance in healthcare-related tasks.
Findings
Embeddings from SNOMED-CT outperform previous methods by 5-6x in concept similarity.
Significant 6-20% improvement in patient diagnosis accuracy.
Graph-based embeddings enhance biomedical task performance.
Abstract
Representation learning methods that transform encoded data (e.g., diagnosis and drug codes) into continuous vector spaces (i.e., vector embeddings) are critical for the application of deep learning in healthcare. Initial work in this area explored the use of variants of the word2vec algorithm to learn embeddings for medical concepts from electronic health records or medical claims datasets. We propose learning embeddings for medical concepts by using graph-based representation learning methods on SNOMED-CT, a widely popular knowledge graph in the healthcare domain with numerous operational and research applications. Current work presents an empirical analysis of various embedding methods, including the evaluation of their performance on multiple tasks of biomedical relevance (node classification, link prediction, and patient state prediction). Our results show that concept embeddings…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Topic Modeling · Artificial Intelligence in Healthcare
