Integration of Domain Knowledge using Medical Knowledge Graph Deep Learning for Cancer Phenotyping
Mohammed Alawad, Shang Gao, Mayanka Chandra Shekar, S.M.Shamimul, Hasan, J. Blair Christian, Xiao-Cheng Wu, Eric B. Durbin, Jennifer Doherty,, Antoinette Stroup, Linda Coyle, Lynne Penberthy, Georgia Tourassi

TL;DR
This paper introduces a method to incorporate medical knowledge graphs into word embeddings to improve NLP tasks in cancer pathology report analysis, leading to significant performance gains.
Contribution
It presents a novel approach to integrate domain-specific medical knowledge into word embeddings for enhanced cancer phenotyping from pathology reports.
Findings
Knowledge graph-informed embeddings outperform standard embeddings.
Improved F1 scores by 4.97% micro and 22.5% macro.
Effective extraction of six cancer characteristics.
Abstract
A key component of deep learning (DL) for natural language processing (NLP) is word embeddings. Word embeddings that effectively capture the meaning and context of the word that they represent can significantly improve the performance of downstream DL models for various NLP tasks. Many existing word embeddings techniques capture the context of words based on word co-occurrence in documents and text; however, they often cannot capture broader domain-specific relationships between concepts that may be crucial for the NLP task at hand. In this paper, we propose a method to integrate external knowledge from medical terminology ontologies into the context captured by word embeddings. Specifically, we use a medical knowledge graph, such as the unified medical language system (UMLS), to find connections between clinical terms in cancer pathology reports. This approach aims to minimize the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Advanced Graph Neural Networks
