Hierarchical Pretraining for Biomedical Term Embeddings
Bryan Cai, Sihang Zeng, Yucong Lin, Zheng Yuan, Doudou Zhou, and Lu Tian

TL;DR
This paper introduces HiPrBERT, a hierarchical pretraining model for biomedical term embeddings that leverages hierarchical structures to produce more informative and nuanced representations for clinical NLP tasks.
Contribution
We propose a novel hierarchical pretraining approach, HiPrBERT, which incorporates biomedical hierarchies into language model training to improve semantic embeddings.
Findings
HiPrBERT captures hierarchical relationships effectively.
Embeddings from HiPrBERT outperform standard models in relatedness tasks.
Hierarchical information enhances biomedical term representations.
Abstract
Electronic health records (EHR) contain narrative notes that provide extensive details on the medical condition and management of patients. Natural language processing (NLP) of clinical notes can use observed frequencies of clinical terms as predictive features for downstream applications such as clinical decision making and patient trajectory prediction. However, due to the vast number of highly similar and related clinical concepts, a more effective modeling strategy is to represent clinical terms as semantic embeddings via representation learning and use the low dimensional embeddings as feature vectors for predictive modeling. To achieve efficient representation, fine-tuning pretrained language models with biomedical knowledge graphs may generate better embeddings for biomedical terms than those from standard language models alone. These embeddings can effectively discriminate…
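A minimal sketch of the idea described in the abstract, that clinical terms are represented as low-dimensional vectors and compared by similarity. The vectors below are hypothetical values made up for illustration. They are not produced by HiPrBERT, and the helper names `cosine_similarity` and `most_related` are assumptions introduced here, not part of the paper.

```python
import numpy as np


def cosine_similarity(u, v):
    """Cosine of the angle between two embedding vectors."""
    u = np.asarray(u, dtype=float)
    v = np.asarray(v, dtype=float)
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))


# Hypothetical 4-dimensional embeddings, invented for illustration only.
# A real model would produce vectors with hundreds of dimensions.
toy_embeddings = {
    "myocardial infarction": [0.9, 0.1, 0.0, 0.2],
    "heart attack": [0.8, 0.2, 0.1, 0.2],
    "femur fracture": [0.0, 0.1, 0.9, 0.3],
}


def most_related(term, embeddings):
    """Return the other term whose embedding is closest by cosine similarity."""
    query = embeddings[term]
    scores = {
        other: cosine_similarity(query, vec)
        for other, vec in embeddings.items()
        if other != term
    }
    return max(scores, key=scores.get)


if __name__ == "__main__":
    print(most_related("myocardial infarction", toy_embeddings))
```

In a relatedness task, scores of this kind would be compared against human or ontology-derived judgments of which term pairs are related. The same vectors could also be passed as features to a downstream predictive model.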
Taxonomy
Topics: Machine Learning in Healthcare · Topic Modeling · Biomedical Text Mining and Ontologies
