A Hybrid Approach to Measure Semantic Relatedness in Biomedical Concepts
Katikapalli Subramanyam Kalyan, Sivanesan Sangeetha

TL;DR
This study demonstrates that a hybrid approach combining Sentence BERT and retrofitting with ontology knowledge effectively measures semantic relatedness between biomedical concepts, outperforming traditional models.
Contribution
The paper introduces a hybrid method that integrates Sentence BERT with ontology-based retrofitting to improve biomedical concept relatedness measurement.
Findings
Sentence BERT models outperform BERT models in relatedness tasks.
Retrofitting with UMLS concepts enhances vector quality.
The hybrid approach achieves top results on multiple datasets.
Abstract
Objective: This work aimed to demonstrate the effectiveness of a hybrid approach based on Sentence BERT model and retrofitting algorithm to compute relatedness between any two biomedical concepts. Materials and Methods: We generated concept vectors by encoding concept preferred terms using ELMo, BERT, and Sentence BERT models. We used BioELMo and Clinical ELMo. We used Ontology Knowledge Free (OKF) models like PubMedBERT, BioBERT, BioClinicalBERT, and Ontology Knowledge Injected (OKI) models like SapBERT, CoderBERT, KbBERT, and UmlsBERT. We trained all the BERT models using Siamese network on SNLI and STSb datasets to allow the models to learn more semantic information at the phrase or sentence level so that they can represent multi-word concepts better. Finally, to inject ontology relationship knowledge into concept vectors, we used retrofitting algorithm and concepts from various UMLS…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Natural Language Processing Techniques
MethodsLinear Layer · Sigmoid Activation · Tanh Activation · Long Short-Term Memory · WordPiece · Residual Connection · Layer Normalization · Dense Connections · Bidirectional LSTM · Attention Dropout
