An Automated Method to Enrich Consumer Health Vocabularies Using GloVe Word Embeddings and An Auxiliary Lexical Resource
Mohammed Ibrahim, Susan Gauch, Omar Salman, Mohammed Alqahatani

TL;DR
This paper introduces an automatic method leveraging GloVe embeddings and WordNet to enrich consumer health vocabularies, significantly improving the identification of layman terms and reducing manual effort in vocabulary expansion.
Contribution
It presents a novel fully automated approach combining GloVe embeddings with lexical resources to enhance consumer health vocabularies across domains.
Findings
GloVe achieved an F-score of 48.44% in identifying new layman terms.
Enhanced GloVe outperformed basic GloVe with a 25% relative F-score improvement.
The approach showed statistically significant results with P<.001.
Abstract
Background: Clear language makes communication easier between any two parties. A layman may have difficulty communicating with a professional due to not understanding the specialized terms common to the domain. In healthcare, it is rare to find a layman knowledgeable in medical terminology which can lead to poor understanding of their condition and/or treatment. To bridge this gap, several professional vocabularies and ontologies have been created to map laymen medical terms to professional medical terms and vice versa. Objective: Many of the presented vocabularies are built manually or semi-automatically requiring large investments of time and human effort and consequently the slow growth of these vocabularies. In this paper, we present an automatic method to enrich laymen's vocabularies that has the benefit of being able to be applied to vocabularies in any domain. Methods: Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsGloVe Embeddings
