Using word evolution to predict drug repurposing
Judita Preiss

TL;DR
This paper explores using changes in word contexts over time to identify drugs that could be repurposed for new medical uses.
Contribution
A novel approach using word evolution and time series word embeddings to predict drug repurposing is introduced.
Findings
Word embeddings from MEDLINE publications were used to detect drugs suitable for repurposing.
Classification performance reached 65% with UMLS labels and 81% with SemRep labels.
Different deep learning architectures perform better depending on the annotation method and training data quantity.
Abstract
Traditional literature based discovery is based on connecting knowledge pairs extracted from separate publications via a common mid point to derive previously unseen knowledge pairs. To avoid the over generation often associated with this approach, we explore an alternative method based on word evolution. Word evolution examines the changing contexts of a word to identify changes in its meaning or associations. We investigate the possibility of using changing word contexts to detect drugs suitable for repurposing. Word embeddings, which represent a word’s context, are constructed from chronologically ordered publications in MEDLINE at bi-monthly intervals, yielding a time series of word embeddings for each word. Focusing on clinical drugs only, any drugs repurposed in the final time segment of the time series are annotated as positive examples. The decision regarding the drug’s…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Advanced Text Analysis Techniques · Topic Modeling
