Finding Complex Biological Relationships in Recent PubMed Articles Using Bio-LDA
Huijun Wang, Ying Ding, Jie Tang, Xiao Dong, Bing He, Judy Qiu, David, J. Wild

TL;DR
Bio-LDA is an algorithm that automatically uncovers latent biological topics and relationships from PubMed articles, aiding research in gene-disease-compound interactions and drug discovery.
Contribution
The paper introduces Bio-LDA, a novel method for extracting and analyzing biological relationships from literature using latent topic modeling.
Findings
Successfully identified biologically relevant associations
Demonstrated utility in drug repurposing and target discovery
Enabled new insights through case studies
Abstract
The overwhelming amount of available scholarly literature in the life sciences poses significant challenges to scientists wishing to keep up with important developments related to their research, but also provides a useful resource for the discovery of recent information concerning genes, diseases, compounds and the interactions between them. In this paper, we describe an algorithm called Bio-LDA that uses extracted biological terminology to automatically identify latent topics, and provides a variety of measures to uncover putative relations among topics and bio-terms. Relationships identified using those approaches are combined with existing data in life science datasets to provide additional insight. Three case studies demonstrate the utility of the Bio-LDA model, including association predication, association search and connectivity map generation. This combined approach offers new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
