GRAPHENE: A Precise Biomedical Literature Retrieval Engine with Graph   Augmented Deep Learning and External Knowledge Empowerment

Sendong Zhao; Chang Su; Andrea Sboner; Fei Wang

arXiv:1911.00760·cs.IR·November 22, 2019

GRAPHENE: A Precise Biomedical Literature Retrieval Engine with Graph Augmented Deep Learning and External Knowledge Empowerment

Sendong Zhao, Chang Su, Andrea Sboner, Fei Wang

PDF

TL;DR

GRAPHENE is a deep learning framework that enhances biomedical literature retrieval by integrating graph-based external knowledge, query expansion, and learning to rank, significantly improving retrieval accuracy in precision medicine applications.

Contribution

The paper introduces a novel deep learning framework combining graph-augmented document representation, query expansion, and ranking methods for precise biomedical literature retrieval.

Findings

01

Outperforms existing retrieval methods on TREC Precision Medicine data

02

Effectively captures global biomedical concepts through external knowledge integration

03

Improves query understanding with abbreviation and synonym expansion

Abstract

Effective biomedical literature retrieval (BLR) plays a central role in precision medicine informatics. In this paper, we propose GRAPHENE, which is a deep learning based framework for precise BLR. GRAPHENE consists of three main different modules 1) graph-augmented document representation learning; 2) query expansion and representation learning and 3) learning to rank biomedical articles. The graph-augmented document representation learning module constructs a document-concept graph containing biomedical concept nodes and document nodes so that global biomedical related concept from external knowledge source can be captured, which is further connected to a BiLSTM so both local and global topics can be explored. Query expansion and representation learning module expands the query with abbreviations and different names, and then builds a CNN-based model to convolve the expanded query and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Bidirectional LSTM