Multimodal Graph-based Transformer Framework for Biomedical Relation Extraction
Sriram Pingali, Shweta Yadav, Pratik Dutta, and Sriparna Saha

TL;DR
This paper presents a multimodal graph-based Transformer framework that integrates textual and molecular structure data to improve biomedical relation extraction, specifically protein-protein interactions.
Contribution
It introduces a generalized graph-based multi-modal learning mechanism using GraphBERT to incorporate domain-specific molecular information alongside text.
Findings
Enhanced performance on Protein-Protein Interaction task
Effective integration of molecular structure with textual data
Demonstrated benefits of multi-modal cues in biomedical relation extraction
Abstract
The recent advancement of pre-trained Transformer models has propelled the development of effective text mining models across various biomedical tasks. However, these models are primarily learned on the textual data and often lack the domain knowledge of the entities to capture the context beyond the sentence. In this study, we introduced a novel framework that enables the model to learn multi-omnics biological information about entities (proteins) with the help of additional multi-modal cues like molecular structure. Towards this, rather developing modality-specific architectures, we devise a generalized and optimized graph based multi-modal learning mechanism that utilizes the GraphBERT model to encode the textual and molecular structure information and exploit the underlying features of various modalities to enable end-to-end learning. We evaluated our proposed method on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Machine Learning in Bioinformatics
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Adam · Byte Pair Encoding · Layer Normalization · Dropout · Label Smoothing
