GraSAME: Injecting Token-Level Structural Information to Pretrained   Language Models via Graph-guided Self-Attention Mechanism

Shuzhou Yuan; Michael F\"arber

arXiv:2404.06911·cs.CL·April 11, 2024·1 cites

GraSAME: Injecting Token-Level Structural Information to Pretrained Language Models via Graph-guided Self-Attention Mechanism

Shuzhou Yuan, Michael F\"arber

PDF

Open Access 1 Video

TL;DR

GraSAME introduces a novel, efficient self-attention mechanism that seamlessly integrates token-level graph structural information into pretrained language models, improving graph-to-text generation without extra pre-training or increased complexity.

Contribution

The paper presents GraSAME, a lightweight, end-to-end module that effectively bridges graph and text modalities within PLMs using a novel graph-guided self-attention mechanism.

Findings

01

Outperforms baseline models on WebNLG dataset.

02

Achieves results comparable to state-of-the-art models.

03

Reduces trainable parameters by over 100 million.

Abstract

Pretrained Language Models (PLMs) benefit from external knowledge stored in graph structures for various downstream tasks. However, bridging the modality gap between graph structures and text remains a significant challenge. Traditional methods like linearizing graphs for PLMs lose vital graph connectivity, whereas Graph Neural Networks (GNNs) require cumbersome processes for integration into PLMs. In this work, we propose a novel graph-guided self-attention mechanism, GraSAME. GraSAME seamlessly incorporates token-level structural information into PLMs without necessitating additional alignment or concatenation efforts. As an end-to-end, lightweight multimodal module, GraSAME follows a multi-task learning strategy and effectively bridges the gap between graph and textual modalities, facilitating dynamic interactions between GNNs and PLMs. Our experiments on the graph-to-text generation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

GraSAME: Injecting Token-Level Structural Information to Pretrained Language Models via Graph-guided Self-Attention Mechanism· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques