Integrating a Heterogeneous Graph with Entity-aware Self-attention using Relative Position Labels for Reading Comprehension Model
Shima Foolad, Kourosh Kiani

TL;DR
This paper proposes a novel transformer attention pattern that integrates reasoning knowledge from a heterogeneous graph directly into the model, enhancing reading comprehension performance without external knowledge sources.
Contribution
It introduces a new attention mechanism combining global-local, graph, and relationship-aware attention with relative position labels, improving reasoning capabilities in transformers.
Findings
Outperforms LUKE-Graph and LUKE baseline models on ReCoRD and WikiHop datasets.
Enhances reasoning abilities in reading comprehension tasks.
Demonstrates effectiveness without external knowledge sources.
Abstract
Despite the significant progress made by transformer models in machine reading comprehension tasks, they still fall short in handling complex reasoning tasks due to the absence of explicit knowledge in the input sequence. To address this limitation, many recent works have proposed injecting external knowledge into the model. However, selecting relevant external knowledge, ensuring its availability, and requiring additional processing steps remain challenging. In this paper, we introduce a novel attention pattern that integrates reasoning knowledge derived from a heterogeneous graph into the transformer architecture without relying on external knowledge. The proposed attention pattern comprises three key elements: global-local attention for word tokens, graph attention for entity tokens that exhibit strong attention towards tokens connected in the graph as opposed to those unconnected,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Topic Modeling · Advanced Graph Neural Networks
MethodsGlobal-Local Attention
