Graph Transformer Networks with Syntactic and Semantic Structures for Event Argument Extraction

Amir Pouran Ben Veyseh; Tuan Ngo Nguyen; Thien Huu Nguyen

arXiv:2010.13391·cs.CL·October 27, 2020

TL;DR

This paper introduces a novel Graph Transformer Network model that leverages both syntactic and semantic sentence structures, along with an information bottleneck bias, to improve event argument extraction accuracy, achieving state-of-the-art results.

Contribution

The paper presents a new model that combines syntactic and semantic sentence structures through Graph Transformer Networks and adds an information bottleneck bias to improve event argument extraction (EAE) performance.

Findings

01. Achieved state-of-the-art results on standard datasets.

02. Demonstrated the effectiveness of combining syntactic and semantic information.

03. Showed that the information bottleneck improves model generalization.

Abstract

The goal of Event Argument Extraction (EAE) is to find the role of each entity mention for a given event trigger word. Previous work has shown that the syntactic structures of sentences are helpful for deep learning models for EAE. However, a major problem in such prior work is that it fails to exploit the semantic structures of sentences to induce effective representations for EAE. Consequently, in this work, we propose a novel model for EAE that exploits both the syntactic and semantic structures of sentences with Graph Transformer Networks (GTNs) to learn more effective sentence structures for EAE. In addition, we introduce a novel inductive bias based on the information bottleneck to improve the generalization of EAE models. Extensive experiments are performed to demonstrate the benefits of the proposed model, leading to state-of-the-art performance for…
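
The task definition in the abstract (one role per entity mention, given a trigger word) can be pictured as a set of classification instances, one per (trigger, entity mention) pair. The following is only a minimal sketch of that framing, not the authors' code: the `EAEInstance` class, the `build_instances` helper, the toy sentence, and the role labels `Attacker` and `Target` are all illustrative assumptions.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class EAEInstance:
    """Hypothetical container for one (trigger, entity mention) pair."""
    tokens: List[str]
    trigger_index: int  # position of the event trigger word
    entity_index: int   # position of the candidate entity mention


def build_instances(tokens: List[str], trigger_index: int,
                    entity_indices: List[int]) -> List[EAEInstance]:
    """Create one classification instance per entity mention."""
    return [EAEInstance(tokens, trigger_index, e) for e in entity_indices]


# Toy sentence (assumed for illustration, not taken from the paper).
tokens = "The soldiers attacked the village yesterday".split()
instances = build_instances(tokens, trigger_index=2, entity_indices=[1, 4])

# Illustrative gold roles; an EAE model would predict these labels.
gold_roles = ["Attacker", "Target"]

for inst, role in zip(instances, gold_roles):
    entity = tokens[inst.entity_index]
    trigger = tokens[inst.trigger_index]
    print(f"{entity} -> {role} of '{trigger}'")
```

An EAE model would take each instance as input and output a role label (or a "no role" label) for it; the sentence structures described in the abstract are what the model uses to represent the sentence before making that prediction.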

Taxonomy

Methods: Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Multi-Head Attention · Layer Normalization · Byte Pair Encoding · Softmax · Adam · Dense Connections