RGAT: A Deeper Look into Syntactic Dependency Information for Coreference Resolution
Yuan Meng, Xuhao Pan, Jun Chang, Yue Wang

TL;DR
This paper introduces RGAT, a novel syntactic dependency graph attention network integrated with BERT, which enhances coreference resolution performance by leveraging syntactic information without full BERT fine-tuning.
Contribution
The paper proposes RGAT to better incorporate syntactic dependency information into coreference resolution, improving performance over previous models without fine-tuning BERT.
Findings
F1-score improved from 80.3% to 82.5% on GAP dataset.
Model outperforms previous state-of-the-art without fine-tuning BERT.
Syntactic embeddings enhance coreference resolution accuracy.
Abstract
Although syntactic information is beneficial for many NLP tasks, combining it with contextual information between words to solve the coreference resolution problem needs to be further explored. In this paper, we propose an end-to-end parser that combines pre-trained BERT with a Syntactic Relation Graph Attention Network (RGAT) to take a deeper look into the role of syntactic dependency information for the coreference resolution task. In particular, the RGAT model is first proposed, then used to understand the syntactic dependency graph and learn better task-specific syntactic embeddings. An integrated architecture incorporating BERT embeddings and syntactic embeddings is constructed to generate blending representations for the downstream task. Our experiments on a public Gendered Ambiguous Pronouns (GAP) dataset show that with the supervision learning of the syntactic dependency graph…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Layer Normalization · Linear Layer · Dense Connections · Attention Dropout · Residual Connection · Adam · Weight Decay
