Deep Contextualized Word Embeddings in Transition-Based and Graph-Based   Dependency Parsing -- A Tale of Two Parsers Revisited

Artur Kulmizev; Miryam de Lhoneux; Johannes Gontrum; Elena Fano and; Joakim Nivre

arXiv:1908.07397·cs.CL·August 28, 2019

Deep Contextualized Word Embeddings in Transition-Based and Graph-Based Dependency Parsing -- A Tale of Two Parsers Revisited

Artur Kulmizev, Miryam de Lhoneux, Johannes Gontrum, Elena Fano and, Joakim Nivre

PDF

TL;DR

This paper compares transition-based and graph-based dependency parsers, showing that deep contextualized embeddings help transition-based parsers perform as well as graph-based ones by reducing search errors.

Contribution

It demonstrates that deep contextualized embeddings equalize the performance gap between the two parser types by enhancing local decision-making in transition-based parsers.

Findings

01

Deep contextualized embeddings benefit transition-based parsers more.

02

The two parser types become nearly equivalent in accuracy with embeddings.

03

Error analysis on 13 languages supports the findings.

Abstract

Transition-based and graph-based dependency parsers have previously been shown to have complementary strengths and weaknesses: transition-based parsers exploit rich structural features but suffer from error propagation, while graph-based parsers benefit from global optimization but have restricted feature scope. In this paper, we show that, even though some details of the picture have changed after the switch to neural networks and continuous representations, the basic trade-off between rich features and global optimization remains essentially the same. Moreover, we show that deep contextualized word embeddings, which allow parsers to pack information about global sentence structure into local feature representations, benefit transition-based parsers more than graph-based parsers, making the two approaches virtually equivalent in terms of both accuracy and error profile. We argue that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.