Fill it up: Exploiting partial dependency annotations in a minimum   spanning tree parser

Liang Sun; Jason Mielens; Jason Baldridge

arXiv:1611.08765·cs.CL·November 29, 2016

Fill it up: Exploiting partial dependency annotations in a minimum spanning tree parser

Liang Sun, Jason Mielens, Jason Baldridge

PDF

Open Access

TL;DR

This paper demonstrates that incorporating small amounts of partial dependency annotations into an unsupervised dependency parser significantly improves parsing accuracy, offering a practical middle ground between zero and full supervision.

Contribution

The authors adapt the ConvexMST parser to learn from partial dependency annotations, achieving notable improvements with minimal annotation effort.

Findings

01

7% improvement in English unlabeled dependency scores

02

17% improvement in Spanish unlabeled dependency scores

03

Less than 24 hours of annotation yields significant gains

Abstract

Unsupervised models of dependency parsing typically require large amounts of clean, unlabeled data plus gold-standard part-of-speech tags. Adding indirect supervision (e.g. language universals and rules) can help, but we show that obtaining small amounts of direct supervision - here, partial dependency annotations - provides a strong balance between zero and full supervision. We adapt the unsupervised ConvexMST dependency parser to learn from partial dependencies expressed in the Graph Fragment Language. With less than 24 hours of total annotation, we obtain 7% and 17% absolute improvement in unlabeled dependency scores for English and Spanish, respectively, compared to the same parser using only universal grammar constraints.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Software Engineering Research