Fill it up: Exploiting partial dependency annotations in a minimum spanning tree parser
Liang Sun, Jason Mielens, Jason Baldridge

TL;DR
This paper demonstrates that incorporating small amounts of partial dependency annotations into an unsupervised dependency parser significantly improves parsing accuracy, offering a practical middle ground between zero and full supervision.
Contribution
The authors adapt the ConvexMST parser to learn from partial dependency annotations, achieving notable improvements with minimal annotation effort.
Findings
7% improvement in English unlabeled dependency scores
17% improvement in Spanish unlabeled dependency scores
Less than 24 hours of annotation yields significant gains
Abstract
Unsupervised models of dependency parsing typically require large amounts of clean, unlabeled data plus gold-standard part-of-speech tags. Adding indirect supervision (e.g. language universals and rules) can help, but we show that obtaining small amounts of direct supervision - here, partial dependency annotations - provides a strong balance between zero and full supervision. We adapt the unsupervised ConvexMST dependency parser to learn from partial dependencies expressed in the Graph Fragment Language. With less than 24 hours of total annotation, we obtain 7% and 17% absolute improvement in unlabeled dependency scores for English and Spanish, respectively, compared to the same parser using only universal grammar constraints.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Software Engineering Research
