Precision-biased Parsing and High-Quality Parse Selection

Yoav Goldberg; Michael Elhadad

arXiv:1205.4387·cs.CL·May 22, 2012·2 cites

Precision-biased Parsing and High-Quality Parse Selection

Yoav Goldberg, Michael Elhadad

PDF

Open Access

TL;DR

This paper introduces precision-biased dependency parsing that prioritizes accuracy over coverage by abstaining from uncertain decisions, enabling high-quality parse selection and confidence estimation without ensembles.

Contribution

It proposes a novel precision-biased parsing framework, including ensemble and direct risk prediction methods, to improve parse quality and confidence estimation in dependency parsing.

Findings

01

Achieves over 96% accuracy on 84% of tokens

02

Selects high-quality parse subsets with 97% accuracy on 33% of trees

03

Demonstrates effective confidence estimation without ensembles

Abstract

We introduce precision-biased parsing: a parsing task which favors precision over recall by allowing the parser to abstain from decisions deemed uncertain. We focus on dependency-parsing and present an ensemble method which is capable of assigning parents to 84% of the text tokens while being over 96% accurate on these tokens. We use the precision-biased parsing task to solve the related high-quality parse-selection task: finding a subset of high-quality (accurate) trees in a large collection of parsed text. We present a method for choosing over a third of the input trees while keeping unlabeled dependency parsing accuracy of 97% on these trees. We also present a method which is not based on an ensemble but rather on directly predicting the risk associated with individual parser decisions. In addition to its efficiency, this method demonstrates that a parsing system can provide…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Software Engineering Research