Precision-biased Parsing and High-Quality Parse Selection
Yoav Goldberg, Michael Elhadad

TL;DR
This paper introduces precision-biased dependency parsing that prioritizes accuracy over coverage by abstaining from uncertain decisions, enabling high-quality parse selection and confidence estimation without ensembles.
Contribution
It proposes a novel precision-biased parsing framework, including ensemble and direct risk prediction methods, to improve parse quality and confidence estimation in dependency parsing.
Findings
Achieves over 96% accuracy on 84% of tokens
Selects high-quality parse subsets with 97% accuracy on 33% of trees
Demonstrates effective confidence estimation without ensembles
Abstract
We introduce precision-biased parsing: a parsing task which favors precision over recall by allowing the parser to abstain from decisions deemed uncertain. We focus on dependency-parsing and present an ensemble method which is capable of assigning parents to 84% of the text tokens while being over 96% accurate on these tokens. We use the precision-biased parsing task to solve the related high-quality parse-selection task: finding a subset of high-quality (accurate) trees in a large collection of parsed text. We present a method for choosing over a third of the input trees while keeping unlabeled dependency parsing accuracy of 97% on these trees. We also present a method which is not based on an ensemble but rather on directly predicting the risk associated with individual parser decisions. In addition to its efficiency, this method demonstrates that a parsing system can provide…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Software Engineering Research
