An Empirical Comparison of Probability Models for Dependency Grammar
Jason Eisner (Univ. of Pennsylvania)

TL;DR
This paper empirically compares different probability models for dependency grammar parsing, demonstrating improved accuracy with larger training data and exploring variants that outperform previous models in attachment accuracy.
Contribution
It provides a detailed experimental comparison of dependency grammar models with larger training data, showing improved parsing accuracy and analyzing model differences.
Findings
Nearly half the sentences parsed with no misattachments
Two-thirds of sentences with at most one misattachment
Best model achieves 93% attachment accuracy with known tags
Abstract
This technical report is an appendix to Eisner (1996): it gives superior experimental results that were reported only in the talk version of that paper. Eisner (1996) trained three probability models on a small set of about 4,000 conjunction-free, dependency-grammar parses derived from the Wall Street Journal section of the Penn Treebank, and then evaluated the models on a held-out test set, using a novel O(n^3) parsing algorithm. The present paper describes some details of the experiments and repeats them with a larger training set of 25,000 sentences. As reported at the talk, the more extensive training yields greatly improved performance. Nearly half the sentences are parsed with no misattachments; two-thirds are parsed with at most one misattachment. Of the models described in the original written paper, the best score is still obtained with the generative (top-down) "model C."…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Algorithms and Data Compression
