An Empirical Comparison of Probability Models for Dependency Grammar

Jason Eisner (Univ. of Pennsylvania)

arXiv:cmp-lg/9706004·cmp-lg·February 3, 2008·54 cites

An Empirical Comparison of Probability Models for Dependency Grammar

Jason Eisner (Univ. of Pennsylvania)

PDF

Open Access

TL;DR

This paper empirically compares different probability models for dependency grammar parsing, demonstrating improved accuracy with larger training data and exploring variants that outperform previous models in attachment accuracy.

Contribution

It provides a detailed experimental comparison of dependency grammar models with larger training data, showing improved parsing accuracy and analyzing model differences.

Findings

01

Nearly half the sentences parsed with no misattachments

02

Two-thirds of sentences with at most one misattachment

03

Best model achieves 93% attachment accuracy with known tags

Abstract

This technical report is an appendix to Eisner (1996): it gives superior experimental results that were reported only in the talk version of that paper. Eisner (1996) trained three probability models on a small set of about 4,000 conjunction-free, dependency-grammar parses derived from the Wall Street Journal section of the Penn Treebank, and then evaluated the models on a held-out test set, using a novel O(n^3) parsing algorithm. The present paper describes some details of the experiments and repeats them with a larger training set of 25,000 sentences. As reported at the talk, the more extensive training yields greatly improved performance. Nearly half the sentences are parsed with no misattachments; two-thirds are parsed with at most one misattachment. Of the models described in the original written paper, the best score is still obtained with the generative (top-down) "model C."…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Algorithms and Data Compression