Three New Probabilistic Models for Dependency Parsing: An Exploration

Jason Eisner (Univ. of Pennsylvania)

arXiv:cmp-lg/9706003·cmp-lg·February 6, 2008·166 cites

Three New Probabilistic Models for Dependency Parsing: An Exploration

Jason Eisner (Univ. of Pennsylvania)

PDF

Open Access

TL;DR

This paper introduces a new cubic-time dependency parsing algorithm and explores three probabilistic models, with the generative model showing superior performance on Wall Street Journal data.

Contribution

It presents a novel O(n^3) parsing algorithm and three contrasting stochastic models for dependency parsing, highlighting the effectiveness of the generative approach.

Findings

01

The generative model outperforms the others in parsing accuracy.

02

All models perform similarly in part-of-speech tagging.

03

Preliminary results are based on Wall Street Journal data.

Abstract

After presenting a novel O(n^3) parsing algorithm for dependency grammar, we develop three contrasting ways to stochasticize it. We propose (a) a lexical affinity model where words struggle to modify each other, (b) a sense tagging model where words fluctuate randomly in their selectional preferences, and (c) a generative model where the speaker fleshes out each word's syntactic and conceptual structure without regard to the implications for the hearer. We also give preliminary empirical results from evaluating the three models' parsing performance on annotated Wall Street Journal training text (derived from the Penn Treebank). In these results, the generative (i.e., top-down) model performs significantly better than the others, and does about equally well at assigning part-of-speech tags.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Authorship Attribution and Profiling