Probabilistic Tagging with Feature Structures

Andre Kempe (University of Stuttgart)

arXiv:cmp-lg/9410027·cmp-lg·February 3, 2008·3 cites

Probabilistic Tagging with Feature Structures

Andre Kempe (University of Stuttgart)

PDF

Open Access

TL;DR

This paper introduces a probabilistic tagging method using feature-structured tags within a hidden Markov model, which is especially effective for morphologically rich languages with limited training data.

Contribution

It presents a novel approach that leverages feature structures for tagging, improving performance in scenarios with small corpora and large tag sets.

Findings

01

Effective for morphologically rich languages

02

Performs well with limited training data

03

Utilizes feature-value-pairs for contextual probabilities

Abstract

The described tagger is based on a hidden Markov model and uses tags composed of features such as part-of-speech, gender, etc. The contextual probability of a tag (state transition probability) is deduced from the contextual probabilities of its feature-value-pairs. This approach is advantageous when the available training corpus is small and the tag set large, which can be the case with morphologically rich languages.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques