Tagging French -- comparing a statistical and a constraint-based method

Jean-Pierre Chanod; Pasi Tapanainen (Rank Xerox Research Centre,; Grenoble Laboratory)

arXiv:cmp-lg/9503003·cmp-lg·August 31, 2016·6 cites

Tagging French -- comparing a statistical and a constraint-based method

Jean-Pierre Chanod, Pasi Tapanainen (Rank Xerox Research Centre,, Grenoble Laboratory)

PDF

Open Access

TL;DR

This paper compares statistical and constraint-based part-of-speech tagging methods for French, showing that the constraint-based approach outperforms the statistical one within a similar development time frame.

Contribution

It provides a comparative analysis of two different tagging approaches for French, highlighting the efficiency of constraint-based methods with limited development time.

Findings

01

Constraint-based tagger outperforms statistical tagger within same development time

02

Statistical method achieves accuracy comparable to English taggers

03

Constraint-based approach is more effective with limited rule development time

Abstract

In this paper we compare two competing approaches to part-of-speech tagging, statistical and constraint-based disambiguation, using French as our test language. We imposed a time limit on our experiment: the amount of time spent on the design of our constraint system was about the same as the time we used to train and test the easy-to-implement statistical model. We describe the two systems and compare the results. The accuracy of the statistical method is reasonably good, comparable to taggers for English. But the constraint-based tagger seems to be superior even with the limited time we allowed ourselves for rule development.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Speech and dialogue systems · Text Readability and Simplification