Automatic Error Detection in Part of Speech Tagging

David Elworthy (Sharp Laboratories of Europe)

arXiv:cmp-lg/9410013·cmp-lg·February 3, 2008·3 cites

Automatic Error Detection in Part of Speech Tagging

David Elworthy (Sharp Laboratories of Europe)

PDF

Open Access

TL;DR

This paper presents a technique for detecting errors in part of speech tagging using Hidden Markov Models by comparing observable values with a threshold, improving accuracy at the cost of efficiency.

Contribution

It introduces a novel error detection method for HMM taggers based on threshold comparison, enhancing tagging accuracy.

Findings

01

Technique effectively detects tagging errors

02

Empirical results validate the approach

03

Guidelines for threshold selection provided

Abstract

A technique for detecting errors made by Hidden Markov Model taggers is described, based on comparing observable values of the tagging process with a threshold. The resulting approach allows the accuracy of the tagger to be improved by accepting a lower efficiency, defined as the proportion of words which are tagged. Empirical observations are presented which demonstrate the validity of the technique and suggest how to choose an appropriate threshold.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Speech and dialogue systems