A Comparison of Named Entity Recognition Tools Applied to Biographical   Texts

Samet Atda\u{g}; Vincent Labatut

arXiv:1308.0661·cs.IR·November 27, 2013

A Comparison of Named Entity Recognition Tools Applied to Biographical Texts

Samet Atda\u{g}, Vincent Labatut

PDF

1 Repo

TL;DR

This paper compares several publicly available NER tools on a new biographical text corpus, evaluating their performance differences and potential for combined use to enhance accuracy in recognizing entities in biographical data.

Contribution

It introduces a new annotated Wikipedia corpus for biographical texts and provides a comparative analysis of four prominent NER tools, highlighting their strengths and limitations.

Findings

01

Stanford NER performs best overall

02

Performance varies by entity type and article category

03

Combining tools could improve NER accuracy

Abstract

Named entity recognition (NER) is a popular domain of natural language processing. For this reason, many tools exist to perform this task. Amongst other points, they differ in the processing method they rely upon, the entity types they can detect, the nature of the text they can handle, and their input/output formats. This makes it difficult for a user to select an appropriate NER tool for a specific situation. In this article, we try to answer this question in the context of biographic texts. For this matter, we first constitute a new corpus by annotating Wikipedia articles. We then select publicly available, well known and free for research NER tools for comparison: Stanford NER, Illinois NET, OpenCalais NER WS and Alias-i LingPipe. We apply them to our corpus, assess their performances and compare them. When considering overall performances, a clear hierarchy emerges: Stanford has…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

CompNet/nerwip
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.