RuNNE-2022 Shared Task: Recognizing Nested Named Entities

Ekaterina Artemova; Maxim Zmeev; Natalia Loukachevitch; Igor Rozhkov,; Tatiana Batura; Vladimir Ivanov; Elena Tutubalina

arXiv:2205.11159·cs.CL·October 4, 2023·6 cites

RuNNE-2022 Shared Task: Recognizing Nested Named Entities

Ekaterina Artemova, Maxim Zmeev, Natalia Loukachevitch, Igor Rozhkov,, Tatiana Batura, Vladimir Ivanov, Elena Tutubalina

PDF

Open Access 1 Repo

TL;DR

The RuNNE-2022 shared task focuses on recognizing nested named entities in Russian texts, involving complex annotations and multiple entity types, with many submissions outperforming baseline models.

Contribution

This paper introduces a shared task dataset and evaluation framework for nested Russian NER, highlighting the challenge of recognizing overlapping and nested entities.

Findings

01

Half of the submissions outperform the BERT baseline.

02

Nested entities can be recognized with up to six levels of nesting.

03

The shared task provides insights into nested NER system performance.

Abstract

The RuNNE Shared Task approaches the problem of nested named entity recognition. The annotation schema is designed in such a way, that an entity may partially overlap or even be nested into another entity. This way, the named entity "The Yermolova Theatre" of type "organization" houses another entity "Yermolova" of type "person". We adopt the Russian NEREL dataset for the RuNNE Shared Task. NEREL comprises news texts written in the Russian language and collected from the Wikinews portal. The annotation schema includes 29 entity types. The nestedness of named entities in NEREL reaches up to six levels. The RuNNE Shared Task explores two setups. (i) In the general setup all entities occur more or less with the same frequency. (ii) In the few-shot setup the majority of entity types occur often in the training set. However, some of the entity types are have lower frequency, being thus…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dialogue-evaluation/runne
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification