NameTag 3: A Tool and a Service for Multilingual/Multitagset NER
Jana Strakov\'a, Milan Straka

TL;DR
NameTag 3 is an open-source, multilingual NER tool and web service that achieves state-of-the-art results across multiple datasets and languages, supporting flat and nested entities with a single model.
Contribution
It introduces a unified, multilingual, multitagset NER system with state-of-the-art performance, available as a command-line tool and cloud service, supporting flat and nested entities.
Findings
Achieves state-of-the-art results on 21 datasets in 15 languages.
Supports 17 languages for flat NER and nested NER for Czech.
Operates with a single 355M-parameter model for flat NER.
Abstract
We introduce NameTag 3, an open-source tool and cloud-based web service for multilingual, multidataset, and multitagset named entity recognition (NER), supporting both flat and nested entities. NameTag 3 achieves state-of-the-art results on 21 test datasets in 15 languages and remains competitive on the rest, even against larger models. It is available as a command-line tool and as a cloud-based service, enabling use without local installation. NameTag 3 web service currently provides flat NER for 17 languages, trained on 21 corpora and three NE tagsets, all powered by a single 355M-parameter fine-tuned model; and nested NER for Czech, powered by a 126M fine-tuned model. The source code is licensed under open-source MPL 2.0, while the models are distributed under non-commercial CC BY-NC-SA 4.0. Documentation is available at https://ufal.mff.cuni.cz/nametag, source code at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
