Neural Named Entity Recognition from Subword Units
Abdalghani Abujabal, Judith Gaspers

TL;DR
This paper explores neural NER using subword units like characters, phonemes, and bytes, reducing vocabulary size and improving performance, especially with larger datasets, across multiple languages.
Contribution
It introduces a subword-based neural NER model that reduces vocabulary size and enhances performance, especially in large-scale, multilingual settings.
Findings
Subword units approach approaches word-level performance with less vocabulary.
Combining subword units improves NER accuracy.
Subword models outperform word-only models with limited data.
Abstract
Named entity recognition (NER) is a vital task in spoken language understanding, which aims to identify mentions of named entities in text e.g., from transcribed speech. Existing neural models for NER rely mostly on dedicated word-level representations, which suffer from two main shortcomings. First, the vocabulary size is large, yielding large memory requirements and training time. Second, these models are not able to learn morphological or phonological representations. To remedy the above shortcomings, we adopt a neural solution based on bidirectional LSTMs and conditional random fields, where we rely on subword units, namely characters, phonemes, and bytes. For each word in an utterance, our model learns a representation from each of the subword units. We conducted experiments in a real-world large-scale setting for the use case of a voice-controlled device covering four languages…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
