Reinforced Iterative Knowledge Distillation for Cross-Lingual Named Entity Recognition
Shining Liang, Ming Gong, Jian Pei, Linjun Shou, Wanli Zuo, Xianglin, Zuo, Daxin Jiang

TL;DR
This paper introduces a novel reinforcement learning-based semi-supervised approach for cross-lingual NER that leverages large unlabeled target language data, achieving state-of-the-art results and practical deployment in industry applications.
Contribution
The paper presents a new reinforcement learning method to utilize unlabeled target language data for cross-lingual NER, improving performance over existing methods.
Findings
Achieved new state-of-the-art results on three benchmark datasets.
Demonstrated effectiveness in real-world industrial applications.
Enhanced NER performance by leveraging unlabeled data with reinforcement learning.
Abstract
Named entity recognition (NER) is a fundamental component in many applications, such as Web Search and Voice Assistants. Although deep neural networks greatly improve the performance of NER, due to the requirement of large amounts of training data, deep neural networks can hardly scale out to many languages in an industry setting. To tackle this challenge, cross-lingual NER transfers knowledge from a rich-resource language to languages with low resources through pre-trained multilingual language models. Instead of using training data in target languages, cross-lingual NER has to rely on only training data in source languages, and optionally adds the translated training data derived from source languages. However, the existing cross-lingual NER methods do not make good use of rich unlabeled data in target languages, which is relatively easy to collect in industry applications. To address…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning
