Reinforced Iterative Knowledge Distillation for Cross-Lingual Named   Entity Recognition

Shining Liang; Ming Gong; Jian Pei; Linjun Shou; Wanli Zuo; Xianglin; Zuo; Daxin Jiang

arXiv:2106.00241·cs.CL·June 2, 2021·5 cites

Reinforced Iterative Knowledge Distillation for Cross-Lingual Named Entity Recognition

Shining Liang, Ming Gong, Jian Pei, Linjun Shou, Wanli Zuo, Xianglin, Zuo, Daxin Jiang

PDF

Open Access

TL;DR

This paper introduces a novel reinforcement learning-based semi-supervised approach for cross-lingual NER that leverages large unlabeled target language data, achieving state-of-the-art results and practical deployment in industry applications.

Contribution

The paper presents a new reinforcement learning method to utilize unlabeled target language data for cross-lingual NER, improving performance over existing methods.

Findings

01

Achieved new state-of-the-art results on three benchmark datasets.

02

Demonstrated effectiveness in real-world industrial applications.

03

Enhanced NER performance by leveraging unlabeled data with reinforcement learning.

Abstract

Named entity recognition (NER) is a fundamental component in many applications, such as Web Search and Voice Assistants. Although deep neural networks greatly improve the performance of NER, due to the requirement of large amounts of training data, deep neural networks can hardly scale out to many languages in an industry setting. To tackle this challenge, cross-lingual NER transfers knowledge from a rich-resource language to languages with low resources through pre-trained multilingual language models. Instead of using training data in target languages, cross-lingual NER has to rely on only training data in source languages, and optionally adds the translated training data derived from source languages. However, the existing cross-lingual NER methods do not make good use of rich unlabeled data in target languages, which is relatively easy to collect in industry applications. To address…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning