Sharing Network Parameters for Crosslingual Named Entity Recognition

Rudra Murthy V; Mitesh Khapra; Pushpak Bhattacharyya

arXiv:1607.00198·cs.CL·July 4, 2016·5 cites

Sharing Network Parameters for Crosslingual Named Entity Recognition

Rudra Murthy V, Mitesh Khapra, Pushpak Bhattacharyya

PDF

Open Access

TL;DR

This paper introduces a neural network model that shares parameters across languages for crosslingual Named Entity Recognition, improving performance in resource-scarce languages by leveraging data from resource-rich languages without handcrafted features.

Contribution

The paper proposes a novel neural network architecture that shares decoder, word, and character parameters across languages for NER, enabling transfer learning without manual feature engineering.

Findings

01

Sharing parameters improves NER performance in low-resource languages.

02

Joint training with multiple languages outperforms single-language models.

03

No handcrafted features are needed, as the model learns representations directly from data.

Abstract

Most state of the art approaches for Named Entity Recognition rely on hand crafted features and annotated corpora. Recently Neural network based models have been proposed which do not require handcrafted features but still require annotated corpora. However, such annotated corpora may not be available for many languages. In this paper, we propose a neural network based model which allows sharing the decoder as well as word and character level parameters between two languages thereby allowing a resource fortunate language to aid a resource deprived language. Specifically, we focus on the case when limited annotated corpora is available in one language ( $L_{1}$ ) and abundant annotated corpora is available in another language ( $L_{2}$ ). Sharing the network architecture and parameters between $L_{1}$ and $L_{2}$ leads to improved performance in $L_{1}$ . Further, our approach does not require any hand…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text and Document Classification Technologies