Sharing Network Parameters for Crosslingual Named Entity Recognition
Rudra Murthy V, Mitesh Khapra, Pushpak Bhattacharyya

TL;DR
This paper introduces a neural network model that shares parameters across languages for crosslingual Named Entity Recognition, improving performance in resource-scarce languages by leveraging data from resource-rich languages without handcrafted features.
Contribution
The paper proposes a novel neural network architecture that shares decoder, word, and character parameters across languages for NER, enabling transfer learning without manual feature engineering.
Findings
Sharing parameters improves NER performance in low-resource languages.
Joint training with multiple languages outperforms single-language models.
No handcrafted features are needed, as the model learns representations directly from data.
Abstract
Most state of the art approaches for Named Entity Recognition rely on hand crafted features and annotated corpora. Recently Neural network based models have been proposed which do not require handcrafted features but still require annotated corpora. However, such annotated corpora may not be available for many languages. In this paper, we propose a neural network based model which allows sharing the decoder as well as word and character level parameters between two languages thereby allowing a resource fortunate language to aid a resource deprived language. Specifically, we focus on the case when limited annotated corpora is available in one language () and abundant annotated corpora is available in another language (). Sharing the network architecture and parameters between and leads to improved performance in . Further, our approach does not require any hand…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text and Document Classification Technologies
