Zero-shot Cross-lingual Transfer Learning with Multiple Source and Target Languages for Information Extraction: Language Selection and Adversarial Training
Nghia Trung Ngo, Thien Huu Nguyen

TL;DR
This paper explores multi-lingual information extraction by analyzing language transferability, developing a language distance metric, and proposing adversarial training methods to improve zero-shot cross-lingual performance across many languages.
Contribution
It introduces a comprehensive analysis of cross-lingual transferability, a new language distance metric, and a relational transfer approach using adversarial training for multi-lingual IE.
Findings
Correlation between transfer performance and linguistic distances.
A robust language distance metric for diverse languages.
Enhanced zero-shot transfer with adversarial relational training.
Abstract
The majority of previous researches addressing multi-lingual IE are limited to zero-shot cross-lingual single-transfer (one-to-one) setting, with high-resource languages predominantly as source training data. As a result, these works provide little understanding and benefit for the realistic goal of developing a multi-lingual IE system that can generalize to as many languages as possible. Our study aims to fill this gap by providing a detailed analysis on Cross-Lingual Multi-Transferability (many-to-many transfer learning), for the recent IE corpora that cover a diverse set of languages. Specifically, we first determine the correlation between single-transfer performance and a wide range of linguistic-based distances. From the obtained insights, a combined language distance metric can be developed that is not only highly correlated but also robust across different tasks and model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis
MethodsSparse Evolutionary Training
