Everything Is All It Takes: A Multipronged Strategy for Zero-Shot   Cross-Lingual Information Extraction

Mahsa Yarmohammadi; Shijie Wu; Marc Marone; Haoran Xu; Seth Ebner,; Guanghui Qin; Yunmo Chen; Jialiang Guo; Craig Harman; Kenton Murray; Aaron; Steven White; Mark Dredze; Benjamin Van Durme

arXiv:2109.06798·cs.CL·September 15, 2021

Everything Is All It Takes: A Multipronged Strategy for Zero-Shot Cross-Lingual Information Extraction

Mahsa Yarmohammadi, Shijie Wu, Marc Marone, Haoran Xu, Seth Ebner,, Guanghui Qin, Yunmo Chen, Jialiang Guo, Craig Harman, Kenton Murray, Aaron, Steven White, Mark Dredze, Benjamin Van Durme

PDF

2 Repos

TL;DR

This paper proposes a multipronged approach combining data projection and self-training techniques to improve zero-shot cross-lingual information extraction across multiple languages and tasks, outperforming individual strategies.

Contribution

It introduces a comprehensive strategy that integrates multiple techniques and evaluates their combined effectiveness for zero-shot cross-lingual IE across diverse languages and tasks.

Findings

01

Combining techniques yields better performance than individual methods.

02

Different techniques perform best for different tasks and languages.

03

The approach is effective across event extraction, NER, POS tagging, and dependency parsing.

Abstract

Zero-shot cross-lingual information extraction (IE) describes the construction of an IE model for some target language, given existing annotations exclusively in some other language, typically English. While the advance of pretrained multilingual encoders suggests an easy optimism of "train on English, run on any language", we find through a thorough exploration and extension of techniques that a combination of approaches, both new and old, leads to better performance than any one cross-lingual strategy in particular. We explore techniques including data projection and self-training, and how different pretrained encoders impact them. We use English-to-Arabic IE as our initial example, demonstrating strong performance in this setting for event extraction, named entity recognition, part-of-speech tagging, and dependency parsing. We then apply data projection and self-training to three…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.