Everything Is All It Takes: A Multipronged Strategy for Zero-Shot Cross-Lingual Information Extraction
Mahsa Yarmohammadi, Shijie Wu, Marc Marone, Haoran Xu, Seth Ebner,, Guanghui Qin, Yunmo Chen, Jialiang Guo, Craig Harman, Kenton Murray, Aaron, Steven White, Mark Dredze, Benjamin Van Durme

TL;DR
This paper proposes a multipronged approach combining data projection and self-training techniques to improve zero-shot cross-lingual information extraction across multiple languages and tasks, outperforming individual strategies.
Contribution
It introduces a comprehensive strategy that integrates multiple techniques and evaluates their combined effectiveness for zero-shot cross-lingual IE across diverse languages and tasks.
Findings
Combining techniques yields better performance than individual methods.
Different techniques perform best for different tasks and languages.
The approach is effective across event extraction, NER, POS tagging, and dependency parsing.
Abstract
Zero-shot cross-lingual information extraction (IE) describes the construction of an IE model for some target language, given existing annotations exclusively in some other language, typically English. While the advance of pretrained multilingual encoders suggests an easy optimism of "train on English, run on any language", we find through a thorough exploration and extension of techniques that a combination of approaches, both new and old, leads to better performance than any one cross-lingual strategy in particular. We explore techniques including data projection and self-training, and how different pretrained encoders impact them. We use English-to-Arabic IE as our initial example, demonstrating strong performance in this setting for event extraction, named entity recognition, part-of-speech tagging, and dependency parsing. We then apply data projection and self-training to three…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
