Contextual Label Projection for Cross-Lingual Structured Prediction
Tanmay Parekh, I-Hung Hsu, Kuan-Hao Huang, Kai-Wei Chang, Nanyun Peng

TL;DR
This paper introduces CLaP, a novel label projection method that uses contextual translation with multilingual instruction-tuned models to improve cross-lingual structured prediction accuracy across many languages.
Contribution
CLaP is a new label projection approach that enhances translation accuracy by leveraging contextual information from translated texts using instruction-tuned multilingual models.
Findings
Over 2.4 F1 improvement in event argument extraction
Over 1.4 F1 improvement in named entity recognition
Effective on ten extremely low-resource languages
Abstract
Label projection, which involves obtaining translated labels and texts jointly, is essential for leveraging machine translation to facilitate cross-lingual transfer in structured prediction tasks. Prior research exploring label projection often compromise translation accuracy by favoring simplified label translation or relying solely on word-level alignments. In this paper, we introduce a novel label projection approach, CLaP, which translates text to the target language and performs contextual translation on the labels using the translated text as the context, ensuring better accuracy for the translated labels. We leverage instruction-tuned language models with multilingual capabilities as our contextual translator, imposing the constraint of the presence of translated labels in the translated text via instructions. We benchmark CLaP with other label projection techniques on zero-shot…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Web Data Mining and Analysis
