TL;DR
This paper introduces PPT, a simple yet effective unsupervised cross-lingual parser transfer method that leverages self-training with minimal resources, improving parsing accuracy for low-resource languages.
Contribution
It proposes a flexible, resource-efficient transfer approach that works without source data and supports various parsing types, outperforming existing models.
Findings
Significant improvements over state-of-the-art transfer models.
Effective multi-source transfer enhances parsing accuracy.
Non-projective parsing offers notable advantages.
Abstract
Cross-lingual transfer is a leading technique for parsing low-resource languages in the absence of explicit supervision. Simple `direct transfer' of a learned model based on a multilingual input encoding has provided a strong benchmark. This paper presents a method for unsupervised cross-lingual transfer that improves over direct transfer systems by using their output as implicit supervision as part of self-training on unlabelled text in the target language. The method assumes minimal resources and provides maximal flexibility by (a) accepting any pre-trained arc-factored dependency parser; (b) assuming no access to source language data; (c) supporting both projective and non-projective parsing; and (d) supporting multi-source transfer. With English as the source language, we show significant improvements over state-of-the-art transfer models on both distant and nearby languages,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
