Zero-Shot Dependency Parsing with Worst-Case Aware Automated Curriculum Learning

Miryam de Lhoneux; Sheng Zhang; Anders Søgaard

arXiv:2203.08555 · cs.CL · March 17, 2022

TL;DR

This paper applies automated curriculum learning to dynamically optimize zero-shot dependency parsing performance on low-resource and outlier languages, outperforming uniform and size-proportional sampling.

Contribution

It adapts a worst-case aware automated curriculum learning method from multi-task learning to zero-shot dependency parsing across diverse languages, especially low-resource and outlier languages.

Findings

01 Outperforms uniform sampling in zero-shot parsing

02 Significantly improves parsing accuracy on outlier languages

03 Demonstrates effectiveness of curriculum learning in multilingual models

Abstract

Large multilingual pretrained language models such as mBERT and XLM-RoBERTa have been found to be surprisingly effective for cross-lingual transfer of syntactic parsing models (Wu and Dredze 2019), but only between related languages. However, when parsing truly low-resource languages, source and training languages are rarely related. To close this gap, we adopt a method from multi-task learning, which relies on automated curriculum learning, to dynamically optimize for parsing performance on outlier languages. We show that this approach is significantly better than uniform and size-proportional sampling in the zero-shot setting.
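
The abstract contrasts three ways of choosing which training language to draw the next batch from. The sketch below illustrates the difference under stated assumptions: it is not the authors' implementation. The function names, the mixing parameter `phi`, the language codes, the treebank sizes, and the loss values are all hypothetical. The worst-case aware sampler shown here picks the language with the highest current loss with probability `phi` and otherwise samples in proportion to loss, which is one plausible reading of "worst-case aware" curriculum learning.

```python
import random


def uniform_probs(sizes):
    """Every training language is equally likely."""
    n = len(sizes)
    return {lang: 1.0 / n for lang in sizes}


def size_proportional_probs(sizes):
    """Languages with larger treebanks are sampled more often."""
    total = sum(sizes.values())
    return {lang: size / total for lang, size in sizes.items()}


def worst_case_aware_probs(losses, phi=0.5):
    """Assumed formulation: with probability phi pick the worst language
    (highest current loss); otherwise sample proportionally to loss."""
    total = sum(losses.values())
    if total <= 0:
        # No loss signal yet: fall back to uniform sampling.
        return uniform_probs(losses)
    worst = max(losses, key=losses.get)
    probs = {lang: (1.0 - phi) * loss / total for lang, loss in losses.items()}
    probs[worst] += phi
    return probs


def sample_language(probs, rng):
    """Draw one language according to the given distribution."""
    langs = list(probs)
    weights = [probs[lang] for lang in langs]
    return rng.choices(langs, weights=weights, k=1)[0]


# Hypothetical treebank sizes and current per-language dev losses.
sizes = {"en": 12000, "fi": 3000, "wo": 500}
losses = {"en": 0.4, "fi": 0.6, "wo": 1.0}

rng = random.Random(0)
for name, probs in [
    ("uniform", uniform_probs(sizes)),
    ("size-proportional", size_proportional_probs(sizes)),
    ("worst-case aware", worst_case_aware_probs(losses, phi=0.5)),
]:
    print(name, probs, "next batch from:", sample_language(probs, rng))
```

In a training loop, the losses would be re-estimated periodically so that the distribution keeps shifting toward whichever languages the parser currently handles worst, whereas the uniform and size-proportional distributions stay fixed throughout training.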

Code & Models

Repositories

mdelhoneux/machamp-worst_case_acl
PyTorch · Official

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications

Methods: mBERT