Zero-Shot Cross-Lingual Transfer in Legal Domain Using Transformer Models
Zein Shaheen, Gerhard Wohlgenannt, Dmitry Mouromtsev

TL;DR
This paper explores zero-shot cross-lingual transfer for legal document classification from English to French and German using transformer models, demonstrating significant improvements with specific training techniques.
Contribution
It introduces effective training techniques like Gradual Unfreezing and Language Model finetuning to enhance zero-shot transfer in legal NLP tasks across languages.
Findings
Language model finetuning yields up to 87.54% relative improvement.
Gradual unfreezing improves performance by up to 70%.
Zero-shot models achieve 86% of joint training performance.
Abstract
Zero-shot cross-lingual transfer is an important feature in modern NLP models and architectures to support low-resource languages. In this work, We study zero-shot cross-lingual transfer from English to French and German under Multi-Label Text Classification, where we train a classifier using English training set, and we test using French and German test sets. We extend EURLEX57K dataset, the English dataset for topic classification of legal documents, with French and German official translation. We investigate the effect of using some training techniques, namely Gradual Unfreezing and Language Model finetuning, on the quality of zero-shot cross-lingual transfer. We find that Language model finetuning of multi-lingual pre-trained model (M-DistilBERT, M-BERT) leads to 32.0-34.94%, 76.15-87.54% relative improvement on French and German test sets correspondingly. Also, Gradual unfreezing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
