Constrained Density Matching and Modeling for Cross-lingual Alignment of Contextualized Representations
Wei Zhao, Steffen Eger

TL;DR
This paper introduces density-based alignment methods, Real-NVP and GAN-Real-NVP, for cross-lingual representation alignment that perform well with limited or no parallel data, improving multilingual NLP tasks.
Contribution
It proposes novel density matching and modeling techniques using Normalizing Flow for multilingual alignment, addressing data efficiency and training issues of prior methods.
Findings
Supervised approach with 20k sentences outperforms methods trained on 100k sentences.
Unsupervised approach enables removing parallel data without performance loss.
Validation criteria effectively guide training without relying on validation data.
Abstract
Multilingual representations pre-trained with monolingual data exhibit considerably unequal task performances across languages. Previous studies address this challenge with resource-intensive contextualized alignment, which assumes the availability of large parallel data, thereby leaving under-represented language communities behind. In this work, we attribute the data hungriness of previous alignment techniques to two limitations: (i) the inability to sufficiently leverage data and (ii) these techniques are not trained properly. To address these issues, we introduce supervised and unsupervised density-based approaches named Real-NVP and GAN-Real-NVP, driven by Normalizing Flow, to perform alignment, both dissecting the alignment of multilingual subspaces into density matching and density modeling. We complement these approaches with our validation criteria in order to guide the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
