Unsupervised Word Translation Pairing using Refinement based Point Set Registration
Silviu Oprea, Sourav Dutta, Haytham Assem

TL;DR
This paper introduces BioSpere, a robust unsupervised method for bilingual word embedding alignment that combines adversarial initialization with point set registration, achieving state-of-the-art results across multiple language pairs.
Contribution
It presents a novel framework integrating adversarial learning and point set registration to improve stability and performance in unsupervised bilingual word embedding alignment.
Findings
Achieves state-of-the-art results on dictionary induction tasks
Demonstrates robustness to parameter variations and training instability
Outperforms existing methods across diverse language pairs
Abstract
Cross-lingual alignment of word embeddings play an important role in knowledge transfer across languages, for improving machine translation and other multi-lingual applications. Current unsupervised approaches rely on similarities in geometric structure of word embedding spaces across languages, to learn structure-preserving linear transformations using adversarial networks and refinement strategies. However, such techniques, in practice, tend to suffer from instability and convergence issues, requiring tedious fine-tuning for precise parameter setting. This paper proposes BioSpere, a novel framework for unsupervised mapping of bi-lingual word embeddings onto a shared vector space, by combining adversarial initialization and refinement procedure with point set registration algorithm used in image processing. We show that our framework alleviates the shortcomings of existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis
