Duality Regularization for Unsupervised Bilingual Lexicon Induction

Xuefeng Bai; Yue Zhang; Hailong Cao; Tiejun Zhao

arXiv:1909.01013·cs.CL·October 7, 2022·1 cites

Duality Regularization for Unsupervised Bilingual Lexicon Induction

Xuefeng Bai, Yue Zhang, Hailong Cao, Tiejun Zhao

PDF

Open Access

TL;DR

This paper introduces a joint training approach with regularization for unsupervised bilingual lexicon induction, leveraging the duality between language pairs to improve translation accuracy.

Contribution

It proposes a novel joint primal-dual training framework with regularizers to enforce consistency, advancing unsupervised bilingual lexicon induction methods.

Findings

01

Significant performance improvements over baselines

02

Achieved best results on standard benchmarks

03

Effective across multiple language pairs

Abstract

Unsupervised bilingual lexicon induction naturally exhibits duality, which results from symmetry in back-translation. For example, EN-IT and IT-EN induction can be mutually primal and dual problems. Current state-of-the-art methods, however, consider the two tasks independently. In this paper, we propose to train primal and dual models jointly, using regularizers to encourage consistency in back translation cycles. Experiments across 6 language pairs show that the proposed method significantly outperforms competitive baselines, obtaining the best-published results on a standard benchmark.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications