Debiasing Word Embeddings Improves Multimodal Machine Translation

Tosho Hirasawa; Mamoru Komachi

arXiv:1905.10464·cs.CL·June 25, 2019·1 cites

Debiasing Word Embeddings Improves Multimodal Machine Translation

Tosho Hirasawa, Mamoru Komachi

PDF

Open Access

TL;DR

This paper investigates the impact of debiasing pretrained word embeddings on multimodal neural machine translation, showing that appropriate debiasing techniques can significantly improve translation quality.

Contribution

It introduces two debiasing methods for pretrained embeddings and demonstrates their effectiveness across multiple models and language pairs in multimodal NMT.

Findings

01

Performance improved by up to +1.93 BLEU and +2.02 METEOR for English-German.

02

Debiasing techniques enhance translation quality in multimodal NMT.

03

Effective debiasing reduces hubness in high-dimensional embeddings.

Abstract

In recent years, pretrained word embeddings have proved useful for multimodal neural machine translation (NMT) models to address the shortage of available datasets. However, the integration of pretrained word embeddings has not yet been explored extensively. Further, pretrained word embeddings in high dimensional spaces have been reported to suffer from the hubness problem. Although some debiasing techniques have been proposed to address this problem for other natural language processing tasks, they have seldom been studied for multimodal NMT models. In this study, we examine various kinds of word embeddings and introduce two debiasing techniques for three multimodal NMT models and two language pairs -- English-German translation and English-French translation. With our optimal settings, the overall performance of multimodal models was improved by up to +1.93 BLEU and +2.02 METEOR for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications