Unsupervised Bilingual Lexicon Induction from Mono-lingual Multimodal Data
Shizhe Chen, Qin Jin, Alexander Hauptmann

TL;DR
This paper introduces a novel unsupervised method for bilingual lexicon induction using multilingual caption models trained on monolingual multimodal data, leveraging both linguistic and localized visual features to improve translation accuracy.
Contribution
It proposes a multi-lingual caption model that jointly learns linguistic and visual features from monolingual multimodal data, enabling unsupervised bilingual lexicon induction without parallel corpora.
Findings
Outperforms previous vision-based methods on multiple language pairs.
Effectively learns translations for less visual-relevant words.
Utilizes both linguistic context and localized visual features for better translation.
Abstract
Bilingual lexicon induction, translating words from the source language to the target language, is a long-standing natural language processing task. Recent endeavors prove that it is promising to employ images as pivot to learn the lexicon induction without reliance on parallel corpora. However, these vision-based approaches simply associate words with entire images, which are constrained to translate concrete words and require object-centered images. We humans can understand words better when they are within a sentence with context. Therefore, in this paper, we propose to utilize images and their associated captions to address the limitations of previous approaches. We propose a multi-lingual caption model trained with different mono-lingual multimodal data to map words in different languages into joint spaces. Two types of word representation are induced from the multi-lingual caption…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Natural Language Processing Techniques · Topic Modeling
