Characterizing Departures from Linearity in Word Translation

Ndapa Nakashole; Raphael Flauger

arXiv:1806.04508·cs.CL·June 19, 2018·1 cites

Characterizing Departures from Linearity in Word Translation

Ndapa Nakashole, Raphael Flauger

PDF

Open Access

TL;DR

This paper explores how linear approximations of word translation maps vary across embedding spaces, revealing their non-linear nature and providing insights for improving translation methods.

Contribution

It introduces a method to analyze local linearity in word translation maps, demonstrating their non-linear behavior and its correlation with neighborhood distances.

Findings

01

Local linear maps vary across embedding spaces

02

Non-linearity is tightly correlated with neighborhood distance

03

Results can inform the design of more accurate translation maps

Abstract

We investigate the behavior of maps learned by machine translation methods. The maps translate words by projecting between word embedding spaces of different languages. We locally approximate these maps using linear maps, and find that they vary across the word embedding space. This demonstrates that the underlying maps are non-linear. Importantly, we show that the locally linear maps vary by an amount that is tightly correlated with the distance between the neighborhoods on which they are trained. Our results can be used to test non-linear methods, and to drive the design of more accurate maps for word translation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis