NMT-based Cross-lingual Document Embeddings

Wei Li; Brian Mak

arXiv:1807.11057·cs.CL·August 20, 2020

NMT-based Cross-lingual Document Embeddings

Wei Li, Brian Mak

PDF

Open Access

TL;DR

This paper proposes a constrained neural machine translation-based cross-lingual document embedding method that enhances embedding similarity without requiring translation during testing, improving efficiency and performance.

Contribution

It introduces a new constrained NV method that enforces embedding closeness for parallel documents, eliminating the need for translation at test time.

Findings

01

cNV performs as well as NV in classification tasks

02

cNV outperforms other methods that require decoding

03

The method is more lightweight and flexible

Abstract

This paper investigates a cross-lingual document embedding method that improves the current Neural machine Translation framework based Document Vector (NTDV or simply NV). NV is developed with a self-attention mechanism under the neural machine translation (NMT) framework. In NV, each pair of parallel documents in different languages are projected to the same shared layer in the model. However, the pair of NV embeddings are not guaranteed to be similar. This paper further adds a distance constraint to the training objective function of NV so that the two embeddings of a parallel document are required to be as close as possible. The new method will be called constrained NV (cNV). In a cross-lingual document classification task, the new cNV performs as well as NV and outperforms other published studies that require forward-pass decoding. Compared with the previous NV, cNV does not need a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies