On the Evaluation of Semantic Phenomena in Neural Machine Translation   Using Natural Language Inference

Adam Poliak; Yonatan Belinkov; James Glass; Benjamin Van Durme

arXiv:1804.09779·cs.CL·May 8, 2018

On the Evaluation of Semantic Phenomena in Neural Machine Translation Using Natural Language Inference

Adam Poliak, Yonatan Belinkov, James Glass, Benjamin Van Durme

PDF

1 Repo

TL;DR

This paper introduces a method to evaluate how well neural machine translation systems encode various semantic phenomena by using their sentence representations as features for natural language inference tasks.

Contribution

It presents a novel process for assessing semantic encoding in NMT systems through NLI classifiers trained on recast semantic datasets.

Findings

01

NMT encoder supports syntax-semantics inferences

02

Limited support for world-knowledge-based inferences

03

Framework for evaluating semantic coverage in NMT

Abstract

We propose a process for investigating the extent to which sentence representations arising from neural machine translation (NMT) systems encode distinct semantic phenomena. We use these representations as features to train a natural language inference (NLI) classifier based on datasets recast from existing semantic annotations. In applying this process to a representative NMT system, we find its encoder appears most suited to supporting inferences at the syntax-semantics interface, as compared to anaphora resolution requiring world-knowledge. We conclude with a discussion on the merits and potential deficiencies of the existing process, and how it may be improved and extended as a broader framework for evaluating semantic coverage.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

boknilev/nmt-repr-analysis
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.