pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference

Mandar Joshi; Eunsol Choi; Omer Levy; Daniel S. Weld; Luke Zettlemoyer

arXiv:1810.08854·cs.CL·April 9, 2019

TL;DR

pair2vec learns compositional word-pair embeddings that encode background knowledge about the relationships between words. Adding them to existing models improves cross-sentence inference tasks such as question answering and natural language inference.

Contribution

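The paper presents a method for learning word-pair embeddings by maximizing pointwise mutual information (PMI) with the contexts in which the two words co-occur. The embeddings are added to the cross-sentence attention layer of existing inference models, rather than extending or replacing their word embeddings, to improve inference performance.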
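To make "compositional" concrete, here is a minimal sketch of a function that builds one pair vector from two word vectors and scores it against a context vector. The feature layout `[x; y; x*y]`, the single ReLU layer, the dot-product score, and all names and dimensions are assumptions for illustration; the summary above does not specify the paper's actual architecture.

```python
import random


def compose_pair(x, y, weights, bias):
    """Map two word vectors to one pair vector: relu(W [x; y; x*y] + b).

    Hypothetical composition function; the layer shape is an assumption.
    """
    # Concatenate both word vectors with their element-wise product.
    features = x + y + [a * b for a, b in zip(x, y)]
    pair_vec = []
    for row, b in zip(weights, bias):
        z = sum(w * f for w, f in zip(row, features)) + b
        pair_vec.append(max(0.0, z))  # ReLU
    return pair_vec


def score(pair_vec, context_vec):
    """Dot product between a pair vector and a context vector."""
    return sum(p * c for p, c in zip(pair_vec, context_vec))


# Toy usage with random word vectors and random (untrained) parameters.
random.seed(0)
word_dim, pair_dim = 4, 3
x = [random.uniform(-1, 1) for _ in range(word_dim)]
y = [random.uniform(-1, 1) for _ in range(word_dim)]
weights = [
    [random.uniform(-0.5, 0.5) for _ in range(3 * word_dim)]
    for _ in range(pair_dim)
]
bias = [0.0] * pair_dim

pair_xy = compose_pair(x, y, weights, bias)
context = [random.uniform(-1, 1) for _ in range(pair_dim)]
print(pair_xy, score(pair_xy, context))
```

Because the pair vector is computed from the word vectors rather than looked up, a function of this kind can produce a representation for any two words, including pairs that never co-occurred in the training corpus.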

Findings

01. 2.7% accuracy gain on SQuAD2.0
02. 1.3% accuracy gain on MultiNLI
03. Enhanced generalization on adversarial datasets

Abstract

Reasoning about implied relationships (e.g., paraphrastic, common sense, encyclopedic) between pairs of words is crucial for many cross-sentence inference problems. This paper proposes new methods for learning and using embeddings of word pairs that implicitly represent background knowledge about such relationships. Our pairwise embeddings are computed as a compositional function on word representations, which is learned by maximizing the pointwise mutual information (PMI) with the contexts in which the two words co-occur. We add these representations to the cross-sentence attention layer of existing inference models (e.g. BiDAF for QA, ESIM for NLI), instead of extending or replacing existing word embeddings. Experiments show a gain of 2.7% on the recently released SQuAD2.0 and 1.3% on MultiNLI. Our representations also aid in better generalization with gains of around 6-7% on…
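As a rough illustration of the quantity the abstract refers to, the sketch below computes empirical PMI between word pairs and the contexts they co-occur in, using the standard definition PMI((x, y), c) = log [ p(x, y, c) / (p(x, y) p(c)) ]. The toy triples and the function name `pmi_table` are invented for this example, and counting is only a stand-in: the paper learns embeddings rather than tabulating counts.

```python
import math
from collections import Counter


def pmi_table(triples):
    """Return empirical PMI((x, y), c) for every observed (x, y, c) triple."""
    total = len(triples)
    joint = Counter(((x, y), c) for x, y, c in triples)
    pair_counts = Counter((x, y) for x, y, _ in triples)
    ctx_counts = Counter(c for _, _, c in triples)
    # p(pair, c) / (p(pair) * p(c)) simplifies to n * total / (n_pair * n_c).
    return {
        (pair, c): math.log(n * total / (pair_counts[pair] * ctx_counts[c]))
        for (pair, c), n in joint.items()
    }


# Invented toy data: (word x, word y, context with the words masked out).
triples = [
    ("paris", "france", "X is the capital of Y"),
    ("paris", "france", "X is the capital of Y"),
    ("tokyo", "japan", "X is the capital of Y"),
    ("paris", "france", "X , Y"),
    ("tokyo", "japan", "X , Y"),
    ("cat", "dog", "X , Y"),
    ("cat", "dog", "X and Y"),
    ("cat", "dog", "X and Y"),
]

table = pmi_table(triples)
for (pair, ctx), value in sorted(table.items(), key=lambda kv: -kv[1]):
    print(f"{pair} | {ctx!r}: {value:+.3f}")
```

In this toy data, contexts that are specific to a pair score above zero, while the generic context `X , Y` scores below zero for ("cat", "dog"), which is the kind of signal a pair embedding trained on this objective would be expected to capture.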

Taxonomy

Methods: Enhanced Sequential Inference Model