Mining Knowledge for Natural Language Inference from Wikipedia   Categories

Mingda Chen; Zewei Chu; Karl Stratos; Kevin Gimpel

arXiv:2010.01239·cs.CL·October 6, 2020

Mining Knowledge for Natural Language Inference from Wikipedia Categories

Mingda Chen, Zewei Chu, Karl Stratos, Kevin Gimpel

PDF

Open Access 1 Repo

TL;DR

This paper introduces WikiNLI, a large dataset derived from Wikipedia category hierarchies, to enhance natural language inference models by pretraining, demonstrating improved performance across multiple languages and knowledge bases.

Contribution

The paper presents WikiNLI, a novel large-scale dataset for NLI and LE tasks, and shows that pretraining models on it improves performance over other knowledge bases and across languages.

Findings

01

Pretraining on WikiNLI improves BERT and RoBERTa performance.

02

WikiNLI outperforms WordNet and Wikidata in pretraining effectiveness.

03

Multilingual WikiNLI enhances NLI tasks in corresponding languages.

Abstract

Accurate lexical entailment (LE) and natural language inference (NLI) often require large quantities of costly annotations. To alleviate the need for labeled data, we introduce WikiNLI: a resource for improving model performance on NLI and LE tasks. It contains 428,899 pairs of phrases constructed from naturally annotated category hierarchies in Wikipedia. We show that we can improve strong baselines such as BERT and RoBERTa by pretraining them on WikiNLI and transferring the models on downstream tasks. We conduct systematic comparisons with phrases extracted from other knowledge bases such as WordNet and Wikidata to find that pretraining on WikiNLI gives the best performance. In addition, we construct WikiNLI in other languages, and show that pretraining on them improves performance on NLI tasks of corresponding languages.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ZeweiChu/WikiNLI
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications

MethodsLinear Layer · Dense Connections · Layer Normalization · WordPiece · Multi-Head Attention · Dropout · Linear Warmup With Linear Decay · Attention Dropout · Weight Decay · RoBERTa