WANLI: Worker and AI Collaboration for Natural Language Inference   Dataset Creation

Alisa Liu; Swabha Swayamdipta; Noah A. Smith; Yejin Choi

arXiv:2201.05955·cs.CL·November 16, 2022·5 cites

WANLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation

Alisa Liu, Swabha Swayamdipta, Noah A. Smith, Yejin Choi

PDF

Open Access 1 Repo 1 Models 4 Datasets

TL;DR

This paper presents WANLI, a new NLI dataset created through a collaborative process involving AI-generated examples and human review, leading to improved model performance on out-of-domain tests.

Contribution

It introduces a novel dataset creation method combining language models and human evaluation, resulting in a diverse, challenging NLI dataset that enhances out-of-domain generalization.

Findings

01

Training on WANLI improves out-of-domain test performance by up to 11%.

02

WANLI outperforms larger datasets like MultiNLI in model training.

03

The approach demonstrates effective integration of AI and human efforts for dataset creation.

Abstract

A recurring challenge of crowdsourcing NLP datasets at scale is that human writers often rely on repetitive patterns when crafting examples, leading to a lack of linguistic diversity. We introduce a novel approach for dataset creation based on worker and AI collaboration, which brings together the generative strength of language models and the evaluative strength of humans. Starting with an existing dataset, MultiNLI for natural language inference (NLI), our approach uses dataset cartography to automatically identify examples that demonstrate challenging reasoning patterns, and instructs GPT-3 to compose new examples with similar patterns. Machine generated examples are then automatically filtered, and finally revised and labeled by human crowdworkers. The resulting dataset, WANLI, consists of 107,885 NLI examples and presents unique empirical strengths over existing NLI datasets.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alisawuffles/wanli
noneOfficial

Models

🤗
takehika/mdeberta-v3-en-wanli-nli
model· 15 dl
15 dl

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Cosine Annealing · Attention Dropout · Layer Normalization · Residual Connection · Adam · Dropout · Weight Decay