WANLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation
Alisa Liu, Swabha Swayamdipta, Noah A. Smith, Yejin Choi

TL;DR
This paper presents WANLI, a new NLI dataset created through a collaborative process involving AI-generated examples and human review, leading to improved model performance on out-of-domain tests.
Contribution
It introduces a novel dataset creation method combining language models and human evaluation, resulting in a diverse, challenging NLI dataset that enhances out-of-domain generalization.
Findings
Training on WANLI improves out-of-domain test performance by up to 11%.
WANLI outperforms larger datasets like MultiNLI in model training.
The approach demonstrates effective integration of AI and human efforts for dataset creation.
Abstract
A recurring challenge of crowdsourcing NLP datasets at scale is that human writers often rely on repetitive patterns when crafting examples, leading to a lack of linguistic diversity. We introduce a novel approach for dataset creation based on worker and AI collaboration, which brings together the generative strength of language models and the evaluative strength of humans. Starting with an existing dataset, MultiNLI for natural language inference (NLI), our approach uses dataset cartography to automatically identify examples that demonstrate challenging reasoning patterns, and instructs GPT-3 to compose new examples with similar patterns. Machine generated examples are then automatically filtered, and finally revised and labeled by human crowdworkers. The resulting dataset, WANLI, consists of 107,885 NLI examples and presents unique empirical strengths over existing NLI datasets.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Cosine Annealing · Attention Dropout · Layer Normalization · Residual Connection · Adam · Dropout · Weight Decay
