SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense   Inference

Rowan Zellers; Yonatan Bisk; Roy Schwartz; Yejin Choi

arXiv:1808.05326·cs.CL·August 17, 2018

SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference

Rowan Zellers, Yonatan Bisk, Roy Schwartz, Yejin Choi

PDF

5 Datasets

TL;DR

SWAG introduces a large-scale grounded commonsense inference dataset with adversarial filtering to reduce biases, challenging models to understand and predict grounded situations more effectively.

Contribution

The paper presents SWAG, a new dataset for grounded commonsense inference, and introduces Adversarial Filtering to create a de-biased, challenging benchmark for AI models.

Findings

01

Humans achieve 88% accuracy on SWAG inference tasks.

02

State-of-the-art models perform significantly worse than humans.

03

Adversarial Filtering effectively reduces dataset biases.

Abstract

Given a partial description like "she opened the hood of the car," humans can reason about the situation and anticipate what might come next ("then, she examined the engine"). In this paper, we introduce the task of grounded commonsense inference, unifying natural language inference and commonsense reasoning. We present SWAG, a new dataset with 113k multiple choice questions about a rich spectrum of grounded situations. To address the recurring challenges of the annotation artifacts and human biases found in many existing datasets, we propose Adversarial Filtering (AF), a novel procedure that constructs a de-biased dataset by iteratively training an ensemble of stylistic classifiers, and using them to filter the data. To account for the aggressive adversarial filtering, we use state-of-the-art language models to massively oversample a diverse set of potential counterfactuals.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.