Improving Commonsense Causal Reasoning by Adversarial Training and Data   Augmentation

Ieva Stali\=unait\.e; Philip John Gorinski; Ignacio Iacobacci

arXiv:2101.04966·cs.CL·January 14, 2021·1 cites

Improving Commonsense Causal Reasoning by Adversarial Training and Data Augmentation

Ieva Stali\=unait\.e, Philip John Gorinski, Ignacio Iacobacci

PDF

Open Access 1 Video

TL;DR

This paper enhances commonsense causal reasoning models by employing adversarial training and data augmentation techniques, significantly improving robustness and performance on challenging datasets with limited data.

Contribution

It introduces adversarial training with synonym perturbations and discourse-based data augmentation to improve causal reasoning models' robustness and accuracy.

Findings

01

Significant performance boost on COPA and Balanced COPA datasets.

02

Improved model robustness against superficial cues.

03

Effective data augmentation with minimal additional data.

Abstract

Determining the plausibility of causal relations between clauses is a commonsense reasoning task that requires complex inference ability. The general approach to this task is to train a large pretrained language model on a specific dataset. However, the available training data for the task is often scarce, which leads to instability of model training or reliance on the shallow features of the dataset. This paper presents a number of techniques for making models more robust in the domain of causal reasoning. Firstly, we perform adversarial training by generating perturbed inputs through synonym substitution. Secondly, based on a linguistic theory of discourse connectives, we perform data augmentation using a discourse parser for detecting causally linked clauses in large text, and a generative language model for generating distractors. Both methods boost model performance on the Choice…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Improving Commonsense Causal Reasoning by Adversarial Training and Data Augmentation· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies