Improving Commonsense Causal Reasoning by Adversarial Training and Data Augmentation
Ieva Stali\=unait\.e, Philip John Gorinski, Ignacio Iacobacci

TL;DR
This paper enhances commonsense causal reasoning models by employing adversarial training and data augmentation techniques, significantly improving robustness and performance on challenging datasets with limited data.
Contribution
It introduces adversarial training with synonym perturbations and discourse-based data augmentation to improve causal reasoning models' robustness and accuracy.
Findings
Significant performance boost on COPA and Balanced COPA datasets.
Improved model robustness against superficial cues.
Effective data augmentation with minimal additional data.
Abstract
Determining the plausibility of causal relations between clauses is a commonsense reasoning task that requires complex inference ability. The general approach to this task is to train a large pretrained language model on a specific dataset. However, the available training data for the task is often scarce, which leads to instability of model training or reliance on the shallow features of the dataset. This paper presents a number of techniques for making models more robust in the domain of causal reasoning. Firstly, we perform adversarial training by generating perturbed inputs through synonym substitution. Secondly, based on a linguistic theory of discourse connectives, we perform data augmentation using a discourse parser for detecting causally linked clauses in large text, and a generative language model for generating distractors. Both methods boost model performance on the Choice…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
