Doing Good or Doing Right? Exploring the Weakness of Commonsense Causal Reasoning Models

Mingyue Han; Yinglin Wang

arXiv:2107.01791·cs.CL·June 26, 2025

Doing Good or Doing Right? Exploring the Weakness of Commonsense Causal Reasoning Models

Mingyue Han, Yinglin Wang

PDF

Open Access 1 Repo

TL;DR

This paper examines the limitations of current pretrained language models in causal reasoning tasks, revealing their vulnerability to semantic bias and proposing a regularization method to improve robustness and generalization.

Contribution

It identifies semantic similarity bias as a key weakness in COPA models and introduces a simple regularization technique to enhance causal reasoning capabilities.

Findings

01

Regularization improves model robustness on biased datasets

02

Models generalize better with the proposed method

03

Enhanced performance on the challenging BCOPA-CE dataset

Abstract

Pretrained language models (PLM) achieve surprising performance on the Choice of Plausible Alternatives (COPA) task. However, whether PLMs have truly acquired the ability of causal reasoning remains a question. In this paper, we investigate the problem of semantic similarity bias and reveal the vulnerability of current COPA models by certain attacks. Previous solutions that tackle the superficial cues of unbalanced token distribution still encounter the same problem of semantic bias, even more seriously due to the utilization of more training data. We mitigate this problem by simply adding a regularization loss and experimental results show that this solution not only improves the model's generalization ability, but also assists the models to perform more robustly on a challenging dataset, BCOPA-CE, which has unbiased token distribution and is more difficult for models to distinguish…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

badbadcode/weakCOPA
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification