HANS, are you clever? Clever Hans Effect Analysis of Neural Systems

Leonardo Ranaldi; Fabio Massimo Zanzotto

arXiv:2309.12481·cs.CL·May 3, 2024

HANS, are you clever? Clever Hans Effect Analysis of Neural Systems

Leonardo Ranaldi, Fabio Massimo Zanzotto

PDF

Open Access

TL;DR

This paper investigates the biases in instruction-tuned large language models, revealing order and positional biases that affect reasoning, and demonstrates how Chain-of-Thought prompting can mitigate these biases for more robust performance.

Contribution

The study identifies inherent biases in It-LLMs and proposes the use of Chain-of-Thought prompting to reduce bias and improve reasoning robustness.

Findings

01

Order bias significantly affects model performance.

02

Positional bias correlates with initial choice positions.

03

Chain-of-Thought prompting mitigates biases and enhances reasoning.

Abstract

Instruction-tuned Large Language Models (It-LLMs) have been exhibiting outstanding abilities to reason around cognitive states, intentions, and reactions of all people involved, letting humans guide and comprehend day-to-day social interactions effectively. In fact, several multiple-choice questions (MCQ) benchmarks have been proposed to construct solid assessments of the models' abilities. However, earlier works are demonstrating the presence of inherent "order bias" in It-LLMs, posing challenges to the appropriate evaluation. In this paper, we investigate It-LLMs' resilience abilities towards a series of probing tests using four MCQ benchmarks. Introducing adversarial examples, we show a significant performance gap, mainly when varying the order of the choices, which reveals a selection bias and brings into discussion reasoning abilities. Following a correlation between first…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling