HANS, are you clever? Clever Hans Effect Analysis of Neural Systems
Leonardo Ranaldi, Fabio Massimo Zanzotto

TL;DR
This paper investigates the biases in instruction-tuned large language models, revealing order and positional biases that affect reasoning, and demonstrates how Chain-of-Thought prompting can mitigate these biases for more robust performance.
Contribution
The study identifies inherent biases in It-LLMs and proposes the use of Chain-of-Thought prompting to reduce bias and improve reasoning robustness.
Findings
Order bias significantly affects model performance.
Positional bias correlates with initial choice positions.
Chain-of-Thought prompting mitigates biases and enhances reasoning.
Abstract
Instruction-tuned Large Language Models (It-LLMs) have been exhibiting outstanding abilities to reason around cognitive states, intentions, and reactions of all people involved, letting humans guide and comprehend day-to-day social interactions effectively. In fact, several multiple-choice questions (MCQ) benchmarks have been proposed to construct solid assessments of the models' abilities. However, earlier works are demonstrating the presence of inherent "order bias" in It-LLMs, posing challenges to the appropriate evaluation. In this paper, we investigate It-LLMs' resilience abilities towards a series of probing tests using four MCQ benchmarks. Introducing adversarial examples, we show a significant performance gap, mainly when varying the order of the choices, which reveals a selection bias and brings into discussion reasoning abilities. Following a correlation between first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling
