Probing Causality Manipulation of Large Language Models

Chenyang Zhang; Haibo Tong; Bin Zhang; Dongyu Zhang

arXiv:2408.14380·cs.CL·August 27, 2024

Probing Causality Manipulation of Large Language Models

Chenyang Zhang, Haibo Tong, Bin Zhang, Dongyu Zhang

PDF

Open Access

TL;DR

This paper investigates how large language models understand and manipulate causality, revealing their capabilities and limitations in recognizing causal relationships through hierarchical probing methods.

Contribution

It introduces a novel hierarchical probing approach using retrieval augmented generation and in-context learning to analyze LLMs' causality understanding.

Findings

01

LLMs can detect causality-related entities.

02

LLMs recognize direct causal relationships.

03

LLMs treat causality as part of overall sentence semantics.

Abstract

Large language models (LLMs) have shown various ability on natural language processing, including problems about causality. It is not intuitive for LLMs to command causality, since pretrained models usually work on statistical associations, and do not focus on causes and effects in sentences. So that probing internal manipulation of causality is necessary for LLMs. This paper proposes a novel approach to probe causality manipulation hierarchically, by providing different shortcuts to models and observe behaviors. We exploit retrieval augmented generation (RAG) and in-context learning (ICL) for models on a designed causality classification task. We conduct experiments on mainstream LLMs, including GPT-4 and some smaller and domain-specific models. Our results suggest that LLMs can detect entities related to causality and recognize direct causal relationships. However, LLMs lack…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsLinear Layer · Adam · Layer Normalization · Attention Is All You Need · Position-Wise Feed-Forward Layer · Dense Connections · Residual Connection · Multi-Head Attention · Byte Pair Encoding · Absolute Position Encodings