Generating Effective CoT Traces for Mitigating Causal Hallucination

Yiheng Zhao; Jun Yan

arXiv:2604.12748·cs.CL·April 15, 2026

Generating Effective CoT Traces for Mitigating Causal Hallucination

Yiheng Zhao, Jun Yan

PDF

TL;DR

This paper introduces a pipeline for generating Chain-of-Thought traces to reduce causal hallucination in smaller language models, along with a new metric to measure hallucination, leading to improved reasoning accuracy.

Contribution

It proposes a novel pipeline for creating effective CoT traces and a new metric, CHR, to mitigate causal hallucination in small LLMs.

Findings

01

Fine-tuning with generated CoT traces reduces causal hallucination.

02

The approach improves mean accuracy and generalization across datasets.

03

Models show robustness under misleading prompts.

Abstract

Although large language models (LLMs) excel in complex reasoning tasks, they suffer from severe causal hallucination in event causality identification (ECI), particularly in smaller models ( $\leq$ 1.5B parameters). A promising approach to address this issue is to fine-tune them with Chain-of-Thought (CoT) traces. However, there is currently a lack of CoT trace dataset available for ECI. In this paper, we first investigate the essential criteria that effective CoT traces should possess to mitigate causal hallucination in smaller models. We then design a pipeline to generate CoT traces that meet these criteria. Moreover, since there is currently no metric for quantifying causal hallucination, we also introduce a new metric, the Causal Hallucination Rate (CHR), to quantify causal hallucination, guide the formulation of effective CoT trace criteria, and validate the effectiveness of our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.