ACCESS : A Benchmark for Abstract Causal Event Discovery and Reasoning
Vy Vo, Lizhen Qu, Tao Feng, Yuncheng Hua, Xiaoxi Kang, Songhai Fan,, Tim Dwyer, Lay-Ki Soon, Gholamreza Haffari

TL;DR
ACCESS is a new benchmark for discovering and reasoning about abstract causal events in everyday life, aiming to improve causal understanding in NLP models, especially in out-of-distribution scenarios.
Contribution
The paper introduces ACCESS, a novel benchmark focusing on abstract causal events, and presents a pipeline for extracting causal pairs from a large commonsense causal knowledge dataset.
Findings
ACCESS highlights challenges in automatic abstraction and causal discovery.
Abstract causal knowledge can improve question-answering reasoning in LLMs.
The benchmark provides 1,400 causal pairs for research and evaluation.
Abstract
Identifying cause-and-effect relationships is critical to understanding real-world dynamics and ultimately causal reasoning. Existing methods for identifying event causality in NLP, including those based on Large Language Models (LLMs), exhibit difficulties in out-of-distribution settings due to the limited scale and heavy reliance on lexical cues within available benchmarks. Modern benchmarks, inspired by probabilistic causal inference, have attempted to construct causal graphs of events as a robust representation of causal knowledge, where \texttt{CRAB} \citep{romanou2023crab} is one such recent benchmark along this line. In this paper, we introduce \texttt{ACCESS}, a benchmark designed for discovery and reasoning over abstract causal events. Unlike existing resources, \texttt{ACCESS} focuses on causality of everyday life events on the abstraction level. We propose a pipeline for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsData Quality and Management · Semantic Web and Ontologies · Business Process Modeling and Analysis
