Causality in Requirements Artifacts: Prevalence, Detection, and Impact
Julian Frattini, Jannik Fischbach, Daniel Mendez, Michael, Unterkalmsteiner, Andreas Vogelsang, Krzystof Wnuk

TL;DR
This paper investigates the prevalence and detection of causal relations in natural language requirements, introduces a tool called CiRA for automatic causality detection, and demonstrates its effectiveness and impact on requirements analysis.
Contribution
It provides the first comprehensive analysis of causality in requirements and presents a novel, effective tool for automatic causality detection in natural language requirements.
Findings
Causality appears in about 28% of requirements sentences.
CiRA achieves a macro-F1 score of 82% in causality detection.
Causality correlates positively with various features of requirements.
Abstract
Background: Causal relations in natural language (NL) requirements convey strong, semantic information. Automatically extracting such causal information enables multiple use cases, such as test case generation, but it also requires to reliably detect causal relations in the first place. Currently, this is still a cumbersome task as causality in NL requirements is still barely understood and, thus, barely detectable. Objective: In our empirically informed research, we aim at better understanding the notion of causality and supporting the automatic extraction of causal relations in NL requirements. Method: In a first case study, we investigate 14.983 sentences from 53 requirements documents to understand the extent and form in which causality occurs. Second, we present and evaluate a tool-supported approach, called CiRA, for causality detection. We conclude with a second case study where…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
