Estimating the Causal Effects of Natural Logic Features in Neural NLI Models
Julia Rozanova, Marco Valentino, Andre Freitas

TL;DR
This paper investigates the causal effects of semantic features in neural NLI models by applying causal effect estimation to understand reasoning failures and model robustness in natural language inference tasks.
Contribution
It introduces a causal analysis framework for NLI models focusing on semantic monotonicity and word relations, providing insights into model reasoning and robustness.
Findings
Models show systematic reasoning failures in semantic monotonicity.
Interventions on word pairs significantly affect entailment predictions.
The approach enables comparative profiling of model robustness.
Abstract
Rigorous evaluation of the causal effects of semantic features on language model predictions can be hard to achieve for natural language reasoning problems. However, this is such a desirable form of analysis from both an interpretability and model evaluation perspective, that it is valuable to zone in on specific patterns of reasoning with enough structure and regularity to be able to identify and quantify systematic reasoning failures in widely-used models. In this vein, we pick a portion of the NLI task for which an explicit causal diagram can be systematically constructed: in particular, the case where across two sentences (the premise and hypothesis), two related words/terms occur in a shared context. In this work, we apply causal effect estimation strategies to measure the effect of context interventions (whose effect on the entailment label is mediated by the semantic monotonicity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)
