Towards a Science of Causal Interpretability in Deep Learning for Software Engineering
David N. Palacio

TL;DR
This paper introduces DoCode, a causal inference-based interpretability method for neural code models in software engineering, enhancing trust by explaining model predictions through programming language properties.
Contribution
It presents DoCode, a novel post hoc interpretability framework using causal inference to explain neural code models, addressing limitations of associational methods in software tasks.
Findings
NCMs are sensitive to code syntax changes.
NCMs can learn programming concepts with reduced confounding bias.
Causal interpretability improves trust in AI for software engineering.
Abstract
This dissertation addresses achieving causal interpretability in Deep Learning for Software Engineering (DL4SE). While Neural Code Models (NCMs) show strong performance in automating software tasks, their lack of transparency in causal relationships between inputs and outputs limits full understanding of their capabilities. To build trust in NCMs, researchers and practitioners must explain code predictions. Associational interpretability, which identifies correlations, is often insufficient for tasks requiring intervention and change analysis. To address this, the dissertation introduces DoCode, a novel post hoc interpretability method for NCMs. DoCode uses causal inference to provide programming language-oriented explanations of model predictions. It follows a four-step pipeline: modeling causal problems using Structural Causal Models (SCMs), identifying the causal estimand, estimating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
MethodsHigh-Order Consensuses · Causal inference
