Towards a Science of Causal Interpretability in Deep Learning for Software Engineering

David N. Palacio

arXiv:2505.15023·cs.SE·May 22, 2025

Towards a Science of Causal Interpretability in Deep Learning for Software Engineering

David N. Palacio

PDF

Open Access

TL;DR

This paper introduces DoCode, a causal inference-based interpretability method for neural code models in software engineering, enhancing trust by explaining model predictions through programming language properties.

Contribution

It presents DoCode, a novel post hoc interpretability framework using causal inference to explain neural code models, addressing limitations of associational methods in software tasks.

Findings

01

NCMs are sensitive to code syntax changes.

02

NCMs can learn programming concepts with reduced confounding bias.

03

Causal interpretability improves trust in AI for software engineering.

Abstract

This dissertation addresses achieving causal interpretability in Deep Learning for Software Engineering (DL4SE). While Neural Code Models (NCMs) show strong performance in automating software tasks, their lack of transparency in causal relationships between inputs and outputs limits full understanding of their capabilities. To build trust in NCMs, researchers and practitioners must explain code predictions. Associational interpretability, which identifies correlations, is often insufficient for tasks requiring intervention and change analysis. To address this, the dissertation introduces DoCode, a novel post hoc interpretability method for NCMs. DoCode uses causal inference to provide programming language-oriented explanations of model predictions. It follows a four-step pipeline: modeling causal problems using Structural Causal Models (SCMs), identifying the causal estimand, estimating…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)

MethodsHigh-Order Consensuses · Causal inference