On the Mechanistic Interpretability of Neural Networks for Causality in   Bio-statistics

Jean-Baptiste A. Conan

arXiv:2505.00555·stat.AP·May 2, 2025

On the Mechanistic Interpretability of Neural Networks for Causality in Bio-statistics

Jean-Baptiste A. Conan

PDF

1 Repo

TL;DR

This paper explores how mechanistic interpretability techniques can be applied to neural networks used in causal inference within bio-statistics, aiming to enhance understanding, validation, and comparison of models in health-related research.

Contribution

It demonstrates methods to probe neural network representations, visualize computational pathways, and compare mechanisms across models for improved causal analysis in bio-statistics.

Findings

01

MI tools can validate internal representations of NNs.

02

Visualization reveals how NNs process confounders and treatments.

03

Comparison methods highlight differences between statistical, ML, and NN models.

Abstract

Interpretable insights from predictive models remain critical in bio-statistics, particularly when assessing causality, where classical statistical and machine learning methods often provide inherent clarity. While Neural Networks (NNs) offer powerful capabilities for modeling complex biological data, their traditional "black-box" nature presents challenges for validation and trust in high-stakes health applications. Recent advances in Mechanistic Interpretability (MI) aim to decipher the internal computations learned by these networks. This work investigates the application of MI techniques to NNs within the context of causal inference for bio-statistics. We demonstrate that MI tools can be leveraged to: (1) probe and validate the internal representations learned by NNs, such as those estimating nuisance functions in frameworks like Targeted Minimum Loss-based Estimation (TMLE); (2)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jbccc/mech-interp-biostats
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsCausal inference