Causal Direction from Convergence Time: Faster Training in the True Causal Direction
Abdulrahman Tamim

TL;DR
This paper introduces Causal Computational Asymmetry (CCA), a novel method for causal inference based on the faster convergence of neural networks trained to predict in the causal direction, supported by theoretical and empirical validation.
Contribution
The paper proposes CCA, a new optimization-based causal inference method that distinguishes causal direction by convergence speed, with formal theoretical analysis and strong empirical results.
Findings
CCA correctly identifies causal direction in 26/30 synthetic cases
CCA converges faster in the true causal direction across multiple neural architectures
Theoretical analysis shows residual dependence causes higher irreducible loss in reverse direction
Abstract
We introduce Causal Computational Asymmetry (CCA), a principle for causal direction identification based on optimization dynamics in which one neural network is trained to predict from and another to predict from , and the direction that converges faster is inferred to be causal. Under the additive noise model with and nonlinear and injective, we establish a formal asymmetry: in the reverse direction, residuals remain statistically dependent on the input regardless of approximation quality, inducing a strictly higher irreducible loss floor and non-separable gradient noise in the optimization dynamics, so that the reverse model requires strictly more gradient steps in expectation to reach any fixed loss threshold; consequently, the forward (causal) direction converges in fewer expected optimization steps. CCA operates in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Generative Adversarial Networks and Image Synthesis · Advanced Graph Neural Networks
