Causal Contrastive Learning for Counterfactual Regression Over Time
Mouad El Bouchattaoui, Myriam Tami, Benoit Lepetit, Paul-Henry, Courn\`ede

TL;DR
This paper presents a novel RNN-based approach with contrastive predictive coding and InfoMax principles for long-term counterfactual regression, outperforming transformer-based models in efficiency and accuracy.
Contribution
It introduces the first use of Contrastive Predictive Encoding in causal inference, enhancing long-term treatment effect estimation with invertible representations.
Findings
Achieves state-of-the-art counterfactual estimation results
Effectively captures long-term dependencies with CPC and RNNs
Avoids computational costs of transformer models
Abstract
Estimating treatment effects over time holds significance in various domains, including precision medicine, epidemiology, economy, and marketing. This paper introduces a unique approach to counterfactual regression over time, emphasizing long-term predictions. Distinguishing itself from existing models like Causal Transformer, our approach highlights the efficacy of employing RNNs for long-term forecasting, complemented by Contrastive Predictive Coding (CPC) and Information Maximization (InfoMax). Emphasizing efficiency, we avoid the need for computationally expensive transformers. Leveraging CPC, our method captures long-term dependencies in the presence of time-varying confounders. Notably, recent models have disregarded the importance of invertible representation, compromising identification assumptions. To remedy this, we employ the InfoMax principle, maximizing a lower bound of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Fault Detection and Control Systems · Adversarial Robustness in Machine Learning
MethodsAttention Is All You Need · Softmax · Layer Normalization · InfoNCE · Contrastive Predictive Coding · Linear Layer · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Label Smoothing · Adam
