SCouT: Synthetic Counterfactuals via Spatiotemporal Transformers for Actionable Healthcare
Bhishma Dedhia, Roshini Balasubramanian, Niraj K. Jha

TL;DR
This paper introduces SCouT, a spatiotemporal Transformer-based method for more accurately estimating counterfactual outcomes in healthcare, capturing complex dynamics and providing actionable insights for policy and clinical decisions.
Contribution
It proposes a novel Transformer model with specialized embeddings and training tasks to improve counterfactual estimation over traditional linear methods.
Findings
Effective on synthetic data with small donor pools
Robust against noise in counterfactual estimation
Provides actionable healthcare insights through simulations
Abstract
The Synthetic Control method has pioneered a class of powerful data-driven techniques to estimate the counterfactual reality of a unit from donor units. At its core, the technique involves a linear model fitted on the pre-intervention period that combines donor outcomes to yield the counterfactual. However, linearly combining spatial information at each time instance using time-agnostic weights fails to capture important inter-unit and intra-unit temporal contexts and complex nonlinear dynamics of real data. We instead propose an approach to use local spatiotemporal information before the onset of the intervention as a promising way to estimate the counterfactual sequence. To this end, we suggest a Transformer model that leverages particular positional embeddings, a modified decoder attention mask, and a novel pre-training task to perform spatiotemporal sequence-to-sequence modeling.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health Research Topics · Machine Learning in Healthcare · Health, Environment, Cognitive Aging
MethodsAttention Is All You Need · Linear Layer · Softmax · Dense Connections · Absolute Position Encodings · Dropout · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Residual Connection · Multi-Head Attention
