Observing the unobserved confounding through its effects: toward randomized trial-like estimates from real-world survival data
Vasiliki Stoumpou, Dimitris Bertsimas, Samuel Singer, Georgios Antonios Margonis

TL;DR
This paper introduces a three-step framework to infer and balance unobserved confounding factors in observational survival data, aiming to produce estimates similar to randomized trials.
Contribution
The authors develop a novel method to infer latent prognostic factors from survival data and demonstrate its effectiveness in reducing confounding bias.
Findings
Balancing the inferred latent factor improved agreement with RCT benchmarks.
The method reduced log-HR errors by approximately ten-fold in strong settings.
Balancing U across centers decreased cross-center survival estimate variability.
Abstract
Background: Randomized controlled trials (RCTs) are costly, time-consuming, and often infeasible, while treatment-effect estimation from observational data is limited by unobserved confounding. Methods: We developed a three-step framework to address unobserved confounding in observational survival data. First, we infer a latent prognostic factor (U) from restricted mean survival time (RMST) discrepancies between patients with similar observed factors, the same treatment, and divergent outcomes, leveraging the idea that the aggregate effect of unmeasured factors can be inferred even if individual factors cannot. Second, we balance U with observed baseline covariates using prognostic matching, entropy balancing, or inverse probability of treatment weighting. Third, we apply multivariable survival analysis to estimate hazard ratios (HRs). We evaluated the framework in three observational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
