On the estimation of inclusion probabilities for weighted analyses of nested case control studies
Tomeu L\'opez-Nieto-Veitch, Rossella De Sabbata, Ryung Kim, Sven Ove Samuelsen, Nathalie C. St{\o}er, Vivian Viallon

TL;DR
This paper evaluates weighted analysis methods in nested case-control studies, identifying scenarios where Kaplan-Meier and GAM weights may introduce bias, and offers a framework for selecting appropriate variables based on causal relationships.
Contribution
It systematically analyzes biases in KM- and GAM-weights in NCC studies and proposes a DAG-based framework for optimal variable inclusion tailored to specific estimands.
Findings
KM-weights can be biased with small case proportions or many matching factors
GAM-weights may be biased if interactions are not modeled
DAG framework guides variable selection based on causal structure
Abstract
Nested case-control (NCC) studies are a widely adopted design in epidemiology to investigate exposure-disease relationships. This paper examines weighted analyses in NCC studies, focusing on two prominent weighting methods: Kaplan-Meier (KM) weights and Generalized Additive Model (GAM) weights. We consider three target estimands: log-hazard ratios, conditional survival, and associations between exposures. While KM- and GAM-weights are generally robust, we identify specific scenarios where they can lead to biased estimates. We demonstrate that KM-weights can lead to biased estimates when a proportion of the originating cohort is effectively ineligible for NCC selection, particularly with small case proportions or numerous matching factors. Instead, GAM-weights can yield biased results if interactions between matching factors influence disease risk and are not adequately incorporated into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Health, Environment, Cognitive Aging · Nutritional Studies and Diet
