Long-term memory effects of an incremental blood pressure intervention in a mortal cohort
Maria Josefsson, Nina Karalija, Michael J Daniels

TL;DR
This study investigates whether lowering blood pressure over 15 years affects memory, considering both mortality and memory outcomes.
Contribution
The study introduces a Bayesian semi-parametric estimation approach and a novel sparsity-inducing Dirichlet prior for longitudinal data.
Findings
No significant memory effects were found from the blood pressure intervention.
The intervention did not strongly affect memory in the overall population or in the always survivor stratum.
The proposed Bayesian method was validated through simulations and compared to other approaches.
Abstract
In the present study, we examine long-term population-level effects on episodic memory of an intervention over 15 years that reduces systolic blood pressure in individuals with hypertension. A limitation with previous research on the potential risk reduction of such interventions is that they do not properly account for the reduction of mortality rates. Hence, one can only speculate whether the effect is due to changes in memory or changes in mortality. Therefore, we extend previous research by providing both an etiological and a prognostic effect estimate. To do this, we propose a Bayesian semi-parametric estimation approach for an incremental threshold intervention, using the extended G-formula. Additionally, we introduce a novel sparsity-inducing Dirichlet prior for longitudinal data, that exploits the longitudinal structure of the data. We demonstrate the usefulness of our approach…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1| A | B | C | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Bias | RMSE | CP (%) | Bias | RMSE | CP (%) | Bias | RMSE | CP (%) | |
|
| |||||||||
| LBART |
|
| 96.1 | 1.85 | 0.34 | 96.0 | 1.73 | 0.32 |
|
| SBART | 0.89 | 0.41 |
| 1.88 | 0.34 |
|
|
| 95.8 |
| SBART-l1 | 0.98 | 0.44 | 92.1 |
|
| 94.7 | 1.85 | 0.34 | 92.8 |
| HDART | 0.99 | 0.46 | 95.8 | 1.99 | 0.37 | 96.1 | 2.02 | 0.37 | 96.4 |
| HBART | 1.00 | 0.45 | 96.4 | 2.12 | 0.39 | 97.0 | 1.94 | 0.35 | 97.5 |
|
| |||||||||
| LBART |
|
| 96.1 | 2.61 | 0.48 | 95.5 | 2.31 |
| 96.0 |
| SBART | 1.24 | 0.56 |
| 2.58 | 0.47 | 95.2 |
| 0.42 |
|
| SBART-l1 | 1.22 | 0.56 | 94.0 |
|
|
| 2.47 | 0.45 | 92.6 |
| HDART | 1.32 | 0.60 | 96.6 | 2.79 | 0.51 | 96.4 | 2.66 | 0.49 | 96.2 |
| HBART | 1.33 | 0.61 | 96.0 | 2.87 | 0.53 | 96.5 | 2.71 | 0.49 | 97.3 |
|
| |||||||||
| LBART |
|
|
| 4.97 | 0.91 |
| 4.67 | 0.85 | 92.6 |
| SBART | 2.33 | 1.06 | 95.0 | 4.84 | 0.88 | 94.5 |
|
|
|
| SBART-l1 | 2.33 | 1.07 | 93.8 |
|
| 96.0 | 4.65 | 0.85 | 90.2 |
| HDART | 2.38 | 1.09 | 95.4 | 5.21 | 0.94 | 96.3 | 4.85 | 0.89 | 96.4 |
| HBART | 2.37 | 1.08 | 95.3 | 5.27 | 0.96 | 96.3 | 4.87 | 0.89 | 96.6 |
- —NIH10.13039/100000002
- —Swedish Research Council10.13039/501100004359
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Statistical Methods and Bayesian Inference · Insurance, Mortality, Demography, Risk Management
Introduction
1
Hypertension is a well-established risk factor for cognitive impairment and dementia (Livingston et al., 2020). However, the extent to which antihypertensive treatment can enhance cognitive abilities remains poorly understood, partly due to its long-term nature. Cognitive changes caused by hypertension likely develop over decades rather than years, making randomized clinical trials (RCTs) difficult to conduct (Iadecola et al., 2016). In this study, we extend previous research by examining the long-term population-level effects of an intervention that reduces systolic blood pressure (sBP) in individuals with hypertension. Our data come from the Betula Study, a representative prospective cohort study on aging, memory, and dementia (Nyberg et al., 2020). A key limitation of previous research is the insufficient consideration of altered mortality rates when estimating risk reduction. To address this, our study provides both etiological and prognostic intervention effect estimates. These estimates are critical for public health practice, offering a more comprehensive understanding of the intervention’s impact on cognitive health.
The G-formula (Robins, 1986) is commonly used to estimate causal effects in longitudinal settings where time-varying confounders influence both treatment and outcome. It simulates an intervention and, under a set of assumptions, yields causal effect estimates when these assumptions hold. This approach can provide valuable insights when RCTs are not feasible, and have been used previously for epidemiological research (eg, Zhou et al., 2020; Taubman et al., 2009). For a binary risk factor, this often involve setting the risk factor to a fixed value (Yes/No) (eg, Zhou et al., 2020), or by modifying a subject’s odds of receiving treatment through a hypothetical intervention based on incremental propensity scores (Kennedy, 2019). However, for a continuous risk factor, setting the treatment dose to a fixed value for all subjects is often considered to result in highly unrealistic scenarios with limited practical use. For example, a hypothetical intervention where all subjects in the population fix their sBP levels at 140 mm Hg would not take into account the natural variation that is also seen among non-hypertensive subjects. Another complicating factor for a continuous risk factor arises when only subjects meeting a diagnostic criteria are offered treatment. For instance, only subjects with hypertension would be offered antihypertensive agents to control their blood pressure levels.
Two approaches for hypothetical interventions on a continuous risk factor, which only intervene on subjects that do not reach a prespecified threshold, are the threshold intervention (Taubman et al., 2009) and the representative intervention (eg, Picciotto et al., 2012; Young et al., 2019). The former assigns the threshold value to all individuals who do not meet the prespecified target, while the latter assigns a value drawn from the distribution among those in the population meeting the target level. Hence, both approaches ensure treatment remains within a prespecified interval. Nevertheless, it is common for a significant proportion of the population to be unable to control their sBP at optimal levels, due to factors such as poor response to medications. One alternative approach in these settings is an incremental intervention, where a subject’s treatment is instead shifted downward (or upwards) by some prespecified function (Haneuse and Rotnitzky, 2013; Muñoz and Van Der Laan, 2012). However, these approaches are not valid for longitudinal data with dropout and deaths.
Attrition due to dropout and deaths is common in longitudinal aging studies, especially when subjects are followed for an extended duration. The G-formula can address both missing response and missing covariate data under an assumption of missing at random (MAR; Robins et al., 1995; Young et al., 2019; Kim et al., 2021). Further strategies for sensitivity analyses have been proposed in situations where the missingness is thought to be missing not at random (MNAR; Josefsson and Daniels, 2021; Josefsson et al., 2023. However, studying cognition after death is typically not defined and is not of interest, leading to consideration of a mortal cohort approach (Dufouil et al., 2004). Mortal cohort inferences truncate subjects’ outcomes after death. The first approach, partly conditional inference (PCI), conditions on the sub-population that is still alive at the end of follow-up (Kurland and Heagerty, 2005; Wen et al., 2018; Wen and Seaman, 2018). The second approach, principal stratification, conditions on the subpopulation that would survive through the end of follow-up regardless of treatment assignment, known as the always survivor stratum (Frangakis and Rubin, 2002; Frangakis et al., 2007). Although the two approaches have similarities, they differ with respect to the type of information they generate. PCI has been proposed for estimating non-causal associations and is particularly useful for prognostication, that is when the interest is in the health outcome among all individuals who are alive (eg, Taubman et al., 2009). In contrast, principal stratification is useful in situations where survival differs between treated and control groups and has been applied when the interest is in etiological effect estimates (eg, Hayden et al., 2005; Tchetgen Tchetgen, 2014; Wang et al., 2017).
A drawback of many approaches for estimating the Survivor Average Causal Effect is that they rely on the assumption of monotonicity for valid inference (see, eg, Josefsson and Daniels, 2021; Tan et al., 2021; Shardell and Ferrucci, 2018). This assumption implies that for a beneficial treatment, individuals who survive under treatment would also have survived in the absence of treatment. However, we consider this to be an implausible assumption in most, if not all, real-world settings. This is particularly true for populations with heterogeneous health profiles or competing risks of mortality. To address this limitation, Zaidi et al. (2019) proposed methods for sensitivity analyses that characterize causal effects at the population level while allowing for violations of monotonicity. In the context of clinical trials, Roy et al. (2008) and Lee et al. (2010) developed methods based on the less restrictive stochastic monotonicity assumption. This assumption relaxes standard monotonicity by allowing the probability that survival is higher under treatment than under control, conditional on covariates, to be less than one. However, these approaches are not valid for prospective data with time-varying treatments and time-varying confounding.
Our study extends previous research by providing both etiological and prognostic intervention effect estimates, which are both critical for public health practice, offering a more comprehensive understanding of the intervention’s impact. For estimation, we propose a Bayesian semi-parametric approach based on the extended G-formula (Robins et al., 2004). In addition, we introduce a novel sparsity-inducing Dirichlet prior designed for regression with longitudinal data, and demonstrate the usefulness of our approach in a simulation relative to other Bayesian tree ensemble approaches.
The paper is organized as follows. Section 2 introduces the intervention in a Bayesian setting; Section 3, the two types of mortal cohort inferences; Section 4, computational details and the sparsity-inducing Dirichlet prior; Section 5, a simulation study; Section 6, the Betula application; and Section 7, conclusions.
Methods
2
Consider a prospective cohort study where measurements on a set of variables are collected on each of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} i = 1, 2, \ldots , n\end{document} subjects over a specified follow-up period at \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} t=1, 2, \ldots ,T\end{document} fixed time points. At time \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} t\end{document} , let \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} X_{it}\end{document} denote the vector of time-varying confounders. We define \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} A_{it}\end{document} as a continuous prognostic variable reflecting actionable treatment recommendations; for simplicity, we refer to \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} A_{it}\end{document} as the “treatment” throughout. Denote the continuous outcome measure by \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} Y_{it}\end{document} , where interest in this study is the outcome measure at the last time point \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} T\end{document} . Furthermore, let \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} S_{it}=1\end{document} indicate if a subject is alive at time point \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} t\end{document} ; and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} S_{it}=0\end{document} otherwise. Let \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} R_{it}=1\end{document} indicate if subject \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} i\end{document} has completed the cognitive testing at time \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} t\end{document} ; and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} R_{it}=0\end{document} otherwise. Let the observed history of a time-varying variable be denoted with an \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \leftarrow\end{document} and the future with an \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \rightarrow\end{document} . At each time point \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} t\end{document} , we assume data are observed in the temporal ordering \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} S_{it}, R_{it}, X_{it}, A_{it}, Y_{it}\end{document} , and for each subject survival status is observed throughout follow-up, while the remaining variables are observed only during the period the subject actively participates in the study. Lower-case letters denote possible realizations of a random variable. We also write \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} H_{it}\end{document} to denote the observed past history prior to the outcome variable at time point \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} t\end{document} . Specifically, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} H_{it}=(\overleftarrow{Y}{it-1}=\overleftarrow{y}{it-1}, \overleftarrow{X}{it}=\overleftarrow{x}{it}, \overleftarrow{R}{it}=\overleftarrow{r}{it}, \overleftarrow{S}{it}=1)\end{document} , with support on the set of possible histories \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \mathcal {H}{it}\end{document} . Note, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \overleftarrow{A}{it}=\overleftarrow{a}{it}\end{document} is not included in \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} H_{it}\end{document} . In what follows, we suppress the subscript \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} i\end{document} to simplify notation.
A hypothetical incremental threshold intervention
2.1
For the Betula data, we are interested in the long-term population-level effects of a 15-year intervention that reduces sBP in individuals with hypertension. As such, we consider two contrasting regimes: \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g_\end{document} , represents a setting where all individuals follow the intervention during the studied period. The natural course, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g_0 = \lbrace \overleftarrow{A}{T} = \overleftarrow{a}{T} \rbrace\end{document} , represents a setting in which all individuals follow their natural progression in the absence of an intervention. At each follow-up time point, we consider an intervention strategy in which treatment assignment is determined by a fixed rule based on the so-called the natural value of treatment (NVT), denoted by \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} A_{t}^{}\end{document} . Following Young et al. (2014), \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} A_{t}^{}\end{document} is defined as the treatment measure that would have been observed at time \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} t\end{document} if the intervention had been discontinued right before \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} t\end{document} . In particular, for the Betula data, we consider an intervention in which we intervene only on subjects whose \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} A_{t}^{} > \tau\end{document} . The threshold \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \tau\end{document} is set by the researcher. We consider three cut-offs using values of 130 mmHg, 140 mmHg, and 150 mmHg for \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \tau\end{document} .
We propose an intervention that assigns a new treatment value to individuals whose \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} A_{t}^{} > \tau\end{document} . The new value is obtained by shifting \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} A_{t}^{}\end{document} downward by \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \delta t\end{document} . To quantify uncertainty in the intervention, we treat the shift \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \delta t\end{document} as an unknown parameter and impose a prior distribution on it. The prior reflects the researcher’s beliefs about the plausible values that treatment can take under the intervention. Specifically, if \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} a_t^{*} \le \tau\end{document} , then the conditional intervention density is \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} p*(a{t} | \overleftarrow{y}{t-1}, a{t}^{}, \overleftarrow{a}{t-1}, \overleftarrow{x}{t}, \overleftarrow{r}{t}, \overleftarrow{s}{t} = 1) = I\lbrace a_t = a_t^{}\rbrace\end{document} . However, if \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} a_t^{} > \tau\end{document} , then \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} p_(a_{t} | \overleftarrow{y}{t-1}, a{t}^{}, \overleftarrow{a}{t-1}, \overleftarrow{x}{t}, \overleftarrow{r}{t}, \overleftarrow{s}{t}=1) = p(\delta _t) I\lbrace a_t = a_t^{} - \delta _t\rbrace\end{document} .
Intervention effects for mortal cohorts
3
In this section, we describe two types of mortal cohort inferences under the proposed intervention: PCI and inference on the the always survivor stratum. Let \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \overleftarrow{Y}_T(g)\end{document} , \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \overleftarrow{A}_T(g)\end{document} , \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \overleftarrow{R}_T(g)\end{document} , \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \overleftarrow{S}_T(g)\end{document} , and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \overleftarrow{X}_T(g)\end{document} represent the counterfactual outcome, treatment, missingness, survival, and covariate histories associated with the regime \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g\end{document} .
Partly conditional inference
3.1
Partly conditional models have been proposed to address truncation by death under hypothetical interventions. For these models, inference is conditional on the sub-population alive at a specific time-point, in our application being alive at end of follow-up (time \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} T\end{document} ). Hence, inferences focus on what we call a partly conditional intervention effect: \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \mathrm{PCIE} = E[Y_T(g_) \mid S_T(g_) = 1] - E[Y_T(g_0) \mid S_T(g_0) = 1]\end{document} .
Although the PCIE gives a crude comparison of differences in potential outcomes under the two regimes, which is useful for prognostication, it often implicitly compares potential outcomes for different underlying populations. For example, it is likely that an sBP intervention may affect survival rates; as such, the population surviving under the natural course will differ from the one surviving under the intervention.
The identification of the PCIE from longitudinal observational data relies on standard assumptions in causal inference: C1, consistency, C2, conditional exchangeability, and C3, sequential positivity. Additionally, we assume: C4, missing at random conditional on survival (MARS) for the outcome, treatment, and time-varying confounders. For a detailed discussion of these assumptions and their implications, see Supplementary Appendix A. Note that, for \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g_{0}\end{document} , assumption C2 implies that, at each time point, treatment assignment is independent of potential outcomes given the observed past, which we denote as C2a. For \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g_{*}\end{document} , this assumption states that the NVT at time \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} t\end{document} has no direct effect on the outcome (or survival) except through future measurements of the treatment. An implication of this assumption is that the NVT is not itself a time-varying confounder (Richardson and Robins, 2013; Young et al., 2014). We denote this assumption as C2b. For the Betula data we perform sensitivity analyses to investigate robustness of results to violations of assumptions C2, C3, and C4, (see Section 6).
Under the assumptions C1, C2a, C3, and C4, the mean outcome at time \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} T\end{document} under regime \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g_{0}\end{document} , can be computed using the g-formula (Robins, 1986) conditioning on survival,
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \begin{eqnarray*} & \int _{\overleftarrow{r}_{T}}\int _{\overleftarrow{x}_{T}}\int _{\overleftarrow{a}_{T}}\int _{\overleftarrow{y}_{T-1}} E[Y_T | \overleftarrow{y}_{T-1}, \overleftarrow{a}_{T}, \overleftarrow{x}_T, \overleftarrow{r}_{T}, \overleftarrow{s}_{T}=1] \\& \times \prod _{k=1}^T p(a_k | \overleftarrow{y}_{k-1}, \overleftarrow{a}_{k-1}, \overleftarrow{x}_{k}, \overleftarrow{r}_{k}, \overleftarrow{s}_{k}=1) \times \\& \qquad p(x_k | \overleftarrow{y}_{k-1}, \overleftarrow{a}_{k-1}, \overleftarrow{x}_{k-1}, \overleftarrow{r}_{k}, \overleftarrow{s}_{k}=1) \times \\& \qquad p(r_k | \overleftarrow{y}_{k-1}, \overleftarrow{a}_{k-1}, \overleftarrow{x}_{k-1}, \overleftarrow{r}_{k-1}, \overleftarrow{s}_{k}=1) \times \\& \qquad p(s_k | \overleftarrow{y}_{k-1}, \overleftarrow{a}_{k-1}, \overleftarrow{x}_{k-1}, \overleftarrow{r}_{k-1}, \overleftarrow{s}_{k-1}=1) \times \\& \qquad p(y_{k-1} | \overleftarrow{y}_{k-2}, \overleftarrow{a}_{k-1}, \overleftarrow{x}_{k-1}, \overleftarrow{r}_{k-1}, \overleftarrow{s}_{k-1}=1) \\& \times \, \displaystyle d\overleftarrow{y}_{T-1} d\overleftarrow{a}_T d\overleftarrow{x}_T d\overleftarrow{r}_T. \end{eqnarray*}\end{document}In contrast, for the intervention, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g_{}\end{document} , under assumptions C1, C2b, C3, and C4, the mean outcome at time \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} T\end{document} can be computed using the Extended g-formula (Robins et al., 2004). The key difference is that \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} p(a_k | \overleftarrow{y}{k-1}, \overleftarrow{a}{k-1}, \overleftarrow{x}{k}, \overleftarrow{r}{k}, \overleftarrow{s}_{k}=1)\end{document} in (1) is replaced by \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \int {a{t}^{}}p_{}(a_k | \overleftarrow{y}_{k-1}, a^{}{k}, \overleftarrow{a}{k-1}, \overleftarrow{x}{k}, \overleftarrow{s}{k}=1) \times p(a^{*}k | \overleftarrow{y}{k-1}, \overleftarrow{a}{k-1}, \overleftarrow{x}{k}, \overleftarrow{s}_{k}=1)\end{document} ; details are given in Supplementary Appendix B.
The PCIE is then computed as the differences in expected outcomes under the two regimes \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g_{0}\end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g_{*}\end{document} . More details are given in Supplementary Appendix C.
Inference on the always survivor stratum
3.2
Conditioning on survival as described in Section 3.1 may introduce bias due to the fact that survival is a post-randomization event (Zhang and Rubin, 2003). Hence, the partly conditional approach in Section 3.1 cannot address etiological research questions in mortal cohorts. An alternative estimand is the causal effect on the subpopulation who would survive through the end of follow-up regardless of treatment regime (Frangakis and Rubin, 2002; Frangakis et al., 2007). Our goal here is to estimate the difference in expected potential outcomes between \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g_{0}\end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g_{*}\end{document} in the always survivor stratum. That is, what we call a survivors average intervention effect: . Note, previous studies have shown a the link between reduced sBP and reduced all-cause mortality (Bundy et al., 2017). Hence, it is likely that an intervention targeting sBP not only may have an effect on cognition but also on mortality rates. In such situations, the PCIE would compare potential outcomes for different underlying populations. In particular, if subjects live longer under the risk-reducing intervention this population may include less healthy subjects who would die under the natural course, masking a possible intervention effect. The SAIE on the other hand, compare potential outcomes from the same population, namely subjects who would survive both when intervening on sBP and under the natural course. As such, it provides an estimate that adjusts for differences in mortality rates between the two regimes. In our application, we are interested in two types of effect estimates: a prognostic effect, the PCIE, and an etiological effect, the SAIE.
For identification of the SAIE, we need two additional assumptions.
C5 Stochastic Monotonicity: \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \Pr [S_{T}(g_) = 1 \mid S_{T}(g_0) = 1] \ge \Pr [S_{T}(g_) = 1 \mid S_{T}(g_0) = 0]\end{document} . In the current application, we impose an intervention that improves individuals’ health. As such, we assume the probability of survival under the intervention \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g_{}\end{document} is higher among those subjects that would survive under the natural course \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g_{0}\end{document} , compared to those who would not. We denote \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \Pr [S_T(g) =1]\end{document} by \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \psi ^g\end{document} for \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g\end{document} in \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \lbrace g_0, g_\rbrace\end{document} , and we assume \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \psi ^{g_} \ge \psi ^{g_0}\end{document} . We further assume \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \Pr [S_T(g_) = 1 \mid S_T(g_0) = 1] = \psi ^{g_} + \lambda \left[\min \left\lbrace 1, \frac{\psi ^{g_}}{\psi ^{g_0}}\right\rbrace - \psi ^{g_}\right]\end{document} , where \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \lambda\end{document} is a sensitivity parameter (Roy et al., 2008; Lee et al., 2010). Note, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} 0\le \lambda \le 1\end{document} . If \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \lambda =1\end{document} this assumption corresponds to a deterministic monotonicity assumption where subjects who were to be alive under the observed regime \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g_{0}\end{document} would also be alive under the intervention \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g_{}\end{document} , i.e., \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} S_{T}(g_0) \le S_{T}(g_)\end{document} . In contrast, when \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \lambda = 0\end{document} , this corresponds to assuming independence between \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} S_{T}(g_0)\end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} S_{T}(g_)\end{document} ; we consider this assumption unlikely in the present application.
C6 Difference in expectations of outcomes when comparing different strata. \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g \in \lbrace g_, g_0\rbrace\end{document} , \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g^{\prime } \in \lbrace g_, g_0\rbrace\end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g \ne g^{\prime }\end{document} , we assume, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} E[Y_T(g) | S_T(g^{\prime }) = S_T(g)= 1] - E[Y_T(g) | S_T(g) = 1, S_T(g^{\prime }) = 0] = \Delta ^g\end{document} , where \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \Delta ^g\end{document} is a sensitivity parameter. We further assume that \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \Delta ^g=\Delta ^{g^{\prime }}=\Delta\end{document} , where the difference in expectations of potential outcomes when comparing different strata is the same for the two contrasting regimes. Note, if \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \Delta =0\end{document} there is no difference in potential outcomes when comparing the always survivor strata to the strata where individuals were to live under regime \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g\end{document} but not under the contrasting regime \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g^{\prime }\end{document} . In contrast, if \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \Delta >0\end{document} this implies a higher potential outcome is observed for the always survivor stratum. In the current application, we expect higher memory scores (better memory) for the always survivor stratum.
The identification details of the SAIE are given in Supplementary Appendix C. Briefly, under assumptions C1-C6, the SAIE is given by: \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \mathrm{PCIE} + \Delta \left\lbrace \psi ^{g_} + \lambda \left(U - \psi ^{g_} \right) \right\rbrace \left(1 - \frac{1}{U} \right)\end{document} , where \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} U = \min \left\lbrace 1, \frac{\psi ^{g_}}{\psi ^{g_0}} \right\rbrace\end{document} . The equation represents a location shift upwards from the PCIE for a beneficial intervention where we expect the participants to improve their health under the intervention compared to the natural course. Note, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \psi ^{g} = E_{(H_T,g)}\lbrace \Pr [S_T=1| H_T,\overleftarrow{A}T=g]\rbrace\end{document} , for \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} g\in \lbrace g, g_0\rbrace\end{document} , are computed by marginalizing over the distributions of the set of temporally preceding variables using the g-formula and the extended g-formula.
A semi-parametric approach for estimation
4
In this section, we develop a Bayesian semi-parametric (BSP) modeling approach using soft Bayesian additive regression trees (BART) models to estimate each of the conditional distributions in (1) and (??) in Supplementary Appendix B. The soft BART model improves upon the traditional hard BART model by addressing the lack of smoothness inherent in ensemble approaches. To enhance valid inference, especially in scenarios with multiple time points and a large set of regressors, we extend the current soft BART algorithm by introducing a novel sparsity-inducing prior for longitudinal data. This longitudinal Dirichlet prior promotes parsimony by penalizing groups of predictors less likely to be important in longitudinal prediction settings. We refer to our model as a longitudinal Dirichlet additive regression trees (LDART) approach.
As in traditional (hard) BART models, the conditional mean function is given by: \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \mu = \mu _{0} + \sum _{b=1}^B g(\mathbf {d};\mathcal {T}_b, \mathcal {M}_b)\end{document} , where \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \mu _0\end{document} is a global intercept, and the BART function is represented as a sum of regression trees. Here, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \mathbf {d}=\lbrace d_1, \ldots , d_P\rbrace\end{document} denotes the set of predictors, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \mathcal {T}_b\end{document} the structure of the \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} b\end{document} th tree, and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \mathcal {M}_b\end{document} the corresponding terminal node parameters. Continuous outcomes are typically modeled with a Gaussian distribution, whereas binary outcomes are modeled using a probit link.
The regularization prior, controlling the complexity of the sum-of-trees, differ in two ways between the standard BART algorithm and soft BART. First, for soft BART, the path down the decision tree is probabilistic, where the sharpness of the decision is controlled by the bandwidth parameter. Thereby adapting to the smoothness level of the regression function even for low dimensional tree ensembles, a known shortcoming for (hard) BART models. Second, the \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} j\end{document} th predictor, from the set of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} P\end{document} predictors, is chosen according to a probability vector \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \mathbf {q} = (q_1, \ldots , q_P)\end{document} , typically specified as uniform. In contrast, soft BART models incorporate a Dirichlet distribution for \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \mathbf {q}\end{document} , such that \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \mathbf {q} \sim \mathcal {D}\left( \alpha /P, \ldots , \alpha /P \right)\end{document} . The sparsity-inducing prior was first introduced for high dimensional settings (Linero, 2018), and can be extended to allow sparsity both within and between groups (Linero and Yang, 2018; Du and Linero, 2019). Determining the degree of sparsity in the model is done by placing a prior on \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \alpha\end{document} , as such, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \frac{\alpha }{\alpha + \rho } \sim Beta\left(a, b \right)\end{document} . Default specifications for \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} a, b\end{document} , and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \rho\end{document} is 0.5, 1, and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} P\end{document} , respectively. This sparsity inducing prior is implemented for both hard and soft BART. We refer to BART models with this prior as Dirichlet additive regression trees (DART).
A grouping prior for longitudinal data
4.1
In a longitudinal setting, one may want to introduce a larger degree of sparsity for predictors collected at earlier time points, if they are thought to be less strongly associated with the response. Below we introduce a novel sparsity-inducing prior.
We consider a hierarchical structure where the predictors are divided into two groups: predictors measured at the current time point \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} t\end{document} and predictors measured at past time points \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} k=1,\ldots ,t-1\end{document} . Moreover, baseline predictors, eg, age and sex, are grouped with the current predictors into one group, since we want to impose the same degree of sparsity on these predictors. Let \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} w\end{document} refer to the probability of a predictor being from the set of current predictors, versus from the set of past predictors; we specify a beta prior, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} w \sim Beta\left(a, b \right)\end{document} . For the current predictors, let \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} v^t_{j}\end{document} refer to the probability of selecting the \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} j\end{document} th predictor from the set of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} P_t\end{document} current predictors \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} {\bf v}^t = (v^t_{1},\ldots ,v^t_{P_t}\end{document} ); \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} {\bf v}^t\end{document} is given a Dirichlet prior \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} {\bf v}^t \sim \mathcal {D}\left( \eta /P_t, \ldots , \eta /P_t \right)\end{document} . So a priori, the probability of selecting the \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} j\end{document} th current predictor to construct a given split is equal to \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} w \times v^t_{j}\end{document} . The past predictors are divided into \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} t-1\end{document} groups, where \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} P_{k}\end{document} is the size of group associated with time \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} k\end{document} . Let \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} u^k\end{document} denote the probability of selecting the \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} k\end{document} th set of time-varying predictors, and let \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} v^k_{j}\end{document} denote the probability of selecting the \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} j\end{document} th predictor in the \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} k\end{document} th set of time-varying predictors. To introduce sparsity related to proximity in time to the response we give \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} {\bf u}\end{document} a grouping Dirichlet prior \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} {\bf u} \sim \mathcal {D}\left( \alpha _1/t-1, \ldots , \alpha _{t-1}/t-1 \right)\end{document} , where we determine the degree of sparsity in the model by placing a prior on \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \alpha k\end{document} such that a larger degree of sparsity is obtained for smaller \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} k\end{document} ’s (see details below). Similarly \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} {\bf v}^k\end{document} is also given Dirichlet prior \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} {\bf v}^k \sim \mathcal {D}\left( \varphi ^k/P_k, \ldots , \varphi ^k/P_k \right)\end{document} . So a priori, the probability of selecting the \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} j\end{document} th predictor from the \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} k\end{document} th time point to construct a given split is equal to \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} (1-w) \times u^k \times v^k{j}\end{document} .
For the hyperparameters \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \eta\end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \varphi ^k\end{document} we use the default Beta hyperprior as implemented in DART. For \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \alpha _j\end{document} we specify a Beta hyperprior \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \frac{\alpha _j}{\alpha _j + \rho _j} \sim Beta\left(c_j, d_j \right)\end{document} ; we let \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} c_j\end{document} vary as a linear function of time, set \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} d_j=1\end{document} , and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \rho _j=t\end{document} for all \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} j\end{document} . To introduce sparsity related to proximity in time to the response we set \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} c_j = 1-\frac{(t - j)}{t-1} \times 0.5\end{document} . This results in ordered hyperpriors for the \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \alpha j\end{document} s, where \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} c_1=0.5\end{document} for predictors measured at the first time point and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} 0.5{<}c{j-1}{<}c_j{<} 1\end{document} for \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} j{>}1\end{document} . That is, a higher degree of sparsity is introduced for predictors measured earlier in time and less sparsity for predictors measured closer in time to the response. The simulations in Section 5 shows the strong performance of these prior choices. Details about the posterior computations of the L-Dirichlet (LDART) prior are obtained in the Supplementary Appendix D.
The PCIE and SAIE use the algorithm described in Supplementary Appendix D, implemented in the R package GcompBART (Josefsson, 2025).
Simulation study
5
In this simulation study, we evaluate the performance of LDART and compare it to other BART specifications.
Data were generated according to a modified version of the Friedman’s five-dimensional test function (Friedman, 1991), mirroring a longitudinal study with five time-varying predictors and a continuous outcome. Specifically, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} f(\overleftarrow{\mathbf {x}}{i}) = \sum {t=1}^T c_t \left[ 10 \sin (\pi x{1,it} x{2,it}) + 20 (x_{3,it} - 0.5)^2 + 10 x_{4,it} + 5 x_{5,it} \right],\end{document} , where
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} Y_{iT} \sim \mathrm{Normal}\left\lbrace f(\overleftarrow{\mathbf {X}}_{i}), \sigma _T\right\rbrace\end{document} , with \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} T=4\end{document} , \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \sigma _T=10\end{document} , and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} c_t\end{document} determines the time-specific association between predictors and outcome.
The longitudinal predictors, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \overleftarrow{\mathbf {X}}{i} = \lbrace \mathbf {X}{i1}, \ldots , \mathbf {X}{iT}\rbrace\end{document} , with \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \mathbf {X}{it} = (x_{1,it}, \ldots , x_{5,it})^{\prime }\end{document} , were generated as correlated uniform random variables. Specifically, for each individual \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} i\end{document} and predictor \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} p = 1, \ldots , 5\end{document} , the sequence \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} (x_{p,i1}, \ldots , x_{p,iT})\end{document} was generated to be correlated over time according to a multivariate normal distribution with mean zero and covariance matrix \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \Sigma\end{document} . Here, the \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} (t,k)\end{document} element of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \Sigma\end{document} is given by \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} d_{tk}^2\end{document} . We specified \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} d_{t,t-1} = 0.4\end{document} , \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} d_{t,t-2} = 0.2\end{document} , and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} d_{t,t-3} = 0.1\end{document} , reflecting stronger correlations for measurements closer in time. The resulting normal variables were then transformed to uniform margins using the standard normal cumulative distribution function, such that \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} x_{p,it} \sim \mathrm{Uniform}(0,1)\end{document} . Correlations were introduced only across time within each predictor, and all between-predictor correlations ( \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} x_{p,it}\end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} x_{p^{\prime },it^{\prime }}\end{document} for \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} p \ne p^{\prime }\end{document} ) were set to zero.
We considered missingness in the form of dropout and assumed that approximately 15% of participants dropped out at each follow-up time point \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} t = 2, \ldots , T\end{document} . Dropout at time \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} j\end{document} was generated as \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \pi j(\overleftarrow{\mathbf {x}}{i}) = \mathrm{expit}!\left[ h_j \sum {t=1}^{j-1} \left(x{1,it} + x_{2,it} + x_{3,it} + x_{4,it} + x_{5,it}\right) \right]\end{document} , where \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \mathrm{expit}(x) = 1 / (1 + \exp (-x))\end{document} , \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} R_{ij} \sim \mathrm{Binomial}\lbrace \pi j(\overleftarrow{\mathbf {x}}{i})\rbrace\end{document} , and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \overleftarrow{h} = (0.75, 0.4, 0.25)\end{document} .
We simulated 2000 datasets of size \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} n = \left(250, 1000, 2000\right)\end{document} , and three different scenarios by varying \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \overleftarrow{c}=\left(c_T,\ldots ,c_1\right)\end{document} . Scenario A: \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \overleftarrow{c}=(1,0.75,0.5,0.25)\end{document} , reflecting a decreasing association over time between the predictors and the outcome; Scenario B: \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \overleftarrow{c}=(1,0,0,0)\end{document} , where only predictors at the closest time point are associated with the outcome; Scenario C: \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \overleftarrow{c}=(0.25,0.25,0.25,0.25)\end{document} , where a stable association with the outcome for all time points. For all scenarios we estimated \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} E_{(H_T, g_0)}\lbrace E[Y_T(g_0) \mid H_T, g_0]\rbrace\end{document} using equation (1).
We compared the performance of our proposed approach (LDART) with two hard BART models: the standard BART algorithm (HBART) and BART with the sparsity-inducing Dirichlet prior (HDART) (as implemented in the R package BART; Sparapani et al., 2021), as well as two soft BART models: the standard SoftBart algorithm (SDART; as implemented in the R package SoftBart; Linero, 2022) and SoftBart incorporating only the predictors from the last time point in the model (SDART-l1). We specified 50 regression trees for each model. To evaluate their performance, we compared relative absolute bias, root-mean-squared error (RMSE), and 95% coverage probabilities (CPs). Unless stated otherwise, models used default priors, with 1200 warm-up and 2000 sampling iterations (thinning = 4) across 48 chains. Simulations were run in R 4.1 at the High Performance Computing Center North.
Simulation results
5.1
The simulation results are presented in detail in Table 1. For all scenarios and models, the absolute relative biases and RMSEs are smaller for the larger sample sizes, as expected. Moreover, the coverage probabilities are close to the nominal values for most scenarios, with CPs (%): 90.2–97.5. The soft BART models generally exhibited the smallest biases and RMSEs across the three scenarios. The best-performing model varied by scenario: in Scenario A, which reflects a decreasing association over time, LDART performed best; in Scenario B, where only predictors at the closest time point are associated with the outcome, SBART-l1 was the winner; and in Scenario C, characterized by a weak and stable association across all time points, SBART performed best. These results highlight the influence of the underlying data-generating mechanisms. Altough LDART was not the winner in Scenarios B and C, it was highly competitive, and that SBART-l1 achieved markedly smaller CPs in Scenarios A and C. The two hard BART models produces biases and RMSEs that are notably larger than the other models. In addition, they produce overly conservative credible intervals in many scenarios, with CPs close to 97.0%.
Table 1: Simulation results for estimating \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \end{document} using (1) and the simulated data as described in Section 5 with 50 regression trees. For scenario A: \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \end{document}, B: \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \end{document}, and C: \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \end{document}. The results are presented in terms of absolute relative bias \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \end{document}, root-mean-squared-error (RMSE) and 95% coverage probabilities as percentages [cp (%)]. The best-performing model for each scenario is highlighted.
Analysis of the betula data
6
In the Betula study, we analyzed the long-term population-level effects of a 15-year intervention reducing sBP in individuals with hypertension, using the incremental threshold approach described in Sections 2–3. The analysis includes 800 participants from eight age cohorts (35–70 years at baseline) followed across four waves at five-year intervals.
Episodic memory was assessed using a composite score ranging from 0 to 76 (Josefsson et al., 2012). Baseline confounders were age, sex, and education, while time-varying covariates included body mass index, smoking, serum cholesterol, self-reported type 2 diabetes, and prior memory performance. Over the 15-year follow-up, 38% of participants dropped out, including 18% who either died or developed dementia. Dementia and death were treated jointly as a single terminating event, and results therefore reflect intervention effects among individuals who remained alive and cognitively healthy. See Table A1 for baseline summary statistics.
We consider an intervention where we assign sBP using the incremental threshold intervention described in Sections 2 and 3. The intervention loosely imply that, at each follow-up time point, sBP levels is shifted downwards by a prespecified value in individuals with hypertension, while no intervention is imposed on individuals without hypertension. See Fig. A1 for an illustration of the shift intervention using the Betula data. Specifically, we consider three cut-offs using values of 130 mmHg, 140 mmHg, and 150 mmHg for \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \tau\end{document} . If a subject’s NVT at times \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} t=2,\ldots ,T\end{document} is greater than \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \tau\end{document} , then sBP is assigned by imposing a incremental shift on the NVT, and otherwise we do not intervene.
Priors for the shift and sensitivity parameters
6.1
Priors for the shift and sensitivity parameters are specified by the analyst and should reflect plausible values. Here, we specify a triangular prior for the shift parameter \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \delta t\end{document} , such that \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \delta t \sim \mathrm{Triangular}(L{\tau }, 0, L{\tau })\end{document} . The lower bound and mode are given by \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} L_{\tau }\approx -44\end{document} mm Hg, corresponding to two standard deviations of baseline sBP, and the upper bound is 0. The intervention implies a negative shift from the NVT with between 0 and 44 mm Hg for individuals with a \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} NVT>140\end{document} at each follow-up time point.
To estimate the SAIE, we also specify priors on the sensitivity parameters in Assumptions C5 and C6. We specify triangular priors for the sensitivity parameters \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \Delta\end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \lambda\end{document} . For \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \Delta\end{document} , we assume \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \Delta \sim \mathrm{Triangular}(0, U_{\Delta }, U_{\Delta }),\end{document} with upper bound (and mode) set to \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} 4.6%\end{document} ; \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} 4.6%\end{document} corresponds to the estimated difference in baseline memory between those who are alive at time point \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} T\end{document} and those who are dead, adjusting for age and sex. It is unlikely the difference between the strata would be bigger than this. We further assume \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \lambda \sim \mathrm{Triangular}(0.5, 1, 1)\end{document} , where the upper bound corresponds to the deterministic monotonicity assumption while the lower bound reflects a moderate correlation between survival under the two regimes.
Results
6.2
We estimated the PCIE and the SAIE using LDART with priors on the sensitivity parameters as described above. For each of the 16 chains, we used a thinning of 100, discarded the first 300 iterations as burn-in, and kept 2400 posterior samples for inference. Note, with a thinning of 100, we ensured precision in terms of close to independent sampling for all age-cohorts. However, a much smaller number (10–20) would have been sufficient in most cases. We sampled pseudo data of size \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} N^{*}=10,000\end{document} at each iteration for the MC integration. Convergence of the posterior samples was monitored using trace plots. Note, the memory outcome, a test score, is presented in %.
Results for the different age-cohorts using 130 mmHg as cut-off are shown in Figure 1. We observe a consistent attenuation in effects as the threshold increases from 130 mmHg to 150 mmHg (see Supplementary Figure A2). In summary, posterior mean memory score at end of follow-up was attenuated with advancing age. This occurred both under the natural course and under the intervention, although to a lesser extent for the latter. Among subjects 70 and older, the shift in memory from the intervention corresponds to postponing memory decline approximately 2 years on average; however this effect is not significant. The results for the PCIE suggest that there is no long-term cognitive effects of monitoring sBP in this population (credible intervals cover 0 for all age-cohorts). We note however, that the effect is notably stronger for the older cohorts compared to middle-aged cohorts. The PCIE and SAIE estimates were similar. For the SAIE, this suggests no long-term effects within the always-survivor stratum, comprising individuals who would survive and remain free of dementia under both the natural course and the intervention. Hence, the non-significant results for the PCIE are not driven by differential mortality rates. We also note that only slightly higher survival probabilities were obtained under the intervention across age cohorts compared to the natural course, leading to rather small differences between the PCIE and the SAIE.
Results from the main analysis of Betula data using 130 mmHg as cut-off and default assumptions and sensitivity parameters as described in Section 6. Panel (A) Age-specific posterior mean memory at end of follow-up under the natural course (plain line) and under the hypothetical intervention (dotted line). Panel (B) Results for the PCIE and the SAIE. Panel (C) Posterior mean survival probabilities at end of follow-up under the natural course and the hypothetical intervention. All plots include 95% pointwise credible intervals.
We performed sensitivity analyses to assess robustness to violations of assumptions C2–C6 (Supplementary Appendix E; Table A2). For clarity, analyses were stratified into middle-aged ( \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} age_{iT}< 65\end{document} ) and older ( \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} age_{iT}\ge 65\end{document} ) groups. Results were largely insensitive to alternative prior specifications for \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \lambda\end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} \Delta\end{document} (data not shown) and remained consistent when accounting for an unmeasured time-varying confounder. Excluding 61 subjects with limited propensity overlap slightly attenuated effects but did not alter conclusions. Assuming missing-not-at-random (MNARS) outcomes reduced posterior mean memory scores under both regimes, yet PCIE estimates remained stable. Results were similar when fitting alternative soft BART variants (S-DART, S-DART-l1) instead of LDART.
Conclusions
7
Using data from a prospective cohort study on aging, cognition, and dementia, we investigated population effects on episodic memory of an hypothetical incremental threshold intervention over 15 years that reduced sBP in individuals with hypertension. In our study, there were no significant prognostic or etiological effects, although effects were markedly stronger in older individuals. Nevertheless, these findings provide important insights into long-term blood pressure control and cognitive aging, suggesting that sBP interventions are unlikely to have a strong effect on memory, whether at the overall population level or among individuals who would remain alive and dementia free under both the natural course and the intervention. Importantly, the methodology applied in this study has broad applicability to interventional research in contexts where mortality is a major concern, including conditions as diverse as cancer, COVID-19, and congestive heart failure.
We propose a Bayesian semi-parametric approach for the proposed incremental threshold intervention, extending prior research in several ways. First, our approach introduces an incremental intervention for prospective cohort data with attrition, offering a viable alternative when clinical trials are not available or feasible to conduct. Second, we offer two types of mortal cohort inferences, providing both prognostic and etiological effect estimates, both of which are critical for public health practice, especially in older populations. Last, our modeling approach employs soft BART models with a novel sparsity-inducing prior for longitudinal data, promoting parsimony in longitudinal prediction settings and demonstrating good performance in simulations.
Concerns have been raised about principal stratification methods relying on assumptions that are often hard to justify in practice (eg, Feller et al., 2017). Monotonicity is one such assumption. Our approach relies on the less strong stochastic monotonicity assumption and uses sensitivity parameters that allow researcher to investigate sensitivity of results to this unverifiable assumption. However, a limitation with proposed principal stratification approach is that we only included age as a survival-predictive covariate for partial identification of the SAIE. In future work we will explore the possibility of incorporating more baseline covariates (see for example Roy et al., 2008; Tchetgen Tchetgen, 2014; Wang et al., 2017).
Supplementary Material
ujaf176_Supplemental_FilesWeb Appendices, Supplementary results, and R code referenced in Sections 3, 4, and 6, are available with this paper at the Biometrics website on Oxford Academic. Web Appendices A–E provide details on assumptions, the extended G-formula, SAIE identification, computation (Dirichlet prior, G-computation), and supplementary Betula results. The R package GcompBART is available on https://github.com/m4ryjo/GcompBART (Josefsson, 2025).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Bundy J. D., Li C., Stuchlik P., Bu X., Kelly T. N., Mills K. T. et al. (2017). Systolic blood pressure reduction and risk of cardiovascular disease and mortality: a systematic review and network meta-analysis. JAMA Cardiology, 2, 775–781.28564682 10.1001/jamacardio.2017.1421 PMC 5710614 · doi ↗ · pubmed ↗
- 2Du J., Linero A. (2019). Incorporating grouping information into Bayesian decision tree ensembles. In Proceedings of the 36th International Conference on Machine Learning, pp. 1686–1695., Long Beach, California, USA: PMLR.
- 3Dufouil C., Brayne C., Clayton D. (2004). Analysis of longitudinal studies with death and drop-out: a case study. Statistics in Medicine, 23, 2215–2226.15236426 10.1002/sim.1821 · doi ↗ · pubmed ↗
- 4Feller A., Mealli F., Miratrix L. (2017). Principal score methods: Assumptions, extensions, and practical considerations. Journal of Educational and Behavioral Statistics, 42, 726–758.
- 5Frangakis C. E., Rubin D. B. (2002). Principal stratification in causal inference. Biometrics, 58, 21–29.11890317 10.1111/j.0006-341x.2002.00021.x PMC 4137767 · doi ↗ · pubmed ↗
- 6Frangakis C. E., Rubin D. B., An M.-W., Mac Kenzie E. (2007). Principal stratification designs to estimate input data missing due to death. Biometrics, 63, 641–649.17824995 10.1111/j.1541-0420.2007.00847_1.x · doi ↗ · pubmed ↗
- 7Friedman J. H. (1991). Multivariate adaptive regression splines. The Annals of Statistics, 19, 1–67.
- 8Haneuse S., Rotnitzky A. (2013). Estimation of the effect of interventions that modify the received treatment. Statistics in Medicine, 32, 5260–5277.23913589 10.1002/sim.5907 · doi ↗ · pubmed ↗
