Propensity Process: a Balancing Functional
Pallavi S. Mishra-Kalyani, Brent A. Johnson, Qi Long

TL;DR
This paper introduces the propensity process, a novel extension of the propensity score that balances entire time-varying covariate histories in observational studies with irregular treatment timings.
Contribution
The paper proposes the propensity process, a new method that balances complex covariate histories and enhances causal inference in observational data with irregular treatment timing.
Findings
Propensity process balances entire covariate history.
Treatment assignment is strongly ignorable given the propensity process.
Method demonstrated using ALS Registry data.
Abstract
In observational clinic registries, time to treatment is often of interest, but treatment can be given at any time during follow-up and there is no structure or intervention to ensure regular clinic visits for data collection. To address these challenges, we introduce the time-dependent propensity process as a generalization of the propensity score. We show that the propensity process balances the entire time-varying covariate history which cannot be achieved by existing propensity score methods and that treatment assignment is strongly ignorable conditional on the propensity process. We develop methods for estimating the propensity process using observed data and for matching based on the propensity process. We illustrate the propensity process method using the Emory Amyotrophic Lateral Sclerosis (ALS) Registry data.
| Prior to | Propensity | Generalized | Propensity | |
|---|---|---|---|---|
| Covariate | Matching | Function | Propensity Score | Process |
| Body mass index | 0.277 | 0.245 | 0.986 | 0.991 |
| Forced vital capacity | 0.764 | 0.539 | 0.201 | 0.317 |
| Negative inspiratory force | 0.151 | 0.022 | 0.016 | 0.704 |
| Age | 0.162 | 0.718 | 0.378 | 0.195 |
| Sex | 0.577 | 0.695 | 0.002 | 0.706 |
| Site | 0.001 | 0.003 | 1.000 | 0.341 |
| Time from diagnosis | 0.676 | 0.633 | 0.033 | 0.854 |
| Median Difference | P-value | |
|---|---|---|
| Naïve | 0.035 | 0.673 |
| Propensity Function | 0.030 | 0.466 |
| Generalized Propensity Score | 0.360 | 0.453 |
| Propensity Process | 0.830 | 0.022 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Statistical Methods and Inference · Economic Policies and Impacts
Propensity Process: a Balancing Functional
Pallavi S. Mishra-Kalyani
Department of Biostatistics and Bioinformatics
Emory University
Brent A. Johnson
Department of Biostatistics and Computational Biology
University of Rochester
and
Qi Long11footnotemark: 1
Department of Biostatistics, Epidemiology, and Informatics
University of Pennsylvania
Abstract
In observational clinic registries, time to treatment is often of interest, but treatment can be given at any time during follow-up and there is no structure or intervention to ensure regular clinic visits for data collection. To address these challenges, we introduce the time-dependent propensity process as a generalization of the propensity score. We show that the propensity process balances the entire time-varying covariate history which cannot be achieved by existing propensity score methods and that treatment assignment is strongly ignorable conditional on the propensity process. We develop methods for estimating the propensity process using observed data and for matching based on the propensity process. We illustrate the propensity process method using the Emory Amyotrophic Lateral Sclerosis (ALS) Registry data.
Keywords: Balancing Score; Generalized Propensity Score; Propensity Process; Propensity Score; Observational Registry; Time-Varying Covariates
1 Introduction
Amyotrophic lateral sclerosis (ALS) is a rare progressive neurological disorder resulting in the degeneration of both upper motor neurons of the cerebral cortex and lower motor neurons of the spinal cord and peripheral nervous system, with a very poor prognosis. Currently, there is no cure for ALS and clinical care is generally limited to treating secondary infections and palliative care, such as surgically inserting a percutaneous endogastrostomy (PEG) tube to provide enteral nutrition for individuals having difficulty swallowing (Procaccini and Nemergut, 2008). Our objective is to assess the effect of inserting a PEG feeding tube on preventing weight loss. PEG insertion is an individual decision and one that must be made while the individual is strong enough to proceed with surgery. Hence, a randomized controlled trial to study the effect of PEG would be implausible. We develop new methods to evaluate PEG using data from the Emory ALS Clinic registry.
Let denote the continuously-defined time of PEG insertion for a randomly selected patient from the population. The observed outcome is collected at or just after a fixed point at time , which consequently restricts the time of PEG insertion. If subjects were randomly assigned to receive PEG prior to and randomly assigned to treatment times, then both treatment effect and dose-response curve could be estimated using standard methods. However, treatment assignment depends on patient characteristics and confounds the effect of treatment on outcome. To remove confounding associated with covariate imbalance among treatment levels, we rely on the general concept of the propensity score (Rosenbaum and Rubin, 1983; Rubin and Thomas, 1996).
When treatment assignment is binary, the propensity score (Rosenbaum and Rubin, 1983) is defined as probability of receiving a treatment given a set of observed variables. Generalizations of the propensity score as a balancing score have been investigated in various settings (Hirano and Imbens, 2004; Imai and Van Dyk, 2004; Hansen, 2008; Allen and Satten, 2011; Hu et al., 2014). For continuously-defined treatment levels, Hirano and Imbens (2004) proposed a direct translation of the propensity score by replacing the conditional probability mass function with the conditional density function of treatment assignment given covariates, known as a generalized propensity score (GPS); while this approach leads to as many propensity scores as there are levels of the treatment it uses only one single score at a time. Although Imai and Van Dyk (2004) similarly found that the conditional density function of treatment assignment given covariates could serve as a propensity score, they noted potential limitations of this approach and suggested instead using the linear predictor in regression models or other summary statistic that are of finite dimension. When treatment assignment occurs over time as in the case where an individual chooses to receive PEG insertion or not, we must allow for the possibility of time-dependent confounding. To this end, let denote a set of -dimensional time-dependent covariates at time and denote the history of covariates up to time . Then, the probability of treatment assignment at time given the covariate history up to time is
[TABLE]
where and the hazard function is
[TABLE]
Because uniquely parameterises , either model (1) or model (2) may be regarded as a legitimate treatment assignment model for continuous treatment with time-independent or time-dependent confounding (Li et al., 2001; Lu, 2005).
Of note, is a function of the entire covariate history , whereas the hazard function is a function of only. This subtle, yet important difference can lead to difficulties when extending methods proposed by Imai and Van Dyk (2004) and Hirano and Imbens (2004) to time-dependent confounding via standard hazard modeling. In addition, both Li et al. (2001) and Lu (2005) used the hazard function as a GPS for matching which allows for balancing at the time of treatment in a matched set. However, they did not establish the strong ignorability of treatment assignment given their time-dependent GPS; this property does not hold if is associated with rather than just , in which case their proposed procedures may not lead to valid causal inference. Additionally, their proposed methods are only applicable to studies with data routinely collected at regular intervals, which is often not true in clinical registries.
We propose the propensity process to correct for confounding in observational studies by balancing the covariate history . After the propensity process is estimated, bias-corrected data analyses can be achieved through matching or stratification (Rosenbaum and Rubin, 1983). Establishing formally the theoretical properties of the propensity process for time-independent confounding requires different arguments than those presented in Imai and Van Dyk (2004).
2 Methods
2.1 Notation and Assumptions
Our framework is constructed through potential outcomes (Rubin, 2005). For , we define as the treatment time restricted to time and as the treatment time restricted to time , where denotes the minimum of and . Let define the set of potential treatment times restricted to , , where means that a patient did not receive PEG treatment before . Let be the potential outcome if a subject received PEG treatment at time , , and the potential outcome if a subject did not receive PEG treatment in the interval . It follows that denotes the potential outcome if a subject did not receive PEG treatment in the interval . We also define the treatment-free potential covariate process . Then, the set of potential outcomes and treatment-free potential covariate process for a randomly selected subject from the population is when treatment time is restricted at , . In contrast, the observed data are , where the observed outcome , and the observed covariate history .
Given , we define the propensity process as the sample path of the hazard function from baseline to time , i.e.,
[TABLE]
noting that is dependent on . As is observable only up to , is estimable only up to . While this concept seems similar to the propensity function (Imai and Van Dyk, 2004), the distinguishing factor of the propensity process is that depends on and is of infinite dimension and cannot be fully estimated for subjects receiving PEG before , whereas the propensity function in Imai and Van Dyk (2004) only allows for incorporation of time-independent covariates and can be estimated for all subjects.
In our framework, we make two assumptions.
Assumption 1** (Stable unit treatment value assumption)**
The distributions of potential outcomes for different subjects are independent of one another.
Assumption 2** (Strong Ignorability)**
For every , and for all , , and .
Assumption 1 is a common assumption in causal inference. However, our Assumption 2 is defined for each time point and differs from the standard strong ignorability of treatment assignment assumption used in earlier work for balancing scores. One implication of Assumption 2 is that, conditional on the treatment-free history , receiving treatment at or not is independent of the set of potential outcomes, allowing us to model treatment assignment without conditioning on potential outcomes.
2.2 Main results
We establish the large-sample results of the propensity process assuming that the true propensity process is known along the lines of Rosenbaum and Rubin (1983) and Imai and Van Dyk (2004).
Proposition 1
* is conditionally independent of treatment-free covariate history given , where and are the entire treatment-free covariate history and propensity process, respectively.*
Proposition 1 establishes as a balancing functional that balances the entire covariate history. Proposition 1 requires that is known or can be estimated in the entire domain . In practice, however, we can only observe the covariate process and hence estimate . Proposition 2 establishes the balancing property for every given time point in .
Proposition 2
For every , is conditionally independent of treatment-free covariate history given , where and are the treatment-free covariate history and propensity process through time , respectively.
When in Proposition 2, we have that is independent of treatment-free covariate history given , where is observable and hence is estimable.
Theorem 1
For every , for all , , and .
When in Theorem 1, we have that is independent of potential outcomes given , where is estimable. Several remarks are in order. First, in § 3.1 we suggest modeling the hazard function in (2) through the proportional hazards model (4); one could also use other model formulations for (2) and the results in Propositions 1–2 and Theorem 1 would still apply. Second, Proposition 2 and Theorem 1 provide justifications for matching a subject treated at with an eligible control subject untreated at based on the propensity process up to . It follows that each matched pair would have the same distribution for the covariate process up to and their potential outcomes are independent of their treatment assignments, allowing for valid causal inference. Third, our Proposition 2 is similar in spirit to Proposition 1 in Lu (2005) but is more general in the sense that the propensity process balances the entire covariate history up to not just the covariates measured at . In addition, Lu (2005) did not establish the strong ignorability of treatment assignment given propensity scores similar to our Theorem 1. Proofs for Propositions 1–2 and Theorem 1 are given in the Appendix.
3 Implementation and Practical Considerations
3.1 Interpolated Propensity Processes
In practice, the propensity process must be estimated from the observed data. The challenge for estimating the propensity process is that we may not observe the complete treatment-free covariate process on ; rather, we only get to observe the covariate process at a coarse set of discrete time points as is the case in the motivating ALS study. Here, we propose to borrow strength across subjects in the study sample by modeling each time-dependent covariate as a random curve over time via nonlinear mixed effects models. This allows a predictive curve to be estimated for the entire treatment-free covariate process for each subject.
First, suppose we parameterize the hazard function in (2) through Cox’s proportional hazards model and define the propensity process through the linear predictor,
[TABLE]
where is the unspecified baseline hazard function. Next, write the observed treatment-free covariate history for the -th subject and -th covariate as , with time-dependent covariate measured at time . We note that the observation times may be different for each subject but are assumed to be the same for all covariates within a subject. Then, for each time-dependent covariate, we fit the model,
[TABLE]
where are independent, mean-zero random errors. To provide greater flexibility in modeling the covariate process over time, we use spline-type models (Ruppert et al., 2003) in (5) where denotes a set of basis functions and and are regression coefficients corresponding to the basis functions for the fixed and random effects, respectively. The interpolated treatment-free can be obtained from model (5) by replacing regression coefficients and with their estimates and , respectively. Then the estimated propensity process can be obtained from (4) by plugging in the interpolated and , where is the estimated regression coefficient vector in the Cox proportional hazards model.
3.2 Matching
The use of matched analyses based on propensity scores for testing causal null hypotheses has been advocated by several other authors; for example, see Rosenbaum and Rubin (1983), Li et al. (2001) and Lu (2005) and references therein. Matching can be performed by minimizing the integrated squared error between the estimated propensity process of a subject who received PEG treatment at time and that of each eligible control with . To accomplish this task, we implement a sequential matching algorithm. We start by ordering chronologically subjects according to their time of PEG treatment or censoring, namely . Set the matched pair counter to and select the subject with the smallest time to PEG treatment, say subject . Define the integrated squared difference in interpolated propensity processes between and as for all subjects in the set of eligible controls . The matched control for is the nearest neighbor in interpolated propensity processes among eligible controls, i.e., . Increment the matched pair counter by one to and select the subject with the smallest time to PEG treatment, say , excluding the two subjects in the first matched pair. Therefore, the set of eligible controls, say , contains subjects: all subjects less the two subjects in the first matched pair and . The matched control for is the nearest neighbor in interpolated propensity processes among the set of eligible controls, . Increment the matched pair counter by one and continue until all treated individuals are matched or until there are no suitable controls available for matching.
4 Analysis of the ALS Registry Data
Using a data set from the Emory ALS Registry, we assess the association of PEG treatment with the change in body mass index (BMI) from baseline to 18 months, i.e., = 18 months. The data set includes 240 patients who survived past and had at least one clinic visit between baseline and . The patients who received PEG did so after their first clinic visit. The timing of recommending PEG by the physician involved many factors and the final decision to have PEG was made by each patient. We model treatment assignment through the proportional hazards model (4) including the following covariates. The baseline risk factors are age at diagnosis, sex, site of onset of disease, negative inspiratory force, and time from diagnosis to the first clinic visit. Two time-varying covariates are forced vital capacity and body mass index, which may not be measured at every clinic visit for every patient. Each time-varying covariate is modeled over time using the mixed model (5), where polynomial spline basis functions are used. The estimated curves are used to interpolate the covariate values needed for estimating the propensity process based on (4).
We compare three alternative approaches to the proposed propensity process. First, a naïve analysis compares all treated individuals to those who are untreated prior to . The second approach is the propensity function (Imai and Van Dyk, 2004) that uses baseline risk factors only in the treatment assignment model (4), where defines the propensity function. The third approach is the interpolated generalized propensity score, which uses the interpolated treatment-free defined in § 3.1 to obtain the GPS for each subject in the spirit of Lu (2005), noting that may not be observed at time for a subject and its eligible controls as defined in § 3.2. The same sequential matching algorithm in § 3.2 is used for all propensity score methods. Our matching algorithm resulted in pairs for the analysis using the propensity function and pairs for both analyses using the generalized propensity score and propensity process.
Following Li et al. (2001) and Lu (2005), we assess balance of covariates by examining Type I errors from a log-rank test of the effect of the covariate on time to treatment, one covariate at a time. In the matched analyses, this model is stratified by the matched pairs. As shown in Table 1, prior to matching, balance is not achieved. While other methods improve covariate balance, they do not balance all covariates. However, matching using the propensity process results in balance across all covariates. This indicates that the propensity process outperforms the baseline propensity function or interpolated GPS in terms of balancing covariates and there may be residual confounding after matching by the other propensity score methods.
After matching, we test the causal null hypothesis that the mean potential outcome is the same whether a patient received PEG treatment at time versus PEG treatment at some time after or untreated by , which can be written as for all . We test this hypothesis by a Wilcoxon signed rank test on matched pairs for all the matched analyses. The Wilcoxon rank sum test is used for hypothesis testing in the naïve analysis. Table 2 presents the median difference in BMI change at 18 months and p-value of the Wilcoxon test for each approach. The propensity process matched analysis suggests a protective effect of PEG on BMI, whereas the other three methods all show effects that are attenuated towards 0 and are not statistically significant.
5 Discussion
Compared to the existing propensity score methods, the propensity process offers the advantage of balancing time-varying covariate history from baseline to time of treatment. A key component of this approach is the interpolation of covariate curves. We propose to use nonlinear mixed models to provide flexibility for modeling covariate history, though there must be enough individual longitudinal data collected to estimate these curves, which is a potential limitation in settings with sparsely collected longitudinal data. However, data interpolation may not be needed in settings such as critical care in intensive care units where time series data including heart rate and blood pressure are continuously recorded (Lehman et al., 2013).
In our data analysis, we use a straightforward approach for hypothesis testing after matching. Future extensions may include conditional likelihood methods for estimating treatment effects based on matched pairs/sets and methods for stratification and covariate adjustment using the propensity process. Additionally, our analysis excludes individuals who died prior to in order to avoid complications due to censoring by death (Rubin et al., 2006; Zhang and Rubin, 2003), which could be addressed in future extensions such that no such exclusion is necessary.
Acknowledgements
The authors thank Dr. Jonathan Glass and Ms. Meraida Polak at the Emory ALS Center for providing the ALS data and Dr. Xin Qi at the Georgia State University for helpful comments.
Appendix: Proofs of Propositions 1, 2, and Theorem 1
Proof of Propositions 1 and 2: We prove Propositions 1 and 2 based on the treatment assignment model defined in (1) and (2). Given ,
[TABLE]
where the first equality is due to the fact that is redundant given . It follows from integrating both sides in that . The result in Proposition 1 follows immediately, i.e., is conditionally independent of given . Along similar lines, we can prove the result in Proposition 2, i.e., is conditionally independent of given for all .
Proof of Theorem 1: For every , all , , and ,
[TABLE]
Let and denote the -field generated by and , respectively. By the definition of in (3), we have and then (7) follows immediately (cf. Billingsley, 2008, Theorem 34.4). (8) is due to Assumption 2 while (9) follows from the fact that is redundant given . The fourth expression (10) is due to Proposition 2 while (11) follows from (Appendix: Proofs of Propositions 1, 2, and Theorem 1), i.e., is independent of .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Allen and Satten (2011) Allen, A. S. and Satten, G. A. (2011). Control for confounding in case-control studies using the stratification score, a retrospective balancing score. American journal of epidemiology , 173(7):752–760.
- 2Billingsley (2008) Billingsley, P. (2008). Probability and measure . John Wiley & Sons.
- 3Hansen (2008) Hansen, B. B. (2008). The prognostic analogue of the propensity score. Biometrika , 95(2):481–488.
- 4Hirano and Imbens (2004) Hirano, K. and Imbens, G. W. (2004). The propensity score with continuous treatments. Applied Bayesian modeling and causal inference from incomplete-data perspectives , pages 73–84.
- 5Hu et al. (2014) Hu, Z., Follmann, D. A., and Wang, N. (2014). Estimation of mean response via the effective balancing score. Biometrika , page asu 022.
- 6Imai and Van Dyk (2004) Imai, K. and Van Dyk, D. A. (2004). Causal inference with general treatment regimes: Generalizing the propensity score. Journal of the American Statistical Association , 99(467):854–866.
- 7Lehman et al. (2013) Lehman, L.-w. H., Nemati, S., Adams, R. P., Moody, G., Malhotra, A., and Mark, R. G. (2013). Tracking progression of patient state of health in critical care using inferred shared dynamics in physiological time series. In Engineering in Medicine and Biology Society (EMBC), 2013 35th Annual International Conference of the IEEE , pages 7072–7075. IEEE.
- 8Li et al. (2001) Li, Y. P., Propert, K. J., and Rosenbaum, P. R. (2001). Balanced risk set matching. Journal of the American Statistical Association , 96(455):870–882.
