Treatment effect estimation using the propensity score in clinical trials with historical control

Saki Kanamori; Masahiro Takeuchi

PMC · DOI:10.1186/s12874-023-02127-9·February 22, 2024

Treatment effect estimation using the propensity score in clinical trials with historical control

Saki Kanamori, Masahiro Takeuchi

PDF

Open Access

TL;DR

This paper introduces a new statistical method using propensity scores to better estimate treatment effects in clinical trials that use historical control data.

Contribution

The novel contribution is incorporating information on whether data are from RCT or historical control into the propensity score model.

Findings

01

The proposed method performs similarly to conventional methods when covariate distributions are similar between RCT and historical data.

02

When covariate distributions differ, the proposed method outperforms conventional approaches in estimating treatment effects.

03

The new method is useful even when similarity between RCT and historical data is unknown.

Abstract

Clinical trials assessing new treatment effects require a control group to compare the pure treatment effects. However, in clinical trials on regenerative medicine, rare diseases, and intractable diseases, it may be ethically difficult to assign participants to the control group. In recent years, the use of historical control data has attracted attention as a method for supplementing the number of participants in the control group. When combining historical control data with new randomized controlled trial (RCT) data, the assessment of heterogeneity using outcome data is not sufficient. Therefore, several statistical methods that consider participant outcomes and baseline characteristics, including the propensity score (PS) method have been proposed. We propose a new method considering “information on whether the data are RCT data or not” in the PS model when combining the RCT and…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species1

Homo sapiens(human · species)

Keywords

Historical controlPropensity scoreCausal inferenceRandomized controlled trialClinical trial

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Causal Inference Techniques · Statistical Methods in Clinical Trials · Health Systems, Economic Evaluations, Quality of Life

Full text

Introduction

Clinical trials that assess new treatment effects require a control group to compare the pure treatment effects, which exclude baseline characteristics [1]. Randomized controlled trials (RCTs) are considered the gold standard approach in confirmatory trials for reducing bias and assessing objective effects. However, in clinical trials for regenerative medicine, rare diseases, and intractable diseases, random assignment of participants to the control group may be ethically difficult. Recently, there has been active collection of real-world data and construction of a disease registry [2–4], and the utilization of historical control data has attracted attention as a supplement for the number of control group participants in clinical trials. Appropriate utilization of historical control data can ensure that patients are offered promising treatments faster by reducing the number of participants assigned to control groups, thus accelerating drug development [5, 6]. The U.S. Food and Drug Administration has issued draft guidance on natural history studies for rare disease drug development [7], and further utilization of external control is expected [2–4].

The use of historical control data is still being debated [8–10]. Frequentist approaches include, the pooling method in which historical control data are equated with the new trial control group and merged as is, and the test-then-pool method, which is used after determining the similarity between both outcome data by hypothesis test [11]. Bayesian approaches include power priors [12] and hierarchical modeling [13, 14], which discount the amount of information in historical control data [11, 15]. A previous study proposed a method that calculates the difference between outcome data of a new trial control group and historical control data and used weighting as an estimate of heterogeneity [16]. Evaluation of heterogeneity with outcome data is useful, but not sufficient in situations with different measurement periods and conditions. Besides, the information from historical control data may distort the true results from new trials [15], or conversely, historical control data may be hardly used [16], which poses a large risk for implementation.

In the causal inference framework, propensity scores (PS) [17, 18] may be used to compare groups that are not randomized. The PS indicates the probability of treatment allocation calculated using baseline characteristics. Thus, by aligning the baseline characteristics between treatment groups, it is possible to estimate the treatment effect while minimizing the effect of confounding on treatment allocation. When utilizing historical control data, a method using the PS has been proposed for considering the heterogeneity of baseline characteristics. In general, the matching [19, 20] and inverse probability of treatment weighting (IPTW) [21] methods are used as PS methods [22, 23]. Methods using PS to assess the generalizability of the population participating in RCT to the patient population [24], and to merge RCT data with observational data [25] have also been proposed. Additionally, a method combining the PS methods and Bayesian dynamic borrowing framework has been proposed [26].

Furthermore, as this study considers a special clinical trial that uses historical data in combination with new RCT data includes information on whether the data are RCT or historical control data. This information could be an important confounding factor along with baseline characteristics. Accordingly, we evaluate the performance of the method used for the clinical trial that newly considers “information on whether the data are RCT data or not” in the conventional PS model when estimating the treatment effect using simulation data.

Proposal of the PS model

In a clinical trial in which the primary endpoint is binary outcome $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Y$$\end{document}$ (presence or absence of an event), we assume historical control data are combined with new two-armed RCT data as part of a control group. $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${Y}_{i}=1$$\end{document}$ indicates that an event has occurred, and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${Y}_{i}=0$$\end{document}$ indicates that no event has occurred with participant $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$i\ \left(i=1\dots l\right)$$\end{document}$ . We set $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$T$$\end{document}$ as the treatment group indicator ( $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${T}_{i}=1$$\end{document}$ for the treatment and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${T}_{i}=0$$\end{document}$ for the control groups for participant $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$i$$\end{document}$ ) and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X$$\end{document}$ as the vector of all covariates $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${X}_{j}$$\end{document}$ ( $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$j=1\dots\textit{k}$$\end{document}$ and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${X}_{ij}$$\end{document}$ denotes the $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\textit{j}$$\end{document}$ th covariate of participant $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$i$$\end{document}$ ), which are the possible confounding factors. When estimating the PS, a model would generally be expressed as

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{array}{c}\pi =\mathrm{logit}\left\{{\text{Pr}}\left(T=1|X\right)\right\}={\beta }_{0}+{\beta }_{1}{X}_{1}+{\beta }_{2}{X}_{2}+\dots +{\beta }_{k}{X}_{k},\end{array}$$\end{document}

using a logistic regression [27, 28], where $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\beta_j\ \left(j=1\dots\textit{k}\right)$$\end{document}$ denotes a coefficient of the regression model.

Here, we might consider the information on whether the data were derived from the new RCT or historical control data as an important confounding factor. Therefore, in the proposed method of this study, the PS model newly considers information on whether the data are RCT data or not and sets that information as indicator variable $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_\textit{r}$$\end{document}$ . $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${X}_{ir}=1$$\end{document}$ indicates that participant $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$i$$\end{document}$ is from the RCT, and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${X}_{ir}=-1$$\end{document}$ indicates that participant $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$i$$\end{document}$ is from the historical control group. As a proposed method including $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_\textit{r}$$\end{document}$ , the PS model could be expressed as

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{array}{c}{\pi }^{*}=\mathrm{logit}\left\{{\text{Pr}}\left(T=1|X,{X}_{r}\right)\right\}={\beta }_{0}+{\beta }_{1}{X}_{1}+{\beta }_{2}{X}_{2}+\dots +{\beta }_{k}{X}_{k}+{\beta }_{r}{X}_{r}.\end{array}$$\end{document}

We considered that the performance in estimating the treatment effect between the conventional method using $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi$$\end{document}$ and proposed method using $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\pi }^{*}$$\end{document}$ may vary due to the difference in the distribution of covariates between the RCT and historical control data. In Stimulation study section, we evaluate the performance of the method using simulated data.

As a PS method, although the matching method is easy to understand, there is a possibility that the amount of information will be drastically reduced. In this study, we apply the IPTW method to utilize more information when evaluating the model’s performance. When estimating the treatment effect, each participant’s weight $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w$$\end{document}$ could be $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w=T/{\text{expit}}\left(\pi \right)+\left(1-T\right)/\left\{1-{\text{expit}}\left(\pi \right)\right\}$$\end{document}$ in the conventional method and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w=T/{\text{expit}}\left({\pi }^{*}\right)+\left(1-T\right)/\left\{1-{\text{expit}}\left({\pi }^{*}\right)\right\}$$\end{document}$ in the proposed method.

Simulation study

Settings

In this simulation study, to evaluate the treatment effect, we set the total number of participants as n = 900 and the allocation ratio between the RCT treatment group, RCT control group, and historical control group as 1:1:2. Moreover, we set the outcome event rates as 50%, 10%, and 5%; the odds ratios as 1.0, 2.0, 5.0, and 10.0; and the two-sided significance level as 5%. Furthermore, we also examined cases where the number of participants was small. The simulation results assuming the total number of participants as n = 200 are shown. The method and conditions in the simulation setting are the same as those shown in the setting assuming that n = 900, except for the total number of participants. The supplementary examination was conducted by assuming a situation with odds ratios of 1.5 and 2.5 (Additional file 1: Appendix A). In addition, we assume a situation wherein the allocation ratios are different (Additional file 1: Appendix B) and one of the four covariates is binary data (Additional file 1: Appendix C). We also conducted simulations in which the assignment of treatment variables was completely random in the RCT population (Additional file 1: Appendix D), and simulations were based on parameter settings from the actual clinical trial [29] (Additional file 1: Appendix G). To estimate the treatment effect, the IPTW using the PS method is applied, and the odds ratio based on the weight is estimated by the logistic regression model.

The performance measurements of the simulation result include the following: (1) difference of the estimated log odds ratio from the true log odds ratio (bias), (2) mean squared error (MSE), (3) coverage of 95% confidence interval (coverage), and (4) type I error rate and power. The simulation data are generated while assuming two scenarios wherein the distribution of covariates is either similar or not similar between the RCT and historical control data.

Scenario (I)

In this situation, the distribution of covariates is similar between the RCT and historical control data. From the multivariate standard normal distribution, four covariates are generated for participant $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$i$$\end{document}$ as

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{array}{c}\left\{X_{i1},X_{i2},X_{i3},X_{i4}\right\}\sim N\left(0,1\right)\end{array}.$$\end{document}

Here, the true PS model $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi_{\textit{i},\textit{true}}$$\end{document}$ is

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{array}{c}\pi_{\textit{i},\textit{true}}=\mathrm{logit}\left\{\text{Pr}\left(T=1\vert X\right)\right\}=\beta_0+\beta_1X_{i1}+\beta_2X_{i2}+\beta_3X_{i3}+\beta_4X_{i4},\end{array}$$\end{document}

and the parameters are $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\left\{{\beta }_{0}, { \beta }_{1}, { \beta }_{2},{ \beta }_{3},{ \beta }_{4}\right\}=\left\{{b}_{0}, 1.00, -0.50, 0.25, 0.10\right\}$$\end{document}$ . $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${b}_{0}$$\end{document}$ is a constant correction value corresponding to the treatment allocation ratio (Additional file 1: Appendix E). Based on Eq. (4), each participant’s treatment allocation is determined from the Bernoulli distribution:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{array}{c}T_i\sim Bernoulli\left\{\frac{\text{exp}\left(\pi_{\textit{i},\textit{true}}\right)}{1+\text{exp}\left(\pi_{\textit{i},\textit{true}}\right)}\right\}.\end{array}$$\end{document}

The model that generates outcome data $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${y}_{i}$$\end{document}$ is as follows:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{array}{c}y_\textit{i}=\mathrm{logit}\left\{\text{Pr}\left(Y=1\vert X\right)\right\}=\alpha_0+\beta_{treat}T_i+\alpha_1X_{i1}+{\alpha_2X}_{i2}+{\alpha_3X}_{i3}+\alpha_4X_{i4}+\varepsilon_i/100,\end{array}$$\end{document}

where $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\left\{{\alpha }_{0},{ \alpha }_{1},{ \alpha }_{2},{ \alpha }_{3},{ \alpha }_{4}\right\}=\left\{{a}_{0}, 0.274, 0.137, -0.137, 0.137\right\}$$\end{document}$ . Here, $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\beta }_{treat}$$\end{document}$ is the true log odds ratio of the treatment effect, and the error term $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varepsilon }_{i}\sim N\left(0, 1\right)$$\end{document}$ is generated according to independent normal distribution. Besides, $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${a}_{0}$$\end{document}$ is a constant correction value corresponding to the outcome event rate (Additional file 1: Appendix E). Based on Eq. (6), each participant’s outcome $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${Y}_{i}$$\end{document}$ is determined from the Bernoulli distribution:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{array}{c}Y_i\sim Bernoulli\left\{\frac{\text{exp}\left(y_\textit{i}\right)}{1+\text{exp}\left(y_\textit{i}\right)}\right\}.\end{array}$$\end{document}

Scenario (II)

In this situation, the distribution of covariates is not similar between the RCT and historical control data. As with scenario (I), after generating covariates from the multivariate standard normal distribution,

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{array}{c}\left\{{{X}^\prime}_{i1},{ {X}^\prime}_{i2},{ {X}^\prime}_{i3},{ {X}^\prime}_{i4}\right\}\sim N\left(0, 1\right),\end{array}$$\end{document}

each covariate in the RCT data are transformed as follows:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{array}{c}\left\{{X}_{i1}={{X}^\prime}_{i1}-1, { X}_{i2}={ {X}^\prime}_{i2}\times 0.7, { X}_{i3}={\text{ln}}\left|{{X}^\prime}_{i3}\right|,{ X}_{i4}={ {X}^\prime}_{i4}\right\}.\end{array}$$\end{document}

For historical control data, the covariates without transformation, $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${X}_{i1},{ X}_{i2},{ X}_{i3},\mathrm{and}\, {X}_{i4}$$\end{document}$ , are simply used from the generation of standard multivariate normal distributions, that is,

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{array}{c}\left\{{X}_{i1}={{X}^\prime}_{i1}, { X}_{i2}={ {X}^\prime}_{i2}, { X}_{i3}={{X}^\prime}_{i3},{ X}_{i4}={ {X}^\prime}_{i4}\right\}.\end{array}$$\end{document}

Here, the true PS model $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi_{\textit{i},\textit{true}}^{\mathit\ast}$$\end{document}$ is provided as

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{array}{c}\pi_{\textit{i},\textit{true}}^{\mathit\ast}=\mathrm{logit}\left\{\text{Pr}\left(T=1\vert X,X_r\right)\right\}=\beta_0+{\beta_1X}_{i1}+{\beta_2X}_{i2}+{\beta_3X}_{i3}+{\beta_4X}_{i4}+{\beta_rX}_{ir},\end{array}$$\end{document}

where $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\left\{{\beta }_{0}, { \beta }_{1}, { \beta }_{2},{ \beta }_{3},{ \beta }_{4},{ \beta }_{r}\right\}=\left\{\left({b}_{0}-{b}_{r}\right), 1.00, -0.50, 0.25, 0.10, {b}_{r}\right\}$$\end{document}$ . $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${b}_{r}$$\end{document}$ is the coefficient value of indicator variable $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${X}_{r}$$\end{document}$ in the true PS model for each treatment allocation ratio (Additional file 1: Appendix F). These parameters are simultaneously calculated using a true PS model for only RCT data,

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{array}{c}\pi_{\textit{i},\textit{true},\textit{RCT}}=\mathrm{logit}\left\{\text{Pr}\left(T=1\vert X\right)\right\}=b_0+{1.00X}_{i1}-{0.50X}_{i2}+{0.25X}_{i3}+{0.10X}_{i4};\end{array}$$\end{document}

the true PS model for only historical control data,

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{array}{c}\pi_{\textit{i},\textit{true},\textit{HC}}=\mathrm{logit}\left\{\text{Pr}\left(T=1\right)\right\}=0;\end{array}$$\end{document}

and a covariate of each participant (Additional file 1: Appendix F; calculation method). Based on Eq. (12), treatment allocation $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${T}_{i}$$\end{document}$ for each participant is determined from the Bernoulli distribution:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{array}{c}T_i\sim Bernoulli\left\{\frac{\text{exp}\left(\pi_{\textit{i},\textit{true}}^{\mathit\ast}\right)}{1+\text{exp}\left(\pi_{\textit{i},\textit{true}}^{\mathit\ast}\right)}\right\}.\end{array}$$\end{document}

Outcome data $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${y}_{i}$$\end{document}$ are generated by

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{array}{c}y_\textit{i}=\mathrm{logit}\left\{\text{Pr}\left(Y=1\vert X\right)\right\}=\alpha_0+\beta_{treat}T_i+\alpha_1X_{i1}+\alpha_2X_{i2}+\alpha_3X_{i3}+\alpha_4X_{i4}+\alpha_rX_{ir}+\varepsilon_i/100,\end{array}$$\end{document}

where $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\left\{{\alpha }_{0},{ \alpha }_{1},{ \alpha }_{2},{ \alpha }_{3},{ \alpha }_{4},{ \alpha }_{r}\right\}=\left\{{a}_{0},0.274, 0.137, -0.137, 0.137, 0.137\right\}$$\end{document}$ . Based on Eq. (15), each participant’s outcome $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${Y}_{i}$$\end{document}$ is determined from the Bernoulli distribution:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{array}{c}Y_i\sim Bernoulli\left\{\frac{\text{exp}\left(y_\textit{i}\right)}{1+\text{exp}\left(y_\textit{i}\right)}\right\}.\end{array}$$\end{document}

Results

The usual number of participants

In scenario (I), wherein the distribution of covariates is similar between the RCT and historical control data, not much difference in the proposed and conventional methods was found in the bias, MSE, coverage of 95% confidence interval, and type I error (Table 1).Table 1. Scenario (I): performance of the estimated propensity score (PS) modelPerformance measurementPS modelOutcome event rateOdds ratio1.02.05.010.0Bias $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi$$\end{document}$ (without $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_\textit{r}$$\end{document}$ )50%0.004-0.016-0.034-0.03210%-0.026-0.022-0.018-0.0135%-0.060-0.031-0.0090.017 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\pi }^{*}$$\end{document}$ (with $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_\textit{r}$$\end{document}$ )50%0.0350.014-0.005-0.00710%0.0190.0200.0230.0305%0.0000.0230.0450.074MSE $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi$$\end{document}$ 50%0.0450.0500.0690.09710%0.1170.0920.0860.0975%0.2280.1750.1600.201 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\pi }^{*}$$\end{document}$ 50%0.0370.0390.0520.07410%0.0970.0800.0840.1005%0.1870.1540.1600.217Coverage (%) $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi$$\end{document}$ 50%95.094.694.093.210%93.994.394.894.75%93.293.994.394.4 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\pi }^{*}$$\end{document}$ 50%94.994.894.894.310%94.794.994.694.45%94.394.194.594.1Type I error and power (%) $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi$$\end{document}$ 50%5.086.699.8100.010%6.163.7100.0100.05%6.841.798.5100.0 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\pi }^{*}$$\end{document}$ 50%5.094.199.9100.010%5.272.499.8100.05%5.750.798.499.8 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi$$\end{document}$ (without $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_\textit{r}$$\end{document}$ ): the conventional method; $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\pi }^{*}$$\end{document}$ (with $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_\textit{r}$$\end{document}$ ): the proposed method

On the other hand, in scenario (II), wherein the distribution of covariates is not similar between the RCT and historical control data, the proposed method tended to have a smaller bias, coverage of 95% confidence interval closer to 95%, and a type I error rate closer to 5%. In addition, there was not much difference between the proposed and conventional methods in the MSE (Table 2).Table 2. Scenario (II): performance of the estimated propensity score (PS) modelPerformance measurementPS modelOutcome event rateOdds ratio1.02.05.010.0Bias $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi$$\end{document}$ (without $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_\textit{r}$$\end{document}$ )50%0.1690.1510.1350.12810%0.1540.1550.1510.1515%0.1350.1530.1720.190 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\pi }^{*}$$\end{document}$ (with $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_\textit{r}$$\end{document}$ )50%0.0440.0260.0100.00610%0.0290.0340.0350.0385%0.0070.0320.0620.091MSE $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi$$\end{document}$ 50%0.0530.0490.0520.06510%0.0900.0790.0800.0925%0.1540.1320.1400.212 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\pi }^{*}$$\end{document}$ 50%0.0360.0380.0500.07010%0.0940.0800.0840.1035%0.1830.1500.1610.248Coverage (%) $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi$$\end{document}$ 50%81.585.690.492.710%89.689.690.692.65%91.892.193.294.9 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\pi }^{*}$$\end{document}$ 50%94.495.194.794.310%94.594.194.194.15%94.194.293.793.8Type I error and power (%) $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi$$\end{document}$ 50%18.599.9100.0100.010%10.494.2100.0100.05%8.274.7100.0100.0 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\pi }^{*}$$\end{document}$ 50%5.696.1100.0100.010%5.574.999.9100.05%5.852.598.899.9 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi$$\end{document}$ (without $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_\textit{r}$$\end{document}$ ): the conventional method, $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\pi }^{*}$$\end{document}$ (with $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_\textit{r}$$\end{document}$ ): the proposed method

When the allocation ratios between the RCT treatment group, RCT control group, and historical control group were 2:1:3 (Additional file 1: Appendix Table B.3), 1:1:4 (Additional file 1: Appendix Table B.5), 2:1:6 (Additional file 1: Appendix Table B.6), 2:1:1 (Additional file 1: Appendix Table B.11), and 3:1:2 (Additional file 1: Appendix Table B.12)—that is, different but not extremely skewed—the same tendency in all performance measurements was observed as in the allocation ratio of 1:1:2. However, when the allocation ratios were 9:1:10 (Additional file 1: Appendix Table B.4), 9:1:20 (Additional file 1: Appendix Table B.7), 1:1:18 (Additional file 1: Appendix Table B.8), 2:1:27 (Additional file 1: Appendix Table B.9), and 9:1:90 (Additional file 1: Appendix Table B.10)—that is, extremely skewed—the bias and MSE had increased.

In addition, the same trends were observed for all performance measures when one of the four covariates was binary data (Additional file 1: Appendix Table C.1) as when the four covariates were continuous data.

Moreover, the simulation where the treatment variable in RCT population was generated independent of covariates (Additional file 1: Appendix Table D.1) shown also almost the same result in the text. In a simulation where the parameter settings of an actual clinical trial were applied (Additional file 1: Appendix Table G.1) was also similar result in the text.

Small number of participants

In the case where the total number of participants is n = 200, the same tendency was observed in all performance measurements as in the case where the number of participants is n = 900.

That is, in scenario (I), wherein the distribution of covariates is similar between the RCT and historical control data, not much difference in the proposed and conventional methods was found in the bias, MSE, coverage of 95% confidence interval, and type I error rate (Table 3).Table 3. Scenario (I): performance of the estimated propensity score (PS) model by simulation setting assuming n = 200Performance measurementPS modelOutcome event rateOdds ratio1.02.05.010.0Bias $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi$$\end{document}$ (without $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_\textit{r}$$\end{document}$ )50%-0.0010.0020.0340.11210%-0.199-0.076-0.0070.1045%-1.264-0.3560.3121.424 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\pi }^{*}$$\end{document}$ (with $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_\textit{r}$$\end{document}$ )50%0.0300.0290.0480.10510%-0.1010.0060.0690.1875%-1.087-0.2180.4471.593MSE $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi$$\end{document}$ 50%0.2290.2450.3360.86510%1.8350.5550.6111.6185%22.0167.3587.51625.242 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\pi }^{*}$$\end{document}$ 50%0.1870.2020.2710.71110%1.6360.4930.6051.7025%20.4476.9418.03127.171Coverage (%) $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi$$\end{document}$ 50%93.393.292.590.610%91.892.593.293.95%88.491.292.185.8 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\pi }^{*}$$\end{document}$ 50%94.294.493.893.410%93.693.293.293.65%89.392.892.987.2Type I error and power (%) $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi$$\end{document}$ 50%6.535.788.096.710%8.121.574.994.75%11.416.751.978.2 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\pi }^{*}$$\end{document}$ 50%5.641.292.898.110%6.126.479.395.55%10.621.060.383.1 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi$$\end{document}$ (without $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_\textit{r}$$\end{document}$ ): the conventional method; $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\pi }^{*}$$\end{document}$ (with $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_\textit{r}$$\end{document}$ ): the proposed method

And then, in scenario (II), wherein the distribution of covariates is not similar between the RCT and historical control data, the proposed method tended to have a smaller bias, coverage of 95% confidence interval closer to 95%, and a type I error rate closer to 5%. In addition, there was not much difference between the proposed and conventional methods in the MSE (Table 4).Table 4. Scenario (II): performance of the estimated propensity score (PS) model by simulation setting assuming n = 200Performance measurementPS modelOutcome event rateOdds ratio1.02.05.010.0Bias $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi$$\end{document}$ (without $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_\textit{r}$$\end{document}$ )50%0.1680.1640.1680.22410%0.0320.1320.1770.2685%-0.908-0.1060.5161.758 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\pi }^{*}$$\end{document}$ (with $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${X}_{{\text{r}}}$$\end{document}$ )50%0.0500.0500.0600.12610%-0.0960.0110.0770.1875%-0.989-0.2030.4471.750MSE $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi$$\end{document}$ 50%0.1440.1510.2020.86210%1.6860.4590.3261.4315%19.5746.2747.60527.879 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\pi }^{*}$$\end{document}$ 50%0.1740.1930.2610.88010%1.6400.5320.4211.5845%18.5056.1208.21029.796Coverage (%) $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi$$\end{document}$ 50%92.494.095.295.810%93.994.295.196.65%89.894.695.588.4 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\pi }^{*}$$\end{document}$ 50%94.594.093.694.010%94.094.093.993.75%90.093.493.486.8Type I error and power (%) $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi$$\end{document}$ 50%7.568.699.9100.010%6.039.193.199.55%10.225.272.392.3 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\pi }^{*}$$\end{document}$ 50%5.444.494.198.710%5.926.681.196.25%9.920.798.899.9 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi$$\end{document}$ (without $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_\textit{r}$$\end{document}$ ): the conventional method, $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\pi }^{*}$$\end{document}$ (with $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_\textit{r}$$\end{document}$ ): the proposed method

Discussion

The results in this study suggest that a situation wherein the distribution of covariates is similar between the RCT and historical control data—that is, scenario (I)—the estimation bias of the treatment effect in the PS model would not be affected by including the information on whether the participant data is RCT data or not. On the other hand, a situation wherein the distribution of covariates is not similar between the RCT and historical control data—that is, scenario (II)—the use of the proposed PS method is recommended because the performance of estimating the treatment effect is improved by including the information on whether the participant data is RCT data or not.

As for the relationship between the outcome event rate and performance of estimating the treatment effect, it is considered appropriate that the higher the outcome event rate, the higher the performance of the estimation. Therefore, in the situation where the distributions of covariates are similar, the treatment effect could be estimated appropriately using both the proposed and conventional methods for this situation. Meanwhile, where the distributions of covariates are not similar, a similar tendency is observed when using the proposed method, and so it is considered that the appropriate treatment effect can be estimated. However, in the conventional method, the lower the outcome event rate, the higher the performance that can be estimated, and so there is a possibility that the appropriate treatment effect cannot be estimated.

Moreover, even when the allocation ratio between the RCT treatment group, RCT control group, and historical control group is changed, if the allocation ratio is not extremely skewed, the same consideration is possible as in the allocation ratio of 1:1:2 in this situation. Namely, in the situation where the distributions of covariates are similar, when considering the information on whether the data are RCT data or not in the PS model, the effect on the performance of estimating the treatment effect was not as marked. And also, in the situation where the distributions of covariates are not similar, the performance of estimating the treatment effect was improved by considering whether the data are RCT data or not. Meanwhile, when the allocation ratio was extremely skewed, bias and MSE increased tremendously, and the estimation could not be conducted appropriately. This is because the number of participants in the RCT control group was extremely small when the allocation ratio was extremely biased.

As another situation, even if the total number of participants is small or and the covariates include binary data, the same consideration is possible as that when the total number of participant is n = 900 and the covariates are all continuous data. The same trend is suggested when the treatment variables in the RCT population are considered completely independently and randomly from the covariates. In other words, when the distribution of covariates is similar between the RCT and historical control data, not much difference in performance is found between the proposed and conventional methods to estimate the treatment effect. And, when the distribution of covariates is not similar between the two kinds of data, the proposed method shows higher performance. In addition, the same argument as above can be considered to apply even when there is variation in data such as actual clinical trial data.

For these reasons, when combining the RCT and historical control data in the clinical trial setting, it is important to consider whether the distribution of important participant baseline characteristics that influence the outcomes is similar or not. Moreover, for appropriate utilization of historical control data, it is useful to apply the proposed PS model that considers $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$X_\textit{r}$$\end{document}$ while assessing possible differences. However, when considering the utilization of historical control data to reinforce the number of participants in the RCT control group, it is necessary to simulate several patterns of allocation ratio and evaluate the performance of the allowable range of how small the control group can be from the planning stage of the clinical trial, and use this with caution. In addition, since the proposed method uses PS, the possibility of the presence of unmeasured confounding factors, that is, whether the covariates used in the PS model are sufficient, should also be considered. And, this method is assuming that use single historical control data set, and have limited that could not have considered for difference between two or more historical control data set. Furthermore, in this study, we focused on the treatment effect in the entire population, including historical control data, and investigated a method for estimating the Average Treatment Effect (ATE). However, there may be situations in which it is desirable to estimate the Average Treatment Effect on the Treated (ATT) in the RCT population or treatment group, and we would like to consider the performance evaluation in such cases to be a future issue. While paying attention to issues such as the increase in type I error rate, it is possible to appropriately reduce the number of participants assigned to the RCT control group. We believe that this will help improve the efficiency of clinical trials, solve ethical problems, and thus save more people.

Conclusions

In clinical trials utilizing historical control data, considering information on whether the data are RCT data or not in the proposed PS model is useful for appropriately estimating the treatment effect, even when it is not known whether the RCT data and the historical control data are similar. Promotion of appropriate utilization of historical control data will contribute to the realization of better medical care.

Supplementary Information

Additional file 1: Appendix A. Simulation setting assuming odds ratios 1.5 and 2.5. Appendix B. Simulation setting assuming that the allocation ratio between the RCT treatment group, RCT control group, and historical control data is other than 1:1:2. Appendix C. Simulation setting assuming that one of the covariates is binary data. Appendix D. Simulation setting assuming that the randomized assignment of treatment variables. Appendix E. Probability of treatment allocation correction value b0 and outcome event rate correction value a0. Appendix F. Calculation method of the true PS model in scenario (II). Appendix G. Simulation based on actual clinical trial parameter settings.

Bibliography27

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1International Council on Harmonisation (ICH). Guidance for industry E 9 statistical principles for clinical trials. https://www.fda.gov/media/71336/download. Accessed 30 Mar 2023.
2U.S. Food & Drug Administration. Framework for FDA’s real world evidence program. https://www.fda.gov/media/120060/download. Accessed 30 Mar 2023.
3European Medicines Agency. Discussion paper: use of patient disease registries for regulatory purpose - methodological and operational considerations. https://www.ema.europa.eu/documents/other/discussion-paper-use-patient-disease-registries-regulatory-purposes-methodological-operational_en.docx. Accessed 30 Mar 2023.
4Pharmaceuticals and Medical Devices Agency. Notification: basic principles on utilization of registry for applications. https://www.pmda.go.jp/files/000240806.pdf. Accessed 30 Mar 2023.
5Pocock SJ The combination of randomized and historical controls in clinical trials J Chronic Dis 19762931758810.1016/0021-9681(76)90044-8770493 · doi ↗ · pubmed ↗
6van Rosmalen J Dejardin Dvan Norden Y Including historical data in the analysis of clinical trials: is it worth the effort?Stat Methods Med Res 201827103167318210.1177/096228021769450628322129 PMC 6176344 · doi ↗ · pubmed ↗
7U.S. Food & Drug Administration. Rare diseases: natural history studies for drug development guidance for industry. https://www.fda.gov/media/122425/download. Accessed 30 Mar 2023.
8International Council on Harmonisation (ICH). Guidance for industry E 10 choice of control group in clinical trials. https://www.fda.gov/media/71349/download. Accessed 30 Mar 2023.