On quality standards and the timing of pharmaceutical investment

Wei Wei

PMC · DOI:10.3389/fpubh.2025.1738084·January 21, 2026

On quality standards and the timing of pharmaceutical investment

Wei Wei

PDF

Open Access

TL;DR

This paper examines how quality standards set by health authorities affect when pharmaceutical companies invest in drug development and compares this to an ideal coordinated approach.

Contribution

The study introduces a game-theoretic model to analyze the interplay between regulatory quality standards and firm investment timing.

Findings

01

A social planner achieves higher total value by investing earlier than in decentralized settings.

02

Higher remuneration can speed up investment but may delay commercialization if quality standards are too high.

03

Decentralized decision-making leads to inefficiencies due to constrained optimization.

Abstract

This study analyzes how a health authority's quality standard influences a pharmaceutical firm's R&D investment timing, comparing this decentralized process with a social planner's integrated approach. Drug quality evolves as a geometric Brownian motion, accelerating after irreversible investment. Using real options and optimal stopping theory, we model a sequential game between the regulator and firm, and a social planner who jointly sets both the investment threshold and quality standard. The social planner invests earlier and achieves a higher total value than the combined payoff in the decentralized setting. Increasing the firm's remuneration can accelerate investment, but may delay commercialization if quality standards are set too high. Decentralization creates inefficiencies through constrained optimization, whereas the social planner's unconstrained approach maximizes…

Figures4

Click any figure to enlarge with its caption.

The difference of investment thresholds as uncertainty varies. The dashed line represents the investment threshold of the pharmaceutical firm, while the solid line indicates the investment threshold of the social planner.

The investment threshold of the decentralized project varies with uncertainty, and the range between the maximum and minimum investment threshold values shifts under different sunk costs I and remuneration level R. The parameter values are: μ1 = 0.02, μ2 = 0.2, σ = [0.001, 0.9], I = 0.5 or 1, ρ = 0.5, C = 5, K = 10, and R=10, 30 or 60. In the figures above, D represents the difference between the maximum and minimum investment threshold. (A) I = 0.5, R = 10. (B) I = 0.5, R = 30. (C) I = 0.5, R = 60. (D) I = 1, R = 10. (E) I = 1, R = 30. (F) I = 1, R = 60.

The difference of project values as uncertainty varies, when q<qII<qcI. The dashed line represents the total value of PF and HA in the decentralized project, while the solid line shows the project's value managed by SP.

The difference of project values as uncertainty varies, when qII<qcI<q. The dashed line represents the total value of PF and HA in the decentralized project, while the solid line shows the project's value managed by SP.

Equations30

Keywords

health authorityoptimal stoppingpharmaceutical investmentquality standardsreal optionssocial planner

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCapital Investment and Risk Analysis · Pharmaceutical Economics and Policy · Health Systems, Economic Evaluations, Quality of Life

Full text

Introduction

1

This study analyzes a central issue in drug innovation policy: how a health authority's quality standard for a new drug affects a pharmaceutical firm's R&D investment timing under uncertainty. We framed this as a strategic interaction problem, modeled through a sequential game between a regulator (who sets the standard) and a firm (who decides when to invest). To assess the efficiency of this decentralized process, we compare it with the benchmark of a social planner who integrates both the investment timing and standard-setting decisions.

The incentive misalignment of this decentralized structure is particularly relevant for fighting Neglected Tropical Diseases (NTDs). These diseases, defined by the WHO, are causing severe health and economic impacts on more than 1.74 billion people,1 primarily in the poorest developing regions. Historically, there has been insufficient private investment in R&D for NTDs due to a perceived lack of a profitable market (1). To address this, public-sector actors and partnerships have employed pull mechanisms such as Advance Purchase Commitments (APCs) (2), (3), (4). These contracts specify not only a price (remuneration) but also a quality or efficacy threshold that the drug must meet. Thus, the question of how a publicly-set quality standard shapes private investment timing is not only theoretical, but is a question in designing effective incentives for diseases of poverty.

While APCs are recognized as a key tool to stimulate R&D for neglected diseases (5), their design involves critical tradeoffs (6). The level of remuneration must balance compensating the firm for risk with fiscal constraints for the funder. Similarly, the mandated quality standard must balance the requirement for a highly effective drug with the need to encourage timely development. Recent analyses of incentive mechanisms, including APCs for vaccines and antibiotics, continue to highlight this challenge of balancing cost, quality, and speed (7, 8). An alternative approach is direct government-led development, though evidence suggests that creating a conducive innovation ecosystem can be more effective than heavy-handed intervention (9). However, this approach may not be sufficient if the immediate priority is to rapidly produce high-quality drugs rather than to foster a self-sustaining market in the long term.

This study contributes by introducing a formal microeconomic model that captures the essential dynamic and strategic elements of this problem: the irreversible nature of R&D investment, the stochastic evolution of drug quality, and the sequential decision-making between the public and private sectors. More specifically, the pharmaceutical firm makes decisions regarding the timing of investing in a sunk cost to initiate a research and development (R&D) project. The quality of the drug resulting from this research evolves stochastically over time. Upon reaching a predetermined quality threshold set by the health authority, the firm gains the opportunity to commercialize the drug and receive an ex ante known remuneration for providing it to the health authority. The health authority's payoff from the drug is determined by cumulative population health benefits, while its costs include the remuneration paid to the pharmaceutical firm. In our benchmark scenario, we introduced an integrated social planner who makes joint decisions regarding both the timing of investment and the quality threshold. This setup enables us to compare outcomes under different decision-making frameworks and evaluate the efficiency of coordination between the pharmaceutical firm and the health authority in achieving societal welfare objectives.

This study uses techniques from optimal stopping theory to solve the pharmaceutical firm's timing of investment, aligning it with the literature on real options (10). A considerable amount of literature is dedicated to studying investment under uncertainty across multiple industries. A handful of papers incorporate optimal timing, as evidenced by works such as Sarkar (11), Sarkar and Zhang (12), Thijssen (13), Thijssen (14), Tsekrekos and Yannacopoulos (15), and Huberts and Thijssen (16). In this study, we delve into a mechanism design problem intricately tied to the timing of an agent's decision, in this instance, the investment decision of a pharmaceutical firm. More importantly, we compare the social planner benchmark with the “decentralized” game. It draws parallels to the concept of vertical control discussed in the industrial organization literature, as outlined by Tirole (17). In this context, the health authority assumes the role of the upstream firm, while the pharmaceutical firm represents the downstream firm within the decentralized game.

There are four main findings of this study. First, we demonstrate that the optimal investment threshold under the social planner's benchmark is strictly lower than the investment threshold in the decentralized project. This indicates that a social planner would invariably initiate R&D efforts sooner than the pharmaceutical firm operating independently. Second, in scenarios where the budget for remuneration is constrained, and the sunk costs of the project remain relatively low, the social planner tends to select a higher-quality standard than the health authority. Conversely, when sufficient funding is available to afford higher remuneration, the level of remuneration becomes a viable tool for adjusting the quality standard of the drug. However, raising the remuneration may delay commercialization. Finally, the analysis reveals that the social planner's approach generates more value than the combined efforts of the health authority and the pharmaceutical firm. Importantly, this outcome remains consistent across various model parameters, including the discount rate, growth rate, and uncertainty levels.

The remainder of this study is organized as follows. Section 2 grounds the model in practice and introduces a simple game between a health authority and a pharmaceutical firm. Section 3 discusses the optimal stopping problem of the integrated social planner. In Section 4, we compare the thresholds and values of the two projects, shedding light on the implications of different decision-making frameworks. Section 5 offers concluding remarks summarizing the key findings and policy implications of the study.

Model framework and real-world contex

2

Interpreting the model in practice

2.1

Before presenting the formal game, we clarify the real-world counterparts of our modeling constructs. The decentralized framework captures the essential strategic interplay common in many health systems: a health authority ( $[eqn]$ ), acting as both regulator and payer (e.g., akin to the U.S. FDA/CMS or China's NMPA/Healthcare Security Administration), sets a minimum quality standard for market approval or reimbursement. This standard reflects technical requirements for safety and efficacy. A pharmaceutical firm ( $[eqn]$ ) then decides when to commit its own resources to R&D in response to this regulatory hurdle. This separation of decision-makers is the norm. In contrast (the benchmark case), the social planner ( $[eqn]$ ) represents an idealized, fully integrated decision-maker, analogous to a mission-driven, publicly funded research program (e.g., the U.S. BARDA or a dedicated government task force for a pandemic) that internalizes both development costs and population health benefits. As for the model environment, the drug quality evolves stochastically, which is a parsimonious representation of the technical uncertainty inherent in R&D. The acceleration in growth rate post-investment (μ_2_>μ_1_) reflects the intensified effort and resource allocation after a firm or planner commits to full-scale development.

Key parameters in our model are motivated by empirical realities and policy variables:

Quality (qt): A composite index of drug efficacy and safety, the primary target of regulatory review. Standards ( $[eqn]$ , $[eqn]$ ): The minimum q required for the $[eqn]$ to approve/pay, or the $[eqn]$ to deploy the drug. In practice, these are informed by health technology assessment (HTA) and comparator therapies.

Remuneration (R): The (often contractually agreed) payment flow from the $[eqn]$ to the $[eqn]$ for a successful drug. This mirrors prices set in Advance Purchase Commitments (APCs) or negotiated in national reimbursement drug lists (e.g., China's NRDL negotiations), where price is explicitly linked to assessed clinical value. Health benefit (K): Converts quality into monetary health gains, reflecting the payer's willingness-to-pay for health outcomes. Costs (I, C): The sunk R&D cost and marginal production cost, respectively. Uncertainty (σ): Volatility in the quality process, capturing the unpredictable nature of scientific discovery and clinical trial results.

This setup allows us to analyze how the regulatory/payment threshold ( $[eqn]$ ) and the financial incentive (R)—two key policy instruments—jointly determine the private investment timing (τ), and to compare this outcome with the integrated benchmark.

A simple game between a health authority and a pharmaceutical firm

2.2

Consider a pharmaceutical firm ( $[eqn]$ ) that is allowed to invest in developing a drug. The investment is irreversible, entailing a sunk cost I>0 which is paid at the start of the project. We assume that an advance purchase commitments contract is signed between a health authority ( $[eqn]$ ) and the $[eqn]$ specifying: (1) the quality standard of the drug, and (2) the remuneration to the firm for an approved drug. For simplicity, the remuneration is assumed to be a constant and indefinite cash flow R. The amount R is treated as a parameter.2

Upon project completion, an independent adjudication committee (IAC) will determine whether the product meets the predefined technical specifications (2). The pre-specified quality standard is denoted by $[eqn]$ in the model. We simplify the model by not explicitly detailing the verification phase and assume that q is perfectly observable to both the $[eqn]$ and the $[eqn]$ . Similar to many R&D projects, investment lags are present: the initial sunk cost is incurred well before future revenue generation, reflecting the time required to achieve the specified quality standard.

Every pharmaceutical project can be conceptualized as a sequence of sub-projects, encompassing various stages such as discovery, pre-clinical, phase I, II, and III trials, and approval (18), (19). To focus on the specific issues of interest in this study, we simplify the project by merging the aforementioned stages into two major phases: the discovery phase before investment and the R&D phase after investment.

Uncertainty is modeled on a probability space (Ω, $[eqn]$ , P). The firm's revenues depend on the quality of the drug, and the progression is captured through a geometric Brownian motion (GBM) (qt)t≥0. Before initiating the R&D process, the manufacturer will conduct basic research during the discovery phase, incurring lower costs to collect as much beneficial information as possible for the R&D process. Without loss of generality, we assume the cost of the discovery phase is 0. Nonetheless, the basic research is only expected to marginally enhance the quality of the product, as described by the stochastic differential equation (SDE)

[eqn]

where Bt is a Wiener process. We take the natural filtration $[eqn]$ on (Ω, $[eqn]$ ).

Once the $[eqn]$ considers the project sufficiently promising (to be determined endogenously), a sunk cost I will be paid, and the R&D phase starts. This will increase the expected growth rate of the drug's quality to μ_2_ (μ_2_>μ_1_>0). To summarize, if T≥0 is the time of investment, then the drug's quality evolves as

[eqn]

Now consider the following sequential game between the health authority and the firm. The $[eqn]$ moves first by specifying a quality standard $[eqn]$ . After observing $[eqn]$ , the $[eqn]$ chooses an optimal time τ to start the R&D project and pay the sunk cost. The firm stops further development of the product once $[eqn]$ is reached. Due to the stochastic nature of the evolution of q, the time τ takes the form of a stopping time.

The pharmaceutical firm's investment timing problem

2.3

The project timeline is as follows.

Timeline illustrating the drug development process. It begins at “Now” with a blue section labeled “Discovery, μ1”. At T(qi), it transitions to a red section, “R&D process, μ2”. At T(qiA), it changes to black, indicating “New drug approved. Commercialization starts”.

Upon achieving the quality standard $[eqn]$ , the health authority will fulfill the advance purchase commitments by paying a constant flow R to the pharmaceutical company. The production cost of the drug, borne by the firm, is denoted by C. We assume that C is fixed and R>C. Furthermore, it is assumed that all decision makers involved are risk-neutral and share a common discount rate, ρ. The net present value (NPV) of commercialization at the quality level $[eqn]$ is calculated as follows.

[eqn]

The firm's problem is to determine the optimal time ν to invest. After initiating the investment, the drug will be commercialized at the first hitting time of $[eqn]$ , i.e., at

[eqn]

The $[eqn]$ solves the optimal stopping problem

[eqn]

where

[eqn]

The first line of Equation 5 shows the firm's trade-off: it chooses the investment time ν to maximize the expected discounted commercialization revenue (starting at the uncertain future time $[eqn]$ ) when the standard is met), net of the sunk cost I paid at ν. The term $[eqn]$ discounts the revenue back from the uncertain approval time to the investment time, and $[eqn]$ discounts the cost back to the present. The value function $[eqn]$ thus represents the value of the firm's R&D option. The derivations of equations use the Markov property of the process (qt)t≥0. The last line reduces the problem to a standard optimal stopping problem. As in many problems of this kind, the optimal stopping time will take the form of the first hitting time of an endogenously determined investment trigger, which we will denote by $[eqn]$ .3

To solve for this trigger, we first need to compute the value of F(q). Since $[eqn]$ is a constant, it can be taken out of the expectation, and we obtain

[eqn]

We now proceed to compute the expected discount factor, $[eqn]$ . First, the characteristic operator of the process qt after investment is

[eqn]

Note that the general solution of the equation $[eqn]$ _q_φ = ρφ is of the form,

[eqn]

where β_1_(μ_2_)>1 and β_2_(μ_2_) < 0 are the two roots of the quadratic equation,

[eqn]

We now think of the expected discount factor as a solution to this equation. Any such solution should solve the boundary condition φ(0) = 0, which can only be satisfied if B = 0. Therefore, $[eqn]$ , for some constant A, on $[eqn]$ .4

By using the Dynkin's formula,5 the expected discount factor can now easily be derived as

[eqn]

So, the value of $[eqn]$ is

[eqn]

By using the same approach above again, we can compute the expected discount factor for $[eqn]$ :

[eqn]

where β_1_(μ_1_)>1 is the positive root of the quadratic equation

[eqn]

To utilize the results obtained, we can now establish the following proposition, which is proved in Appendix A.

Proposition 1. Suppose that the quality threshold $[eqn]$ is given. Then the value function of the R&D option for the pharmaceutical firm is given by

[eqn]

Furthermore, the optimal investment threshold $[eqn]$ equals

[eqn]

From the pharmaceutical firm's perspective, the value of the project at any point in time is tied to the value of the stochastic process (qt)t≥0, which tracks the product's quality at that moment. If the product's quality is less than the optimal investment threshold $[eqn]$ , the best strategy is to remain in the discovery phase, wait and observe the evolution of (qt)t≥0, and pay the sunk cost as soon as $[eqn]$ is reached. If the product's quality is larger than or equal to $[eqn]$ , but less than the pre-specified quality $[eqn]$ , the best strategy is to invest and commence the R&D process immediately. Finally, if the quality meets or surpasses $[eqn]$ , there is no point in conducting basic research in the discovery phase and starting the R&D process; the best strategy of the pharmaceutical company is to promptly start the commercialization process by paying the operating cost C per period.

Proposition 1 provides the closed-form solution. The optimal investment threshold $[eqn]$ is increasing in the health authority's standard $[eqn]$ , and the sunk cost I, and decreasing in the net revenue flow (R−C). The economic intuition is clear: a higher mandated standard increases the expected time and cost to reach commercialization after investment, raising the value of waiting during the discovery phase. This induces the firm to delay investment (higher $[eqn]$ ) until a higher-quality starting point is reached. Conversely, a more generous remuneration R increases the payoff, encouraging earlier investment (lower $[eqn]$ ), all else equal.

Note that so far we have implicitly assumed $[eqn]$ . The following lemma, which is proved in Appendix B, shows that this is indeed the case.

** Lemma 1**. It holds that $[eqn]$ .

To ensure that proposition 1 makes economic sense, it is assumed that $[eqn]$ , therefore, $[eqn]$ . This condition will need to be satisfied so that the pharmaceutical has an incentive to start the project in the first place. Without meeting this condition, the firm will refrain from launching the R&D process, opting instead to prolong the discovery phase until the quality threshold is reached.

The health authority's standard-setting problem

2.4

In the previous subsection, we have computed the optimal investment threshold of the pharmaceutical firm, conditional on the quality threshold $[eqn]$ being given. For every potential quality standard established by the $[eqn]$ , there exists a unique corresponding optimal investment threshold for the $[eqn]$ . The objective of the $[eqn]$ is now to set an optimal quality standard $[eqn]$ to maximize the net present value of patients' health gain net of payments to the $[eqn]$ .

The $[eqn]$ thus solves a Stackelberg leader's problem: it anticipates the firm's reaction function $[eqn]$ from Proposition 1, and chooses $[eqn]$ to maximize its own objective, which is the discounted stream of health benefits $[eqn]$ minus payments R, starting from the (endogenously determined) commercialization time.

For analytical convenience, we assume that patients' overall health outcomes are linearly correlated to the quality of the drug and that these outcomes can be expressed in monetary terms. In particular, the flow of health benefits is assumed to be fixed and equal to K>0. Hence, the problem of the health authority is given by

[eqn]

where

[eqn]

By using the same approach in the last subsection, we can compute the expected discount factor,

[eqn]

Similar to the problem of the firm, the issue facing the health authority can be reformulated as

[eqn]

The subsequent proposition can be readily obtained.

Proposition 2. The value function of the health authority is

[eqn]

The value of the health authority's project depends on the current quality of the product. If the current quality $[eqn]$ , the pharmaceutical company will stay in the discovery phase and wait until the optimal investment threshold is reached before proceeding with investment. After investment, the R&D process continues till the optimal quality standard is achieved, at which point the $[eqn]$ realizes its payoff. If the current quality $[eqn]$ , the pharmaceutical company will start the R&D process immediately, and the payoff is realized when the optimal quality standard is met. Finally, if $[eqn]$ , indicating the quality of the product has exceeded the optimal quality standard, the $[eqn]$ can promote product sales immediately.

Proposition 3. The optimal quality standard that maximizes the health authority's value is $[eqn]$ .Consequently, the firm's optimal investment threshold is

[eqn]

The proof of this proposition can be found in Appendix C.

Proposition 3 reveals the $[eqn]$ 's trade-off. A higher standard $[eqn]$ directly increases the per-period health benefit $[eqn]$ but, as in Proposition 1, indirectly delays commercialization by raising the firm's investment threshold $[eqn]$ and thus the expected time to approval. The optimal standard balances this benefit-delay trade-off. The solution shows $[eqn]$ is proportional to the remuneration-price ratio R/K. Intuitively, if the $[eqn]$ values health highly (large K) relative to its payment R, it sets a high standard. If the payment is high relative to the health valuation, it opts for a lower standard to accelerate access.

Since the $[eqn]$ 's remuneration to the $[eqn]$ is a constant, the only decision that $[eqn]$ has to make revolves around the timing of its investment, irrespective of the timing of commercialization. In this case, once the quality standard $[eqn]$ is attained, it is optimal for the $[eqn]$ to commercialize the drug immediately. For the $[eqn]$ , the only decision lies in establishing the quality standard, designed to maximize patients' health gains net of payoffs to the firm, conditional on the reaction function of the $[eqn]$ .

The integrated social planner's problem (the benchmark)

3

In the decentralized framework, both the pharmaceutical firm and the health authority play pivotal roles, each overseeing a specific aspect of the project. The health authority, acting first, sets the commercialization approach by establishing a quality benchmark for the final product. Subsequently, based on this benchmark, the pharmaceutical firm decides on the optimal time to initiate the R&D phase, focusing on drug development. The primary objective for each party is to maximize the value of their segment of the project. This narrative shifts when introducing a social planner ( $[eqn]$ ) who consolidates control over both the R&D and commercialization facets. This unified approach necessitates a simultaneous consideration of the investment timing and the determination of an ideal quality standard. The social planner's mission diverges from the decentralized model by aiming to amplify the total value of the project, thereby harmonizing the strategies for development and market introduction under a single, overarching goal.

The optimal investment threshold and the optimal quality standard are denoted by $[eqn]$ and $[eqn]$ , respectively, in the social planner's project. The problem of the social planner is

[eqn]

The social planner's value function $[eqn]$ internalizes the entire net social benefit stream ( $[eqn]$ ), avoiding the transfer payment R. The key distinction from the decentralized case is that both control variables, $[eqn]$ and $[eqn]$ , are chosen simultaneously to maximize this unified objective.

The investment problem can be solved by treating $[eqn]$ as a constant and taking the partial derivatives of $[eqn]$ with respect to $[eqn]$ , and $[eqn]$ gives

[eqn]

The problem of determining the optimal quality standard to enhance social welfare can be solved by taking the partial derivatives of $[eqn]$ with respect to $[eqn]$ , and $[eqn]$ gives

[eqn]

Replacing $[eqn]$ in Equation 24, we have

[eqn]

In addition, to ensure Equation 24 makes economic sense, it is assumed that $[eqn]$ . Thus, the following condition must be satisfied.

[eqn]

Equations 24, 26 are the core results for the social planner. The optimal quality standard $[eqn]$ is proportional to the cost-to-benefit ratio (C/K), and crucially, is independent of the sunk cost I. This contrasts sharply with the decentralized standard $[eqn]$ (Proposition 3), which depends on the remuneration R. Here, the planner sets the standard based solely on the fundamental trade-off between the marginal cost of production (C) and the marginal social value of quality (K).

Proposition 4. The value of the project managed by the social planner $[eqn]$ is maximized if the R&D process starts at $[eqn]$ and the quality standard is set to be $[eqn]$ .

The proof of this proposition can be found in Appendix D.

It then immediately follows that

Proposition 5. the project value of the social planner is

[eqn]

The project value of the social planner is dependent on the quality of the product. If the current quality is below the optimal investment threshold, i.e., $[eqn]$ , the social planner will consider staying in the discovery phase, waiting, and collecting more information, and invest until the quality is high enough. If the current quality is higher than the optimal investment threshold but lower than the optimal quality standard, i.e., $[eqn]$ , the social planner will skip the discovery phase and start the R&D process immediately. The R&D process will be completed when the optimal quality standard is first met. Finally, if the current quality is larger than the optimal quality standard, i.e., $[eqn]$ , both the discovery phase and R&D process can be skipped. The best strategy is to proceed directly to commercialization.

Analysis of thresholds, project values, and practical implications

4

Throughout our analysis, we have identified four distinct thresholds for the two projects including (1) $[eqn]$ , the optimal investment threshold of the $[eqn]$ , (2) $[eqn]$ , the optimal quality standard set by the $[eqn]$ , (3) $[eqn]$ , the optimal investment threshold set by the $[eqn]$ , and (4) $[eqn]$ , the optimal quality standard set by the $[eqn]$ . To make the investment problems more interesting, we assume that the optimal investment thresholds are positioned below the optimal quality standards. Otherwise, achieving the quality standard during the discovery phase alone, without initiating the R&D process, would be an outcome that seldom aligns with the practical complexities and challenges encountered in real-world scenarios.

More specifically, in the decentralized project, we assume that $[eqn]$ . The corresponding condition to be satisfied is

[eqn]

In the project conducted by the $[eqn]$ , we assume that $[eqn]$ . The corresponding condition to be satisfied is

[eqn]

Next, we discuss the relationship between the two optimal investment thresholds.

Proposition 6. The investment threshold chosen by the $[eqn]$ is always strictly less than the one chosen by the $[eqn]$ , i.e., $[eqn]$ .

The proof of this proposition can be found in Appendix E.

This result underscores a fundamental inefficiency of decentralization: strategic separation induces delay. The social planner, by internalizing the full health benefit of earlier availability, finds it optimal to trigger development sooner. In practical terms, this suggests that mission-driven, publicly coordinated R&D programs (which approximate the social planner) can be expected to initiate projects earlier than those relying solely on private firms reacting to regulatory and market signals. This is clearly relevant to time-sensitive health crises, such as pandemics, where accelerating the start of R&D is paramount.

To illustrate the distinction between the two investment thresholds more clearly, we consider a numerical example with the following parameter values: μ_1_ = 0.02, μ_2_ = 0.2, σ∈[0.001, 0.9], I = 10, ρ = 0.5, C = 20, K = 100, and R = 80. In Figure 1, we demonstrate that the investment threshold determined by the social planner is consistently lower. Given the identical growth rates μ_1_ and μ_2_, as well as the volatility σ for the stochastic process (qt)t≥0 in both projects, a lower investment threshold implies earlier investment and commencement of the R&D process.

The difference of investment thresholds as uncertainty varies. The dashed line represents the investment threshold of the pharmaceutical firm, while the solid line indicates the investment threshold of the social planner.

The divergence in investment thresholds can be attributed to the differing organizational structures of the two projects. In the decentralized model, the objectives of the two parties are to maximize the value of their respective projects. From the perspective of the $[eqn]$ , for a certain quality standard, the sooner the investment takes place, the higher the payoffs. Earlier investments accelerate the development process by increasing growth rates, while simultaneously reducing the discount impact on the anticipated gains in patient health. However, from the perspective of the $[eqn]$ , earlier investments imply giving up information and the opportunity to improve product quality during the discovery phase. To capture option value in the discovery phase, the $[eqn]$ tends to wait and invest when the quality of the product is high enough.

Conversely, in the project managed by the $[eqn]$ , the objective is to maximize the project's overall value. Both the timing of investment and the quality standard are considered simultaneously. For the same quality standard, earlier investments lead to earlier realization of patients' health gains, which is consistent with the benefit of the social planner. Although earlier investments benefit the $[eqn]$ in the decentralized model, they do not necessarily benefit the $[eqn]$ . The conflict between the two players in the decentralized project leads to late investment. Hence, the investment threshold in the social planner's project is lower in the absence of conflicts.

A key distinction between the two projects is that the $[eqn]$ receives remuneration, R, from the $[eqn]$ upon successful completion of the R&D process, provided the drug meets the established quality threshold. However, in the project overseen by the $[eqn]$ , the parameter R is not a consideration. We will further examine how the level of remuneration influences the thresholds within the decentralized project and how R affects the determination of optimal quality standards across different projects.

For computational convenience, we denote M as the right hand side of the inequality (Equation 29), i.e., $[eqn]$ , and denote $[eqn]$ .

Proposition 7. Findings on remuneration R and quality standards. When $[eqn]$ , M < N. If M < R < N, the optimal quality standard of the decentralized project is less than the one managed by the social planner i.e., $[eqn]$ . Hence, $[eqn]$ .

The proof of this proposition can be found in Appendix F.

The optimal quality standard of the social planner $[eqn]$ is not affected by the variations of remuneration level since there is no remuneration in the social planner's project. However, a lower remuneration level will lead to a lower quality standard in the decentralized project $[eqn]$ (see Proposition 3). The sunk cost does not directly affect the quality standards of both projects. In the social planner's project, the decrease in sunk cost will lead to a lower required operating cost (see Equation 30), which will not affect the quality standard provided the operating cost remains sufficiently high. Similarly, the quality standard of the decentralized project $[eqn]$ is indirectly affected by sunk costs. A lower sunk cost will lead to a lower required remuneration (refer to Equation 29). However, as long as the minimum requirement of R is satisfied, i.e., if Equation 29 holds, the sunk cost I will not influence the quality standard of the decentralized project. The proposition shows that when R is low, the optimal quality standard for the decentralized project is lower than that managed by the social planner. Policy advice is that, if the budget for remuneration is limited while the project's sunk cost is relatively low, a social planner's project is preferred. This approach not only ensures an earlier start and, consequently, quicker completion and commercialization of the product, but also guarantees a higher quality standard.

Proposition 8. Findings on Remuneration R and Quality Standards. When $[eqn]$ , M<N and when $[eqn]$ , M≥N. If R≥max{M, N}, the optimal quality standard of the project managed by the social planner is less or equal to that of the decentralized project, i.e., $[eqn]$ . Moreover, there is a unique remuneration level Rc (Rcc) such that the optimal investment threshold of the decentralized project equals the optimal quality standard of the social planner's project, i.e., $[eqn]$ . Thus if N(M) ≤ R<Rc(Rcc), $[eqn]$ . If R≥Rc(Rcc), $[eqn]$ .

The proof of this proposition can be found in Appendix G.

Increasing the remuneration level can reverse the result in Proposition 7. When R≥max{M, N}, the quality standard of the decentralized project $[eqn]$ increases while the quality standard of the social planner's project $[eqn]$ is not affected. Moreover, as R increases, the investment threshold of the decentralized project $[eqn]$ first decreases and then increases. When N(M) ≤ R<Rc(Rcc), the investment threshold of the decentralized project is less than the quality standard of the social planner's project, i.e., $[eqn]$ . If R further increases, when R≥Rc(Rcc), the investment threshold of the decentralized project will be larger or equal to the quality standard of the social planner's project, i.e., $[eqn]$ .

With sufficient funding, the remuneration level R can be used as a tool to adjust the product's quality standard in the decentralized project, which is not possible in the social planner's project. However, this comes at the cost of delaying commercialization later, since $[eqn]$ also increases with R. In an extreme case, for instance, when R takes values in R≥Rc(Rcc), it can be the case that the final product in the social planner's project has been finished while the R&D process has not even started yet in the decentralized project. Moreover, it can be proved that Rcc ≤ Rc. In other words, the higher the sunk cost I, the less remuneration R is required to make $[eqn]$ approach $[eqn]$ from below. Mathematically, this is because $[eqn]$ increases with I while $[eqn]$ is not affected if the operating cost C reaches the minimum value, i.e., $[eqn]$ .

In summary, the central policy insight from Propositions 7, 8 is the explicit trade-off between quality ambition and development speed that a payer/regulator faces in a decentralized system. This directly models a key dilemma in designing pull incentives like APCs: setting a high reward to attract investment toward a high-quality target may perversely slow down the very innovation it seeks to spur, if the associated standard is set too high. This mirrors ongoing debates about balancing the encouragement of innovation with rapid access to new technologies.

In Figure 2, we analyze how the investment threshold of the decentralized project varies with uncertainty, and how the spread of the maximum and minimum value of the threshold changes under different sunk costs and levels of remuneration.

The investment threshold of the decentralized project varies with uncertainty, and the range between the maximum and minimum investment threshold values shifts under different sunk costs I and remuneration level R. The parameter values are: μ1 = 0.02, μ2 = 0.2, σ = [0.001, 0.9], I = 0.5 or 1, ρ = 0.5, C = 5, K = 10, and R=10, 30 or 60. In the figures above, D represents the difference between the maximum and minimum investment threshold. (A) I = 0.5, R = 10. (B) I = 0.5, R = 30. (C) I = 0.5, R = 60. (D) I = 1, R = 10. (E) I = 1, R = 30. (F) I = 1, R = 60.

The following proposition is easily obtained.

Proposition 9. The spread between the maximum and minimum values of the investment threshold narrows as the remuneration level R rises, under varying degrees of uncertainty. Conversely, the spread widens as the sunk cost I increases.

The investment threshold of the decentralized project increases with uncertainty. Moreover, as the remuneration level R increases, the spread between the maximum and minimum values of the investment threshold decreases. The effect of R on the investment threshold $[eqn]$ can be separated into two parts. From Proposition 3, we know that $[eqn]$ which can be considered as a product of two terms. First, an increase in R will lead to a higher quality standard since $[eqn]$ . This is quite intuitive since a product with higher quality often requires more inputs of the available resources, such as time, money, and effort. To increase incentives to consume more resources in the research and development of a product, higher remuneration will be needed. The impact of the remuneration level R on the optimal quality standard $[eqn]$ is termed the “quality effect.” This effect typically leads to a postponement of the investment decision. On the other hand, an elevation in R signals higher anticipated future revenues, prompting the decision maker to prefer accessing these revenues at an earlier stage. The impact of R on the project's Net Present Value (NPV) is referred to as the “NPV effect,” which serves as an incentive to advance the investment timing. This dynamic is encapsulated within the term $[eqn]$ .

Figure 2 shows that when uncertainty is low, the investment threshold goes up faster when R increases. Intuitively, as R increases, the quality standard increases. If uncertainty is low, the chances of large upward jumps are lower. In this case, a better strategy is to wait in the discovery phase and invest after the upward jumps are realized, rather than investing earlier and hoping that the upward jumps will occur during the R&D process, which would lead to a sooner finish of the project. Earlier investment means giving up the upward jumps for free, especially when these jumps are more valuable when uncertainty is low, since they happen less often. Since the upward jumps are more valuable when uncertainty is low, it is more important to capture them by investing later when the current quality is higher, which is represented by a higher optimal investment threshold. In other words, the “quality effect” is larger when R goes up with lower uncertainty. On the other hand, although the “NPV effect” tends to encourage earlier investment, it is limited by the “quality effect.” Sooner investment reduces the project's value when uncertainty is low, since valuable upward jumps are forgone. The stronger “quality effect” when uncertainty is low causes the optimal investment threshold to increase more rapidly.

When uncertainty is high, the “quality effect” is weaker since even if earlier investment means giving up the upward jumps in the discovery phase for free, it is more likely that these jumps will happen in the R&D process. Hence, it is less important to ensure that the benefits of upward jumps are captured before investment. And these jumps are less valuable because they occur more often. Moreover, “NPV effect” is stronger since an earlier investment is a possibility. The combination of the two effects leads to a slower increase in the threshold as R increases with higher uncertainty.

Since the “quality effect,” which leads to the increase of the investment threshold, is much stronger when uncertainty is low, while the “NPV effect,” which reduces the investment threshold, becomes stronger when uncertainty is higher, the spread of the maximum and minimum value of the investment threshold decreases as the remuneration level R increases. Moreover, the spread is also related to the value of sunk cost. The higher the sunk cost, the higher the spread. This is because the “NPV effect” is even weaker when I is large, while the “quality effect” is not affected. The combination of the two effects increases the spread.

To summarize, the finding of Proposition 9 is that the investment threshold's sensitivity to uncertainty varies with the level of remuneration R, offering nuanced guidance for risk-sharing in innovation contracts. In high-uncertainty projects (e.g., novel therapeutic platforms), a high R is more effective at accelerating investment because the “NPV effect” dominates. This supports the use of substantial milestone payments or guaranteed volumes in areas like antimicrobial or cancer vaccine development, where technical risk is high. Conversely, for lower-uncertainty projects, financial rewards are more likely to be channeled into pursuing higher quality, warranting closer payer oversight of standard-setting.

Next, we compare the value of the decentralized project to that of the social planner's project.

Proposition 10. The value of the project managed by the social planner exceeds than that of the decentralized one.

The proof of this proposition can be found in Appendix H.

Two examples are provided to show the difference in the project values. Figure 3 shows the different project values when the initial value of the stochastic process q is smaller than the investment thresholds of both projects, i.e., $[eqn]$ . The parameter values are: q = 0.3, μ_1_ = 0.02, μ_2_ = 0.2, σ = [0.001, 0.9], I = 0.5, ρ = 0.5, C = 5, K = 10, and R = 7. In this case, the best strategy for both decision makers is to wait until each investment threshold is reached before paying the sunk cost. It is shown that the value of the social planner's project is strictly larger than that of the decentralized one.

The difference of project values as uncertainty varies, when q<qII<qcI. The dashed line represents the total value of PF and HA in the decentralized project, while the solid line shows the project's value managed by SP.

Intuitively, although optimal decisions are made in both projects, the maximization problem that $[eqn]$ solves is an unconstrained one, while $[eqn]$ solves a constrained maximization problem, in which the pharmaceutical company moves next for a given quality standard set by the government in advance. In this case, the option value of the pharmaceutical company's project cannot be fully captured, resulting in a lower total value for the decentralized project. Furthermore, as uncertainty σ increases, the value of the decentralized project increases more slowly, indicating that part of the option value is not captured, since the decision of the pharmaceutical company is constrained by the action of the health authority.

Figure 4 shows the different project values when the start value of the stochastic process q is larger than the investment thresholds of both projects, i.e., $[eqn]$ . The parameter values are: q = 2, μ_1_ = 0.02, μ_2_ = 0.2, σ = [0.001, 0.9], I = 1, ρ = 0.5, C = 5, K = 10, R = 20. In this case, both projects will start immediately without further waiting, and it is shown that the social planner's project value is strictly larger than that of the decentralized one.

The difference of project values as uncertainty varies, when qII<qcI<q. The dashed line represents the total value of PF and HA in the decentralized project, while the solid line shows the project's value managed by SP.

In this case, the decision made by the government is also affected by the response of the pharmaceutical company in the decentralized project. In other words, the health authority, although it moves first by setting a quality standard, also solves a constrained maximization problem, restricted by what the pharmaceutical company will react to, given the set quality standard. Thus, the optimal investment threshold $[eqn]$ does not maximize the unconstrained value function x↦NPV(x) (see Appendix H). Hence, the total value of the decentralized project is less than the social planner's project. In addition, as uncertainty goes up, both project values decrease. This is because both projects have no flexibility but to invest immediately. Without the option to wait, the bad outcomes cannot be avoided, and increasing uncertainty reduces the value of both projects.

This fundamental welfare result quantifies the cost of fragmented decision-making. The combined value of the $[eqn]$ and $[eqn]$ is less than the value a coordinated planner could achieve. In policy terms, this efficiency loss justifies exploring hybrid models that better align incentives, such as: (1) Public-private partnerships (PPPs) with integrated governance, which move closer to the social planner benchmark; (2) Regulatory-design partnerships where developers and regulators collaborate early on defining feasible yet meaningful development endpoints (akin to co-setting $[eqn]$ ); and (3) Value-based agreements that more tightly link the level of payment R to the achieved health outcomes, partially internalizing the planner's objective.

Conclusions and policy implications

5

This study provides a formal analysis of how a health authority's quality standard affects a pharmaceutical firm's R&D investment timing, comparing decentralized decision-making with an integrated social planner benchmark. Our findings, grounded in real options theory, offer specific insights into designing innovation incentives in healthcare. The main conclusions and their policy implications are summarized as follows:

First, integrated coordination accelerates development but requires new governance models. Thus, for public health priorities where time is critical (e.g., pandemic preparedness, outbreaks of neglected diseases), there is a strong efficiency case for coordinated, public-sector-led or heavily orchestrated R&D programs that approximate the role of a social planner. This supports the creation of entities with the mandate and resources to manage both technical and incentive alignment, moving beyond simple funding contracts.

Second, remuneration and quality standards are dual policy levers with an inherent trade-off. Health authorities that introduce pull incentives, such as APCs or price-based pricing, are expected to address these trade-offs appropriately. Contracts should target clear minimum viable efficacy levels and strong financial incentives to ensure quick launch. For therapeutic area development where high quality is desired, larger rewards can be linked to correlated levels of efficacy.

Third, project characteristics should inform the choice of incentive framework. For low-budget, high-urgency applications, integrated models or tightly constrained PPPs can be adopted. For large-scale efforts to achieve breakthrough quality, decentralized models with carefully-tuned R and $[eqn]$ can be effective. High technical uncertainty calls for incentive structures that greatly reward success (high R) to overcome risk aversion, possibly based on success-dependent royalty rates or tiered pricing.

Finally, the measured efficiency loss justifies investing in alignment mechanisms. The welfare superiority of the social planner's outcome quantifies the efficiency loss from separated optimization. This loss represents the potential value to be gained from improved alignment mechanisms. Investments should be made in progressive regulatory pathways (e.g., breakthrough therapy designations, rolling reviews), early scientific advice to align on standards, and outcome-based payment models that share risk and better link rewards to societal value. Recent moves toward regulatory agility and managed-entry agreements in various health systems are practical steps in this direction.

In conclusion, our model reframes the challenge of stimulating pharmaceutical R&D as a problem of optimally coordinating two key decisions: how good and when to start. There is no general solution because the choice between decentralized and integrated strategies depends first on policy goals, resources, and disease context. By modeling the tradeoff between quality, financial rewards, and timing, we provide insights into the design of more effective, context-aware innovation policies.

Bibliography21

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Buckup S. Global public-private partnerships against neglected diseases: building governance structures for effective outcomes. Health Econ Policy Law. (2008) 3:31–50. doi: 10.1017/S 174413310700439218634631 · doi ↗ · pubmed ↗
2Berndt ER Hurvitz JA. Vaccine advance-purchase agreements for low-income countries: practical issues. Health Aff . (2005) 24:653–65. doi: 10.1377/hlthaff.24.3.65315886157 · doi ↗ · pubmed ↗
3Turner M. Vaccine procurement during an influenza pandemic and the role of advance purchase agreements: lessons from 2009-H 1N 1. Glob Public Health. (2016) 11:322–35. doi: 10.1080/17441692.2015.104374326209064 · doi ↗ · pubmed ↗
4Thornton I Wilson P Gandhi G. “No Regrets”' Purchasing in a pandemic: making the most of advance purchase agreements. Global Health. (2022) 18:62. doi: 10.1186/s 12992-022-00851-335715814 PMC 9204686 · doi ↗ · pubmed ↗
5Farlow A. An Analysis of the Problems of R&D Finance for Vaccines and an Appraisal of Advance Purchase Commitments. Oxford: University of Oxford. (2004).
6Levine R Kremer M Albright A. Making markets for vaccines. Report of the Center for Global Development Washington DC. (2005).
7Gu H Jie Y Lao X. Health service disparity, push-pull effect, and elderly migration in ageing China. Habitat Int. (2022) 125:102581. doi: 10.1016/j.habitatint.2022.102581 · doi ↗
8Matthey MA Hollis A. Pull me-push you? The disparate financing mechanisms of drug research in global health. Global Health. (2024) 20:14. doi: 10.1186/s 12992-024-01019-x 38374045 PMC 10877918 · doi ↗ · pubmed ↗