Phase transition in the Bayesian estimation of the default portfolio
Masato Hisakado, Shintaro Mori

TL;DR
This paper explores a phase transition phenomenon in Bayesian default probability estimation, showing how temporal correlation decay affects convergence and providing conditions for reliable estimation with empirical data.
Contribution
It introduces a hierarchical Bayesian method for PD estimation considering temporal correlations and characterizes the phase transition related to correlation decay.
Findings
Phase transition occurs at power index = 1 for correlation decay.
PD estimator convergence depends on the power index of decay.
Empirical data suggests long memory with a power index greater than one.
Abstract
The probability of default (PD) estimation is an important process for financial institutions. The difficulty of the estimation depends on the correlations between borrowers. In this paper, we introduce a hierarchical Bayesian estimation method using the beta binomial distribution and consider a multi-year case with a temporal correlation. A phase transition occurs when the temporal correlation decays by power decay. When the power index is less than one, the PD estimator does not converge. It is difficult to estimate the PD with limited historical data. Conversely, when the power index is greater than one, the convergence is the same as that of the binomial distribution. We provide a condition for the estimation of the PD and discuss the universality class of the phase transition. We investigate the empirical default data history of rating agencies and their Fourier transformations to…
| No. | Asymptotic behavior | |||
|---|---|---|---|---|
| 1 | ||||
| 2 | ||||
| 3 | ||||
| 4 |
| Exponential decay | Power decay | ||||||
|---|---|---|---|---|---|---|---|
| No. | Model | ||||||
| 1 | Moody’s 1920-2017 | 0.96% | 1.9% | 0.044 | 0.95% | 2.0% | 4.7 |
| 2 | Moody’s 1920-2017 SG | 2.37% | 3.9% | 0.044 | 2.35% | 4.1% | 4.7 |
| 3 | Moody’s 1981- 2017 | 1.49% | 0.7% | 0.023 | 1.46% | 0.7% | 5.9 |
| 4 | Moody’s 1990-2017 | 1.65% | 0.7% | 0.006 | 1.70% | 0.8% | 7.0 |
| 5 | Moody’s 1981-2017 SG | 4.25% | 1.8% | 0.020 | 4.29% | 1.8% | 6.0 |
| 6 | S&P 1981-2017 | 1.54% | 0.8% | 0.024 | 1.54% | 0.8% | 5.7 |
| 7 | S&P 1990-2017 | 1.72% | 0.8% | 0.006 | 1.72% | 0.8% | 7.5 |
| 8 | S&P 1990-2017 SG | 4.21% | 2.0% | 0.024 | 4.17% | 1.9% | 5.7 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCredit Risk and Financial Regulations · Monetary Policy and Economic Impact · Financial Risk and Volatility Modeling
Phase transition in the Bayesian estimation
of the default portfolio
Masato Hisakado
- Nomura Holdings, Inc., Otemachi 2-2-2, Chiyoda-ku, Tokyo 100-8130, Japan
Shintaro Mori
†Department of Mathematics and Physics, Graduate School of Science and Technology, Hirosaki University
Bunkyo-cho 3, Hirosaki, Aomori 036-8561, Japan
Abstract
The probability of default (PD) estimation is an important process for financial institutions. The difficulty of the estimation depends on the correlations between borrowers. In this paper, we introduce a hierarchical Bayesian estimation method using the beta binomial distribution and consider a multi-year case with a temporal correlation. A phase transition occurs when the temporal correlation decays by power decay. When the power index is less than one, the PD estimator does not converge. It is difficult to estimate the PD with limited historical data. Conversely, when the power index is greater than one, the convergence is the same as that of the binomial distribution. We provide a condition for the estimation of the PD and discuss the universality class of the phase transition. We investigate the empirical default data history of rating agencies and their Fourier transformations to confirm the the form of the correlation decay. The power spectrum of the decay history seems to be 1/f, which corresponds to a long memory. But the estimated power index is much greater than one. If we collect adequate historical data, the parameters can be estimated correctly.
pacs:
02.50.Ga, 05.70.Fh, 89.65.Gh, 87.23.Kg
I Introduction
Anomalous diffusion is an emerging subject in many fields Bro ; W2 ; G ; M . The models describing such phenomena depend on long memory. These are related to the phase transition, which has received considerable interest in sociophysics galam ; galam2 and econophysics Man . In previous papers, we investigated voting models that were similar to the Keynesian beauty contest Mori ; Hisakado2 ; Hisakado3 ; Hisakado4 ; Hisakado5 . This model has two kinds of phase transitions. One is the information cascade transition, which is similar to the phase transition of the Ising model Hisakado3 . The other is the convergence transition of super-normal diffusion Hod ; Hisakado2 .
Estimations of the probability of default (PD) and default correlation have been obtained from empirical studies on the historical data from credit events. These two parameters are important for pricing financial products such as synthetic CDOs M2010 ; M2008 ; Sch . Also called “long run PDs”, these parameters are important to financial institutions for portfolio management. If the number of defaults is minimal, it is not easy to estimate these parameters Tas ; FSA .
In this paper, we introduce a Bayesian estimation method using the beta-binomial distribution Hisakado ; Wit . For the usual cases the Merton model, which incorporates the default correlation by the correlation of the asset price movements (asset correlation), is used to estimate the PD and default correlation Mer . Monte Carlo simulations are necessary to estimate these parameters, except in the limit of large homogeneous portfolios, where the Merton model is used Sch . In the beta-binomial case, default correlation, instead of asset correlation, is used Hisakado . Moreover, we consider a multi-year case with temporal correlation, which refers to a time-dependent correlationTas ; FSA .
A phase transition occurs when the temporal correlation decays by power-law. A power-law decay implies that the PD has a long memory compared to that of exponential decay Man . When the power index is less than one, the estimator distribution of the PD does not converge to the delta function. Alternatively, when the power index is greater than one, the convergence is the same as that of the normal case. When the distribution does not converge, it is difficult to estimate the PD with limited data. The required condition for estimating the PD is clarified. The critical exponents for the power-law decay of the correlation function depend on microscopic features of the model. The universality class of the phase transition is different from those of the nonlinear Pólya urns Mori5 ; Mori6 .
To confirm the decay form of the temporal correlation, we investigate the empirical default data history using Fourier transformations. We determine whether the power spectrum of the default history follows 1/f Kes ; Man . When this condition is satisfied, it corresponds to the correlation of the PD with long memory where a phase transition of the convergence exists. However, it is difficult to accurately confirm a 1/f power spectrum when the estimation of the power index is much greater than one. It follows that when there is adequate historical data, parameters such as PD, default correlation, and temporal correlation can be estimated correctly.
The remainder of this paper is organized as follows. In section 2, we introduce a hierarchical Bayesian estimation method using the beta-binomial distribution. In section 3, we consider the convergence of the PD estimator. In section 4, we study the phase transition of the Pólya urn with a discount factor using an analytic method and a finite-size scaling analysis. In section 5, we apply the Bayesian estimation to the empirical data of default history. Finally, the conclusions are presented in section 6.
II Bayesian Estimation using Beta-binomial distribution
We denote the PD estimation as and default correlation as , where and . The distribution of and is . The number of obligors in the portfolio is . and are estimated using a Bayesian estimation. We consider the Bernoulli random variables that take the values or [math] . When the obligor, , is the default (non-default), . We define and consider a default correlation for , and not an asset correlation.
When the number of defaults is , the Bayes formula for the posterior distribution is
[TABLE]
where is the prior distribution.
We use the beta-binomial distribution for . The posterior distribution is given by
[TABLE]
where, and . Hence, we obtain the relations and . Here, we use the beta function .
We consider the maximum a posteriori (MAP) estimation of Eq. (2). When the prior function is a constant function, the maximum point is
[TABLE]
where is the digamma function. The summation from to is a monotonously decreasing function of because increases, while the second summation in Eq. 3 is a monotonously increasing function about because decreases. When , the difference of the two summations is positive. Conversely, when , the difference of the two summations becomes negative. Hence, the function has one peak in the range . The multi-term case is provided in Appendix A.
Next, we consider the variable . The maximum point is
[TABLE]
where .
All the summations in the last term with parenthesis are monotonously decreasing functions about . When , the last term becomes positive. Conversely, when , the last term becomes 0. When or is adequately close to this condition, the last term becomes positive. and become . In this case, the last term increases monotonously, and the peak is and . This implies that the optimization of is zero for the single term model. When is not adequately close to , the last term changes from positive to negative as increases. Therefore, one peak occurs in .
We extend this method to the multi-year case. There are obligors in year and defaults occur. The prior distribution for the second year is the posterior distribution, which is calculated from the first year’s data. In this way, the posterior distribution is updated every year. We write the posterior distribution as
[TABLE]
It is natural to assume that the number of defaults of the current year is affected by the number of defaults in previous years, thus the defaults have a temporal correlation. When the default rate is high (low), it is reasonable to assume that the default rate will be high (low) in the next year. This is similar to volatility clustering, which has a long memory AR ; GA as well. We confirm this using empirical data in the following sections.
We introduce the temporal correlation by adjusting and , and consider the th year. The number of obligors and defaults in the th year are and . In the same year, the correlation is . We set the temporal correlation parameters between the and th years; and . and are adjusted to and Mori18 . This implies that the previous years’ data affects the present defaults. It is easy to confirm that indicates that all the data is correlated to . When , the data is independent each year.
III Correlation Decay
In the previous section, was introduced to represent the temporal correlation. In this section, to clarify the behavior of the parameter , where and , the variance of the stochastic process is considered. In each year the diffusion has steps and defaults, where .
The adjustments related to parameters and are the effects of the temporal correlation from the previous conclusions. We shrink the previous years’ conclusions and add them to the initial parameters for the adjustment process. The shrinking ratio for the interval is
The two term model is examined first. We consider the relation between the first and second years. and are the number of obligors and defaults, respectively, in the first year. The second year’s parameters become and . We consider the shrinking processes from to and to . The variance of the second term of process is , where ; that is, we approximate where is the beta-binomial distribution with parameters and . We approximate this variance by , and the difference becomes . Hence, the approximation is exact when or . However, if or , this approximation can be used. In other cases, the real variance is larger than the approximation. We use this approximation to study the meaning of this process.
For the defaults of the obligors, the hypothesis or and is given. In other words, the temporal correlation is either a high or low case, or a low correlation case. Hereafter, we use this approximation to calculate the variance of this process.
We extend the stochastic process to the multi-year case. Let be an independent and identically distributed (i.i.d.) sequence that is uniformly distributed on [0,1]. The discrete dynamics of the process is described by:
[TABLE]
when . Here is given by
[TABLE]
The expectation value of is When , the process is beta-binomial.
We consider the relationship between the year and . The distribution of year is a beta-binomial distribution. Hence, the conditional variance, , of the year can be evaluated, using the above approximation, as
[TABLE]
Therefore, the difference of the summations and correspond to the the variance of step. Therefore, using this approximation, the correlation between the and years is approximated by . The term plays the role of a discount factor in the correlation . It can be seen that as time progresses, the correlation is discounted. It is reasonable to assume a monotonically decreasing function for because the effects decrease as the distance between and increase.
The total variance for the diffusion is approximated by
[TABLE]
The first, second, and third terms correspond to the variance for binomial distribution, constant correlation in the portfolio, and temporal correlation, respectively.
In summary, when or , the correlation between year and year is approximated by
[TABLE]
The average PD, correlation of the Bernoulli random variables, and temporal correlation using this approximation are , , and , respectively.
In the Bayesian estimation, if the scaled variance converges as the data increases, these parameters can be estimated correctly. Conversely, if the variance does not converge, the parameters cannot be estimated. It should also be considered whether the process has a stationary solution, which will be discussed regarding the spectrum analysis in the following sections.
It is difficult to estimate all the values due to limited data. By introducing a prior distribution for , the estimation becomes a hierarchical Bayesian estimation. It is reasonable to assume that the prior distribution is a monotonically decreasing function. Therefore, we considered two hyperprior distributions, an exponential and power decay, to have long memory.
IV Phase transition in the estimation of PD
In this section we determine whether the PD in the Bayesian estimation converges. To simplify the model, we set in Eq. (7). This does not affect the outcome of the PD estimation. Let be an independent and identically distributed (i.i.d.) sequence that is uniformly distributed on [0,1]. The discrete dynamics of the process is described by:
[TABLE]
Here, is the weighted sum of with the discount factor ,
[TABLE]
This is the Pólya urn modelPolya with a discount factor .
The expectation value of is The PD estimator is ,
[TABLE]
The success of the PD estimation depends on the the behavior of the variance of . More specifically, if the variance of converges, then the PD can be estimated.
IV.1 Stochastic differential equation
First, the stochastic process is rewritten using ;
[TABLE]
where and are the process probabilities. The sum of and is .
For convenience, we define a new variable such that
[TABLE]
We change the variables from to and to . Given , we obtain a random walk model:
[TABLE]
We now consider the continuous limit ,
[TABLE]
where and . On approaching the continuous limit, we obtain the following stochastic partial differential equation:
[TABLE]
where is the continuous function of , the discount factor, and .
We are interested in the behavior of in the limit . We assume that the stationary solution is
[TABLE]
where is a constant. Substituting Eq. (15)into Eq. (14), we obtain
[TABLE]
where .
Eq. (16) is a self-consistent equation. When , Eq. (16) is solved when . The process converges to the average point. On the other hand, when , we can obtain the identity equation , suggesting that the process does not converge to the delta function. The expected value of is . Hence, the phase transition at the point diverges to infinity. When the distribution does not converge, we cannot estimate the parameters correctly, even if the amount of data increases. This is a critical issue when using the Bayesian estimation. In other words, is a compulsory condition for parameter estimation.
IV.2 Correlation function and finite size scaling analysis
To understand the phase transition, we investigated the correlation function, . is defined as the correlation between and such that
[TABLE]
The function represents the propagation of the memory of to later variables . To understand the relationship between the variances of and , the variance of can be written as
[TABLE]
where is the conditional variance of on . The expectation value of is and the probability function is . The second term on the right-hand side of Eq. (18) represents the variance of from the dependence on . In Eq. (18), the second term is related to as it originates from the dependence of on . We write the second term of as
[TABLE]
If , and does not converge.
Using , we obtain the next relation for the conditional expectation value of with the condition , , as
[TABLE]
As , we obtain the following recursive relation for as
[TABLE]
This recursive relation contains all information regarding the asymptotic behavior of . If one assumes a functional form for with the initial condition , we can estimate for .
IV.2.1 Exponential decay case
We consider the exponential decay case, . is finite and there is no phase transition. We decompose the numerator of Eq. (19) as . We rewrite the second term using Eq. (19) for as
[TABLE]
We then obtain the next recursive relation for :
[TABLE]
As we are interested in the asymptotic behavior of , we estimate the decay rate, , with , which gives
[TABLE]
where for , and decays exponentially.
Numerical studies of the system were performed. To estimate , the recursive relation of Eq. (19) is solved for . A Monte Carlo sampling procedure is adopted for the variance of . We obtained sample sequences for and estimated the variance of . Figure 1 (a) shows the plot of vs. . It is clearly shown that decays exponentially. Figure 1 (b) shows the plot of V vs. . For all , decays as . When , the distribution converges to the beta distribution. Hence, there is no phase transition for .
IV.2.2 Power-law decay case
For the case of power-law decay, namely , when and , the process converges to the delta function. On the other hand, when and goes to infinity, the process does not converge.
The behaviors of and were investigated by the numerical method, in the same manner as the exponential decay case. Fig. 2 (a) shows the double logarithmic plot of vs. . It can be seen that decays with a power-law form for . For small , such as , the slope is extremely small. Fig. 2 (b) shows the double logarithmic plot of vs. . For , decays as . At , the slope of the decay is less than one. For , the curve is concave down. These results suggest the validity of the self-consistent equation analysis.
To investigate the phase transition, we apply finite-size scaling (FSS) analysis Mori5 . We define the relaxation and second-moment correlation times, and , respectively, using the moment of as
[TABLE]
For FSS, we assume that the scaling function, , for some observable, , with a scale factor, , is expressed as a function of such that
[TABLE]
We assume the following asymptotic forms for ;
[TABLE]
Here, is the order parameter of the phase transition and is a constant. Using the asymptotic forms, we can classify the behavior of the scaling functions. We show the results for and in Table I. (In detail, see Appendix B)
Figure 3 shows the numerical estimations of and vs. with . The symbols show the fixed points under the renormalization transformation . There are two stable fixed points at and , and one unstable fixed point at . If , then and moves to under the transformation and . diverges linearly with the system size, , at the fixed point, which reflects the memory of that retains. If , and moves to [math]. and the memory of is lost for sufficiently large . At the stable fixed points of and at , becomes 2 and 1, respectively. From the unstable fixed point at , we can estimate using . This estimation is in accordance with the estimation from . These results support the phase transition between the two phases, and , in the limit . At the critical point , and with .
We estimate by and by with and . By comparing the values for and , one can anticipate the limit behavior . The results are shown in Fig. 4. Figure 4 (a) shows vs. . For , is almost zero. For , is positive. The derivative of at is seemingly continuous. Figure 4 (b) shows vs with and . For , one can anticipate that by observing the change from to . For , which suggests that . At the critical point , depends on .
Next, we investigated at the critical point . We assume that . Eq. (19) can be approximated in the continuous limit as
[TABLE]
By the following change of variables, , we obtain
[TABLE]
We see that depends on and through the combination . In the limit , when and [math], and , respectively. The critical exponent is in the range . Figure 4 (c) shows vs. for . We adopt two cases and . The symbols show the results of the numerical estimation, and the solid line shows the results by numerically solving Eq. (24). The results for and collapses onto the same curve vs. , which confirms that depends on and through .
V Is the temporal correlation decay exponential or power?
In this section, we use three data sets from the default data. Two sets are rating agency data, and the other is from a Japanese company.
V.1 Standard & Poor’s data
As discussed in the previous section, temporal correlation is a critical issue for determining whether there is an exponential or a power decay. This affects whether the parameters are estimated correctly. In this section we investigate the temporal correlation using empirical data. First, the S&P default data from 1981 to 2017 Data1 are used. The average PD is 1.58 for all ratings and 3.09 for speculative ratings. A speculative grade rating represents the rating under BBB-(Baa3). In Fig. 5 (a) we show the historical default rate. The solid and dotted lines correspond to all the samples and the speculative grade, respectively, below BBB+(Baa3).
The autocorrelation is shown in Fig. 6 (a). The x-axis represents the year. The exponential decay and cyclical increase are confirmed. This represents the cyclical bubbles and their collapse in recent years. However, it is difficult to confirm whether the decay is exponential or power-law from the autocorrelation data alone. Therefore, a Fourier transformation was applied to the PD data in Fig. 7 (a), but it was still difficult to obtain confirmation because the data was annual, and its size was not very large.
V.2 Moody’s data
Next, we used Moody’s default data from 1920 to 2017 for 98 years Data2 . It includes the Great Depression in 1929 and Great Recession in 2008. It is one of the longest sets of default data Hec . The average default rate is 1.56 for all the ratings and for the speculative ratings. In Fig. 5 (b), we show the historical default rate.
The autocorrelation is shown in Fig. 6 (b). The x-axis represents the year. The exponential decay is confirmed for a short time. Over the long historical data, we cannot confirm the cyclical trend that was observed in recent years. We applied a Fourier transformation to the default ratio data in Fig. 7 (b). as it is difficult to confirm whether the decay is exponential or power-law from the autocorrelation alone.
V.3 Risk Data Bank data
Next, we apply our data to the risk data bank (RDB) data Data3 . The data covers all of the enterprise data without individual owner-managers in Japan. The data is monthly from 2001 to 2017 and the seasonal effects were adjusted. The historical data and autocorrelation are shown in Fig. 8 (a), which is different from the previous two samples. The slow decay of the correlation was confirmed. In Fig. 8 (b) 1/f fluctuations were confirmed. This corresponds to the power decay of the Wiener-Khinchin theorem, which shows the relationship between the autocorrelation and power spectrum by a Fourier transformation. In Fig. 9 we show the power spectrum for each sector, namely, construction, wholesale, real estate, retail sales, other services, and manufacturing. The solid line represents the trend. We can conclude that the temporal correlation may contain a long memory for this data. However, it is difficult to confirm a strict power law.
VI Estimation of parameters
We estimate the long run probability of default, , and the default correlation, , for S&P and Moody’s data by the MAP estimation. We use a uniform distribution for the prior distribution . As discussed in the previous section, the exponential and power decays are used for the temporal correlation. The conclusions are listed in Table 2 for the exponential and power decay models. We confirmed a small value that represents the small temporal correlation. The parameter for the power decay is greater than the phase transition point, . The PD and default correlation are almost the same as the estimations by the exponential and power decay models. The reason is that the power index, , is adequately large and there is only a small difference between the exponential and power decay models. The first- and second-year’s temporal correlations, and , respectively, are important for representing the data.
The parameters depend on the data terms. In the recent past, the default and temporal correlations have become minimal. This may depend on the smooth financial operations of governments and central banks. Alternatively, the long history data of 100 years have long correlations that are less than the phase transitions. This depends on the old data before the 1980s. For the RDB data, we can estimate , which is in the normal convergence phase. Hence, we can estimate the PD by the Bayesian formula, which we introduced.
VII Concluding Remarks
In this paper, we introduced a hierarchical Bayesian estimation method using the beta-binomial distribution to estimate the parameters, probability of default (PD), and default correlation. Moreover, we considered a multi-year case with temporal correlation. We confirmed phase transitions when the temporal correlation decayed by a power curve, which means that the correlation had a long memory. Conversely, for the case of exponential decay, there was no phase transition. When the power index, , was above or equal to one, the estimator distribution of the PD converged. Conversely, when the power index was less than 1, the distribution did not converge. The critical exponent depended on the microscopic feature of the model and the universality class of the phase transition differed from those of the nonlinear Pólya urn. We call this phase transition a ”short memory-long memory transition”. In summary, the condition for the estimation of parameters is .
To confirm the form of the decay, we investigated the empirical default history data using a Fourier transformation. We determined that the power spectrum of the default history was seemingly 1/f, which implies that the correlation had a long memory for the RDB monthly data. We applied this method to the historical data and estimated the parameters. The region of the power index provided normal convergence. We have demonstrated that, for adequate data collection, these parameters can be estimated correctly.
Acknowledgements.
This work is supported by JPSJ KAKENHI [Grant No. 17K00347].
Appendix A MAP estimation for Multi-year case
We extend the maximum a posteriori (MAP) estimation, which we discussed in section 2 for the multi-year case. The number of obligors and defaults in the year are and , respectively. When a prior function, , is a constant function, the maximum point is
[TABLE]
where is the digamma function. and are the adjusted and . and . The first term in the last set of parentheses in Eq. (25) is a monotonously decreasing function about , because increases. The second term in the last set of parentheses is a monotonously increasing function about , because decreases. When , the difference of the two terms is positive because . In contrast, when , the difference of the two terms becomes negative because . Hence, the function, , has one peak in the range .
Appendix B Scaling functions and
We define the relaxation and second-moment correlation times, and , respectively, using the moment of as in Eq. (LABEL:moment). If we assume that , behaves as
[TABLE]
Using the asymptotic behavior of , we find behaves as
[TABLE]
behaves as
[TABLE]
The scaling function for is defined as . From the asymptotic behavior of , we have
[TABLE]
For , and the scaling function is given in terms of as
[TABLE]
and in the limit .
The scaling function for is defined as . We have
[TABLE]
By the renormalization transformation , for . For , . The critical state of the system exists at .
We assume and rapidly decays to zero. and . and holds.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) D. Brockmann, L. Hufinage, and Geisel, Nature 439 , 462 (2006).
- 2(2) I. T. Wong, M. L. Gardel, D. R. Reichman, E. R. Weeks, M. T. Valentine, A. R. Bausch, and D. A. Weitz, Phys. Rev. Lett. 92 , 178101 (2004).
- 3(3) Y. Gefen Y., A. Aharony, and S. Alexander S., Phys. Rev. Lett. 50 , 77 (1983).
- 4(4) R. Metzler and J. Klafter, Phys. Rep. 339 , 1 (2000).
- 5(5) G. Galam, Stat. Phys. 61 , 943 (1990).
- 6(6) G. Galam, Inter J. Mod. Phys. C 19(03) , 409 (2008).
- 7(7) M. Hisakado, F. Sano, and S. Mori, Phys Soc. Jpn 87 (2) , 024002 (2018).
- 8(8) N. M. Mantegna and H. E. Stanley, Introduction to Econophysics: Correlations and Complexity in Finance (Cambridge University Press, 2000).
