Variance Reduction Applied to Machine Learning for Pricing   Bermudan/American Options in High Dimension

Ludovic Gouden\`ege; Andrea Molent; Antonino Zanette

arXiv:1903.11275·q-fin.CP·December 4, 2019

Variance Reduction Applied to Machine Learning for Pricing Bermudan/American Options in High Dimension

Ludovic Gouden\`ege, Andrea Molent, Antonino Zanette

PDF

Open Access

TL;DR

This paper introduces a variance reduction technique combined with machine learning and Monte Carlo methods to efficiently price high-dimensional multi-asset American options, overcoming the curse of dimensionality.

Contribution

It develops a novel algorithm that integrates control variates with Gaussian process regression for high-dimensional American option pricing.

Findings

01

The method is fast and reliable for large baskets.

02

Variance reduction improves accuracy in high dimensions.

03

The approach outperforms traditional methods in high-dimensional settings.

Abstract

In this paper we propose an efficient method to compute the price of multi-asset American options, based on Machine Learning, Monte Carlo simulations and variance reduction technique. Specifically, the options we consider are written on a basket of assets, each of them following a Black-Scholes dynamics. In the wake of Ludkovski's approach (2018), we implement here a backward dynamic programming algorithm which considers a finite number of uniformly distributed exercise dates. On these dates, the option value is computed as the maximum between the exercise value and the continuation value, which is obtained by means of Gaussian process regression technique and Monte Carlo simulations. Such a method performs well for low dimension baskets but it is not accurate for very high dimension baskets. In order to improve the dimension range, we employ the European option price as a control…

Tables9

Table 1. Table 1: European price results for the Geometric and Arithmetic Basket Put option obtained by using the GPR-EI formula. In the last column the prices obtained by using a Monte Carlo simulation. The values in brackets are the computational times (in seconds).

		Geometric Basket Put					Arithmetic Basket Put
		GPR-EI				Bm	GPR-EI				Bm
$d$	$P$	$250$	$500$	$1000$	$8000$		$250$	$500$	$1000$	$8000$
2		$\underset{(2)}{4.10}$	$\underset{(3)}{4.11}$	$\underset{(15)}{4.13}$	$\underset{(44)}{4.17}$	$4.18$	$\underset{(2)}{3.83}$	$\underset{(1)}{3.85}$	$\underset{(12)}{3.86}$	$\underset{(41)}{3.90}$	$3.92$
5		$\underset{(2)}{2.90}$	$\underset{(1)}{2.98}$	$\underset{(3)}{3.01}$	$\underset{(24)}{3.04}$	$3.06$	$\underset{(1)}{2.49}$	$\underset{(1)}{2.57}$	$\underset{(3)}{2.60}$	$\underset{(26)}{2.63}$	$2.64$
10		$\underset{(1)}{2.48}$	$\underset{(1)}{2.45}$	$\underset{(3)}{2.52}$	$\underset{(26)}{2.59}$	$2.59$	$\underset{(1)}{2.01}$	$\underset{(1)}{2.03}$	$\underset{(3)}{2.08}$	$\underset{(25)}{2.13}$	$2.14$
20		$\underset{(1)}{2.28}$	$\underset{(1)}{2.33}$	$\underset{(4)}{2.26}$	$\underset{(31)}{2.31}$	$2.33$	$\underset{(1)}{1.81}$	$\underset{(1)}{1.84}$	$\underset{(4)}{1.80}$	$\underset{(26)}{1.85}$	$1.86$
40		$\underset{(1)}{2.12}$	$\underset{(1)}{2.18}$	$\underset{(5)}{2.21}$	$\underset{(44)}{2.17}$	$2.20$	$\underset{(1)}{1.73}$	$\underset{(1)}{1.74}$	$\underset{(4)}{1.73}$	$\underset{(37)}{1.71}$	$1.72$
100		$\underset{(1)}{2.03}$	$\underset{(1)}{2.07}$	$\underset{(7)}{2.09}$	$\underset{(43)}{2.08}$	$2.11$	$\underset{(1)}{1.93}$	$\underset{(1)}{1.63}$	$\underset{(5)}{1.67}$	$\underset{(35)}{1.62}$	$1.63$

Table 2. Table 2: American price results for a Geometric basket Put option obtained by using the GPR-MC method. In the last column the exact benchmark. The values in brackets are the computational times (in seconds).

$d$		$10^{3}$	$10^{4}$	$10^{5}$	$10^{3}$	$10^{4}$	$10^{5}$	$10^{3}$	$10^{4}$	$10^{5}$	Ekvall	Bm
	$P$	$250$			$500$			$1000$
2		$\underset{(8)}{4.54}$	$\underset{(36)}{4.57}$	$\underset{(347)}{4.58}$	$\underset{(19)}{4.57}$	$\underset{(140)}{4.59}$	$\underset{(1340)}{4.57}$	$\underset{(72)}{4.52}$	$\underset{(605)}{4.60}$	$\underset{(5121)}{4.57}$	$4.62$	$4.62$
5		$\underset{(8)}{3.55}$	$\underset{(42)}{3.43}$	$\underset{(498)}{3.44}$	$\underset{(25)}{3.43}$	$\underset{(146)}{3.45}$	$\underset{(1378)}{3.44}$	$\underset{(56)}{3.45}$	$\underset{(508)}{3.42}$	$\underset{(4761)}{3.43}$	$3.44$	$3.45$
10		$\underset{(9)}{2.99}$	$\underset{(46)}{3.03}$	$\underset{(431)}{3.02}$	$\underset{(29)}{2.96}$	$\underset{(142)}{2.98}$	$\underset{(1517)}{2.95}$	$\underset{(55)}{2.98}$	$\underset{(616)}{2.95}$	$\underset{(5281)}{2.96}$		$2.97$
20		$\underset{(10)}{2.68}$	$\underset{(85)}{2.68}$	$\underset{(463)}{2.69}$	$\underset{(30)}{2.75}$	$\underset{(210)}{2.70}$	$\underset{(1441)}{2.72}$	$\underset{(64)}{2.72}$	$\underset{(597)}{2.69}$	$\underset{(5598)}{2.70}$		$2.70$
40		$\underset{(14)}{2.71}$	$\underset{(104)}{2.58}$	$\underset{(621)}{2.58}$	$\underset{(24)}{2.60}$	$\underset{(263)}{2.61}$	$\underset{(2094)}{2.62}$	$\underset{(74)}{2.51}$	$\underset{(655)}{2.55}$	$\underset{(6373)}{2.54}$		$2.56$
100		$\underset{(26)}{2.50}$	$\underset{(110)}{2.51}$	$\underset{(1822)}{2.50}$	$\underset{(42)}{2.48}$	$\underset{(321)}{2.45}$	$\underset{(3817)}{2.45}$	$\underset{(112)}{2.43}$	$\underset{(892)}{2.45}$	$\underset{(12410)}{2.43}$		$2.47$

Table 3. Table 4: American price results for a Geometric basket Put option obtained by using the GPR-Tree and GPR-EI methods (without control variate). In the last column the exact benchmark. The values in brackets are the computational times (in seconds).

$d$	$250$	$500$	$1000$	$250$	$500$	$1000$	Ekvall	Bm
	GPR-Tree			GPR-EI
2	$\underset{(4)}{4.61}$	$\underset{(7)}{4.61}$	$\underset{(22)}{4.61}$	$\underset{(4)}{4.58}$	$\underset{(9)}{4.58}$	$\underset{(26)}{4.57}$	$4.62$	$4.62$
5	$\underset{(9)}{3.44}$	$\underset{(15)}{3.43}$	$\underset{(23)}{3.44}$	$\underset{(4)}{3.40}$	$\underset{(14)}{3.43}$	$\underset{(27)}{3.41}$	$3.44$	$3.45$
10	$\underset{(10)}{3.00}$	$\underset{(33)}{2.96}$	$\underset{(60)}{2.93}$	$\underset{(4)}{2.85}$	$\underset{(9)}{2.88}$	$\underset{(30)}{2.93}$		$2.97$
20				$\underset{(4)}{2.63}$	$\underset{(9)}{2.73}$	$\underset{(29)}{2.63}$		$2.70$
40				$\underset{(4)}{2.45}$	$\underset{(10)}{2.52}$	$\underset{(38)}{2.53}$		$2.56$
100				$\underset{(5)}{2.27}$	$\underset{(15)}{2.32}$	$\underset{(45)}{2.39}$		$2.47$

Table 4. Table 6: American price results for a Arithmetic basket Put option obtained by using the GPR-MC . In the last column the exact benchmark. The values in brackets are the computational times (in seconds).

	$P$	$250$			$500$			$1000$
$d$		$10^{3}$	$10^{4}$	$10^{5}$	$10^{3}$	$10^{4}$	$10^{5}$	$10^{3}$	$10^{4}$	$10^{5}$	Ekvall
2		$\underset{(8)}{4.34}$	$\underset{(43)}{4.37}$	$\underset{(365)}{4.38}$	$\underset{(21)}{4.37}$	$\underset{(145)}{4.39}$	$\underset{(1588)}{4.37}$	$\underset{(62)}{4.32}$	$\underset{(767)}{4.40}$	$\underset{(5183)}{4.37}$	$4.42$
5		$\underset{(9)}{3.25}$	$\underset{(40)}{3.12}$	$\underset{(380)}{3.14}$	$\underset{(20)}{3.12}$	$\underset{(149)}{3.14}$	$\underset{(1531)}{3.13}$	$\underset{(60)}{3.14}$	$\underset{(565)}{3.11}$	$\underset{(5713)}{3.11}$	$3.15$
10		$\underset{(11)}{2.64}$	$\underset{(40)}{2.69}$	$\underset{(419)}{2.67}$	$\underset{(18)}{2.65}$	$\underset{(149)}{2.66}$	$\underset{(1551)}{2.64}$	$\underset{(55)}{2.66}$	$\underset{(641)}{2.62}$	$\underset{(5149)}{2.63}$
20		$\underset{(17)}{2.27}$	$\underset{(68)}{2.27}$	$\underset{(631)}{2.28}$	$\underset{(22)}{2.39}$	$\underset{(164)}{2.35}$	$\underset{(1626)}{2.36}$	$\underset{(70)}{2.39}$	$\underset{(620)}{2.36}$	$\underset{(5817)}{2.37}$
40		$\underset{(16)}{2.21}$	$\underset{(94)}{2.11}$	$\underset{(780)}{2.10}$	$\underset{(35)}{2.17}$	$\underset{(226)}{2.19}$	$\underset{(2165)}{2.19}$	$\underset{(105)}{2.15}$	$\underset{(692)}{2.19}$	$\underset{(8739)}{2.18}$
100		$\underset{(20)}{1.94}$	$\underset{(110)}{1.95}$	$\underset{(1494)}{1.94}$	$\underset{(34)}{1.94}$	$\underset{(306)}{1.93}$	$\underset{(3452)}{1.92}$	$\underset{(91)}{1.95}$	$\underset{(884)}{1.97}$	$\underset{(9820)}{1.95}$

Table 5. Table 8: American price results for a Arithmetic basket Put option obtained by using the GPR-Tree and GPR-EI methods (without control variate). In the last column the exact benchmark. The values in brackets are the computational times (in seconds).

		GPR-Tree			GPR-EI
$d$	$P$	$250$	$500$	$1000$	$250$	$500$	$1000$	Ekvall
2		$\underset{(5)}{4.42}$	$\underset{(9)}{4.42}$	$\underset{(25)}{4.42}$	$\underset{(4)}{4.38}$	$\underset{(9)}{4.38}$	$\underset{(28)}{4.37}$	$4.42$
5		$\underset{(5)}{3.15}$	$\underset{(9)}{3.12}$	$\underset{(24)}{3.13}$	$\underset{(6)}{3.09}$	$\underset{(9)}{3.12}$	$\underset{(44)}{3.10}$	$3.15$
10		$\underset{(10)}{2.71}$	$\underset{(21)}{2.64}$	$\underset{(70)}{2.62}$	$\underset{(5)}{2.49}$	$\underset{(9)}{2.56}$	$\underset{(38)}{2.60}$
20					$\underset{(6)}{2.26}$	$\underset{(14)}{2.31}$	$\underset{(42)}{2.28}$
40					$\underset{(4)}{2.18}$	$\underset{(10)}{2.18}$	$\underset{(31)}{2.16}$
100					$\underset{(7)}{2.35}$	$\underset{(13)}{2.01}$	$\underset{(42)}{2.06}$

Table 6. Table 10: European price results for a Call on the Maximum option, obtained by using the GPR-EI formula. In the last column the prices obtained by using a Monte Carlo simulation. The values in brackets are the computational times (in seconds).

		GPR-EI				Bm
$d$	$P$	$250$	$500$	$1000$	$8000$
2		$\underset{(1)}{10.77}$	$\underset{(2)}{10.94}$	$\underset{(12)}{10.99}$	$\underset{(89)}{11.14}$	$11.19$
5		$\underset{(1)}{22.36}$	$\underset{(2)}{22.68}$	$\underset{(10)}{22.82}$	$\underset{(43)}{22.99}$	$23.04$
10		$\underset{(1)}{34.37}$	$\underset{(2)}{34.38}$	$\underset{(8)}{34.86}$	$\underset{(43)}{35.49}$	$35.59$
20		$\underset{(1)}{48.31}$	$\underset{(1)}{49.60}$	$\underset{(7)}{48.57}$	$\underset{(44)}{49.28}$	$49.45$
30		$\underset{(1)}{57.47}$	$\underset{(1)}{57.46}$	$\underset{(5)}{57.05}$	$\underset{(28)}{57.62}$	$57.68$
50		$\underset{(1)}{66.65}$	$\underset{(1)}{67.60}$	$\underset{(1)}{67.94}$	$\underset{(76)}{68.13}$	$68.03$
100		$\underset{(1)}{80.34}$	$\underset{(1)}{81.20}$	$\underset{(5)}{81.45}$	$\underset{(34)}{82.00}$	$82.14$

Table 7. Table 11: American price results for a Call on the Maximum option obtained by using the GPR-MC . In the last column the exact benchmark. The values in brackets are the computational times (in seconds). In the last column the confidence intervals reported in [ 7 ] .

$d$		$10^{3}$	$10^{4}$	$10^{5}$	$10^{3}$	$10^{4}$	$10^{5}$	$10^{3}$	$10^{4}$	$10^{5}$	$95 % c.i.$
	$P$	$250$			$500$			$1000$			Becker et al.
2		$\underset{(10)}{14.04}$	$\underset{(32)}{13.89}$	$\underset{(404)}{13.91}$	$\underset{(17)}{13.87}$	$\underset{(128)}{13.87}$	$\underset{(1156)}{13.92}$	$\underset{(53)}{13.67}$	$\underset{(679)}{13.89}$	$\underset{(4479)}{13.92}$	$[13.88, 13.91]$
5		$\underset{(8)}{26.98}$	$\underset{(36)}{26.65}$	$\underset{(362)}{26.76}$	$\underset{(19)}{26.19}$	$\underset{(144)}{26.54}$	$\underset{(1633)}{26.39}$	$\underset{(57)}{26.47}$	$\underset{(566)}{26.40}$	$\underset{(4837)}{26.43}$	$[26.14, 26.17]$
10		$\underset{(9)}{38.84}$	$\underset{(42)}{38.96}$	$\underset{(430)}{38.86}$	$\underset{(18)}{39.09}$	$\underset{(137)}{39.31}$	$\underset{(1363)}{39.37}$	$\underset{(54)}{38.42}$	$\underset{(503)}{38.62}$	$\underset{(4304)}{38.59}$	$[38.30, 38.37]$
20		$\underset{(10)}{60.16}$	$\underset{(72)}{59.79}$	$\underset{(378)}{59.87}$	$\underset{(26)}{59.61}$	$\underset{(141)}{59.66}$	$\underset{(1431)}{59.61}$	$\underset{(58)}{58.23}$	$\underset{(543)}{58.20}$	$\underset{(4835)}{58.21}$	$[51.55, 51.80]$
30		$\underset{(15)}{73.97}$	$\underset{(73)}{73.73}$	$\underset{(439)}{73.75}$	$\underset{(23)}{80.00}$	$\underset{(185)}{79.60}$	$\underset{(1789)}{79.69}$	$\underset{(65)}{73.40}$	$\underset{(550)}{73.15}$	$\underset{(5192)}{73.22}$	$[59.48, 59.87]$
50		$\underset{(15)}{93.27}$	$\underset{(84)}{93.43}$	$\underset{(1058)}{93.26}$	$\underset{(27)}{21.12}$	$\underset{(234)}{21.31}$	$\underset{(2178)}{21.30}$	$\underset{(69)}{113.61}$	$\underset{(604)}{113.24}$	$\underset{(6247)}{113.18}$	$[69.56, 69.95]$
100		$\underset{(19)}{0.01}$	$\underset{(85)}{0.01}$	$\underset{(1318)}{0.01}$	$\underset{(32)}{88.44}$	$\underset{(284)}{88.05}$	$\underset{(3105)}{88.14}$	$\underset{(88)}{138.42}$	$\underset{(747)}{138.18}$	$\underset{(8602)}{138.25}$	$[83.36, 83.86]$

Table 8. Table 13: American price results for a Call on the Maximum option by using the GPR-Tree and GPR-EI methods (without control variate). In the last column the exact benchmark. The values in brackets are the computational times (in seconds).

$d$	$250$	$500$	$1000$	$1000$	$2000$	$4000$	et al.
	GPR-Tree			GPR-EI			Becker
2	$\underset{(4)}{13.83}$	$\underset{(8)}{13.83}$	$\underset{(18)}{13.85}$	$\underset{(19)}{13.50}$	$\underset{(49)}{13.51}$	$\underset{(53)}{13.51}$	$[13.88, 13.91]$
5	$\underset{(5)}{25.95}$	$\underset{(9)}{25.82}$	$\underset{(22)}{25.78}$	$\underset{(20)}{25.23}$	$\underset{(70)}{25.33}$	$\underset{(60)}{25.39}$	$[26.14, 26.17]$
10	$\underset{(7)}{37.76}$	$\underset{(18)}{37.79}$	$\underset{(47)}{37.64}$	$\underset{(21)}{35.90}$	$\underset{(67)}{36.69}$	$\underset{(75)}{37.09}$	$[38.30, 38.37]$
20				$\underset{(23)}{46.67}$	$\underset{(73)}{49.31}$	$\underset{(100)}{49.74}$	$[51.55, 51.80]$
30				$\underset{(29)}{53.66}$	$\underset{(94)}{54.00}$	$\underset{(111)}{59.14}$	$[59.48, 59.87]$
50				$\underset{(30)}{62.17}$	$\underset{(86)}{25.84}$	$\underset{(131)}{71.86}$	$[69.56, 69.95]$
100				$\underset{(32)}{70.36}$	$\underset{(145)}{74.84}$	$\underset{(262)}{51.74}$	$[83.36, 83.86]$

Table 9. Table 15: Standard deviation for the prices of an American Geometric Basket Put option computed by means of the GPR-MC method (100 repetitions). Values between brackets are 95 % percent 95 95\% confidence intervals for the standard deviation. All results must be multiplied by 10 − 3 superscript 10 3 10^{-3} .

$d$	$M$	$10^{3}$	$10^{4}$	$10^{3}$	$10^{4}$	$10^{3}$	$10^{4}$
	$P$	$250$		$500$		$1000$
2		$\underset{[61.3, 81.1]}{69.8}$	$\underset{[19.6, 25.9]}{22.3}$	$\underset{[57.6, 76.3]}{65.7}$	$\underset{[18.0, 23.8]}{20.5}$	$\underset{[56.3, 74.5]}{64.2}$	$\underset{[17.7, 23.4]}{20.2}$
5		$\underset{[47.1, 62.3]}{53.6}$	$\underset{[15.1, 20.0]}{17.2}$	$\underset{[48.5, 64.2]}{55.2}$	$\underset{[16.2, 21.4]}{18.5}$	$\underset{[41.3, 54.6]}{47.0}$	$\underset{[12.7, 16.9]}{14.5}$
10		$\underset{[46.7, 61.8]}{53.2}$	$\underset{[14.1, 18.7]}{16.1}$	$\underset{[44.0, 58.3]}{50.1}$	$\underset{[13.2, 17.4]}{15.0}$	$\underset{[42.8, 56.6]}{48.7}$	$\underset{[13.1, 17.3]}{14.9}$
20		$\underset{[46.6, 61.7]}{53.1}$	$\underset{[14.7, 19.5]}{16.8}$	$\underset{[43.8, 58.0]}{49.9}$	$\underset{[14.4, 19.1]}{16.4}$	$\underset{[40.8, 53.9]}{46.4}$	$\underset{[13.3, 17.7]}{15.2}$
40		$\underset{[66.2, 87.6]}{75.4}$	$\underset{[21.6, 28.5]}{24.6}$	$\underset{[150.1, 66.3]}{57.1}$	$\underset{[15.1, 19.9]}{17.2}$	$\underset{[44.0, 58.2]}{50.1}$	$\underset{[13.3, 17.6]}{15.1}$
100		$\underset{[6.75, 89.3]}{76.8}$	$\underset{[21.4, 28.3]}{24.3}$	$\underset{[60.0, 79.4]}{68.3}$	$\underset{[18.0, 23.8]}{20.6}$	$\underset{[53.6, 70.9]}{61.1}$	$\underset{[16.7, 23.2]}{19.0}$

Equations85

d S_{t}^{i} = (r - η_{i}) S_{t}^{i} d t + σ_{i} S_{t}^{i} d W_{t}^{i}, i = 1, \dots, d,

d S_{t}^{i} = (r - η_{i}) S_{t}^{i} d t + σ_{i} S_{t}^{i} d W_{t}^{i}, i = 1, \dots, d,

v^{A M} (t, x) = τ \in T_{t, T} sup E_{t, x} [e^{- r (τ - t)} Ψ (S_{τ})],

v^{A M} (t, x) = τ \in T_{t, T} sup E_{t, x} [e^{- r (τ - t)} Ψ (S_{τ})],

d S_{t}^{i} = S_{t}^{i} ((r - η_{i}) d t + σ_{i} Σ_{i} d B_{t}),

d S_{t}^{i} = S_{t}^{i} ((r - η_{i}) d t + σ_{i} Σ_{i} d B_{t}),

Γ = 1 ρ_{21} ⋮ ρ_{d 1} ρ_{12} 1 ⋱ \dots \dots ⋱ ⋱ \dots ρ_{1 d} ⋮ ⋮ 1 .

Γ = 1 ρ_{21} ⋮ ρ_{d 1} ρ_{12} 1 ⋱ \dots \dots ⋱ ⋱ \dots ρ_{1 d} ⋮ ⋮ 1 .

y_{p} = f_{p} + ε_{p},

y_{p} = f_{p} + ε_{p},

f \sim N (0, K (X, X)),

f \sim N (0, K (X, X)),

y \sim N (0, K (X, X) + σ_{P}^{2} I_{P}),

y \sim N (0, K (X, X) + σ_{P}^{2} I_{P}),

\left[\begin{array}[]{c}\mathbf{y}\\ \mathbf{\tilde{f}}\end{array}\right]\sim\mathcal{N}\left(\left[\begin{array}[]{c}\mathbf{0}_{P}\\ \mathbf{0}_{M}\end{array}\right],\left[\begin{array}[]{cc}K\left(X,X\right)+\sigma_{P}^{2}I_{P}&K\left(X,\tilde{X}\right)\\ K\left(\tilde{X},X\right)&K\left(\tilde{X},\tilde{X}\right)\end{array}\right]\right)

\left[\begin{array}[]{c}\mathbf{y}\\ \mathbf{\tilde{f}}\end{array}\right]\sim\mathcal{N}\left(\left[\begin{array}[]{c}\mathbf{0}_{P}\\ \mathbf{0}_{M}\end{array}\right],\left[\begin{array}[]{cc}K\left(X,X\right)+\sigma_{P}^{2}I_{P}&K\left(X,\tilde{X}\right)\\ K\left(\tilde{X},X\right)&K\left(\tilde{X},\tilde{X}\right)\end{array}\right]\right)

\tilde{f} ∣ \tilde{X}, y, X \sim N (E [\tilde{f} ∣ \tilde{X}, y, X], C o v [\tilde{f} ∣ \tilde{X}, y, X]),

\tilde{f} ∣ \tilde{X}, y, X \sim N (E [\tilde{f} ∣ \tilde{X}, y, X], C o v [\tilde{f} ∣ \tilde{X}, y, X]),

E [\tilde{f} ∣ \tilde{X}, y, X] = K (\tilde{X}, X) [K (X, X) + σ_{P}^{2} I_{P}]^{- 1} y

E [\tilde{f} ∣ \tilde{X}, y, X] = K (\tilde{X}, X) [K (X, X) + σ_{P}^{2} I_{P}]^{- 1} y

C o v [\tilde{f} ∣ \tilde{X}, y, X] = K (\tilde{X}, \tilde{X}) - K (\tilde{X}, X) [K (X, X) + σ_{P}^{2} I_{P}]^{- 1} K (X, \tilde{X}) .

C o v [\tilde{f} ∣ \tilde{X}, y, X] = K (\tilde{X}, \tilde{X}) - K (\tilde{X}, X) [K (X, X) + σ_{P}^{2} I_{P}]^{- 1} K (X, \tilde{X}) .

f^{GP R} (\tilde{x})

f^{GP R} (\tilde{x})

= p = 1 \sum P k (\tilde{x}, x^{p}) ω_{p},

ω = [K (X, X) + σ_{P}^{2} I_{P}]^{- 1} y .

ω = [K (X, X) + σ_{P}^{2} I_{P}]^{- 1} y .

k_{M a} (x, x^{'}) = σ_{f}^{2} (1 + \frac{3 ∥ x - x ^{'} ∥ _{2}}{σ _{l}}) exp (- \frac{3 ∥ x - x ^{'} ∥ _{2}}{σ _{l}}) for x, x^{'} \in R^{d},

k_{M a} (x, x^{'}) = σ_{f}^{2} (1 + \frac{3 ∥ x - x ^{'} ∥ _{2}}{σ _{l}}) exp (- \frac{3 ∥ x - x ^{'} ∥ _{2}}{σ _{l}}) for x, x^{'} \in R^{d},

k_{S E} (x, x^{'}) = σ_{f}^{2} exp (- \frac{∥ x - x ^{'} ∥ _{2}^{2}}{2 σ _{l}^{2}}) for x, x^{'} \in R^{d} .

k_{S E} (x, x^{'}) = σ_{f}^{2} exp (- \frac{∥ x - x ^{'} ∥ _{2}^{2}}{2 σ _{l}^{2}}) for x, x^{'} \in R^{d} .

- \frac{1}{2} lo g (det (K (X, X) + σ_{P}^{2} I_{P})) - \frac{1}{2} y^{⊤} [K (X, X) + σ_{P}^{2} I_{P}]^{- 1} y .

- \frac{1}{2} lo g (det (K (X, X) + σ_{P}^{2} I_{P})) - \frac{1}{2} y^{⊤} [K (X, X) + σ_{P}^{2} I_{P}]^{- 1} y .

z_{i}^{q} = T σ_{i} Σ_{i} h^{q},

z_{i}^{q} = T σ_{i} Σ_{i} h^{q},

u (z) := Ψ (S_{0} exp ((r - η - \frac{1}{2} σ^{2}) T + z)) .

u (z) := Ψ (S_{0} exp ((r - η - \frac{1}{2} σ^{2}) T + z)) .

u^{GP R} (z) = q = 1 \sum Q k_{S E} (z^{q}, z) ω_{q},

u^{GP R} (z) = q = 1 \sum Q k_{S E} (z^{q}, z) ω_{q},

v^{E U} = e^{- r T} E_{0, S_{0}} [Ψ (S_{T})] \approx e^{- r T} q = 1 \sum Q ω_{q} σ_{f}^{2} σ_{l}^{d} \frac{e ^{- \frac{1}{2} (z^{q})^{⊤} (T \cdot Π + σ_{l}^{2} I_{d})^{- 1} (z^{q})}}{det ( T \cdot Π + σ _{l}^{2} I _{d} )}

v^{E U} = e^{- r T} E_{0, S_{0}} [Ψ (S_{T})] \approx e^{- r T} q = 1 \sum Q ω_{q} σ_{f}^{2} σ_{l}^{d} \frac{e ^{- \frac{1}{2} (z^{q})^{⊤} (T \cdot Π + σ_{l}^{2} I_{d})^{- 1} (z^{q})}}{det ( T \cdot Π + σ _{l}^{2} I _{d} )}

\tilde{S} = S_{0} exp ((r - η - \frac{1}{2} σ^{2}) \tilde{t} + \tilde{z}) .

\tilde{S} = S_{0} exp ((r - η - \frac{1}{2} σ^{2}) \tilde{t} + \tilde{z}) .

v^{E U} = e^{- r (T - \tilde{t})} E_{\tilde{t}, \tilde{S}} [Ψ (S_{T})] \approx e^{- r (T - \tilde{t})} q = 1 \sum Q ω_{q} σ_{f}^{2} σ_{l}^{d} \frac{e ^{- \frac{1}{2} (z^{q} - \tilde{z})^{⊤} ((T - \tilde{t}) \cdot Π + σ_{l}^{2} I_{d})^{- 1} (z^{q} - \tilde{z})}}{det ( ( T - t ~ ) \cdot Π + σ _{l}^{2} I _{d} )}

v^{E U} = e^{- r (T - \tilde{t})} E_{\tilde{t}, \tilde{S}} [Ψ (S_{T})] \approx e^{- r (T - \tilde{t})} q = 1 \sum Q ω_{q} σ_{f}^{2} σ_{l}^{d} \frac{e ^{- \frac{1}{2} (z^{q} - \tilde{z})^{⊤} ((T - \tilde{t}) \cdot Π + σ_{l}^{2} I_{d})^{- 1} (z^{q} - \tilde{z})}}{det ( ( T - t ~ ) \cdot Π + σ _{l}^{2} I _{d} )}

v^{B M} (t_{n}, x) = max (Ψ (x), E_{t_{n}, x} [e^{- r Δ t} v (t_{n + 1}, S_{t_{n + 1}})]) .

v^{B M} (t_{n}, x) = max (Ψ (x), E_{t_{n}, x} [e^{- r Δ t} v (t_{n + 1}, S_{t_{n + 1}})]) .

X^{n} = {x^{n, p} = (x_{1}^{n, p}, \dots, x_{d}^{n, p}), p = 1, \dots, P} \subset R^{d} .

X^{n} = {x^{n, p} = (x_{1}^{n, p}, \dots, x_{d}^{n, p}), p = 1, \dots, P} \subset R^{d} .

\tilde{X}_{p}^{n} = {\tilde{x}^{n, p, m} = (\tilde{x}_{1}^{n, p, m}, \dots, \tilde{x}_{d}^{n, p, m}), m = 1, \dots, M} \subset R^{d}

\tilde{X}_{p}^{n} = {\tilde{x}^{n, p, m} = (\tilde{x}_{1}^{n, p, m}, \dots, \tilde{x}_{d}^{n, p, m}), m = 1, \dots, M} \subset R^{d}

\tilde{x}_{i}^{n, p, m} = x_{i}^{n, p} e^{(r - η_{i} - \frac{1}{2} σ_{i}^{2}) Δ t + Δ t σ_{i} Σ_{i} G^{n, p, m}},

\tilde{x}_{i}^{n, p, m} = x_{i}^{n, p} e^{(r - η_{i} - \frac{1}{2} σ_{i}^{2}) Δ t + Δ t σ_{i} Σ_{i} G^{n, p, m}},

\overset{v}{^}^{B M} (t_{n}, x^{n, p}) = max (Ψ (x^{n, p}), \frac{e ^{- r Δ t}}{M} m = 1 \sum M v^{B M} (t_{n + 1}, \tilde{x}^{n, p, m})),

\overset{v}{^}^{B M} (t_{n}, x^{n, p}) = max (Ψ (x^{n, p}), \frac{e ^{- r Δ t}}{M} m = 1 \sum M v^{B M} (t_{n + 1}, \tilde{x}^{n, p, m})),

\overset{v}{^}^{B M} (t_{n - 1}, x^{n - 1, p}) = max (Ψ (x^{n - 1, p}), \frac{e ^{- r Δ t}}{M} m = 1 \sum M v_{n}^{B M, GP R} (\tilde{x}^{n - 1, p, m})) .

\overset{v}{^}^{B M} (t_{n - 1}, x^{n - 1, p}) = max (Ψ (x^{n - 1, p}), \frac{e ^{- r Δ t}}{M} m = 1 \sum M v_{n}^{B M, GP R} (\tilde{x}^{n - 1, p, m})) .

\overset{v}{^}^{B M} (0, S_{0}) = max (Ψ (S_{0}), \frac{e ^{- r Δ t}}{M} m = 1 \sum M \tilde{v}_{1}^{B M, GP R} (\tilde{x}^{0, m}))

\overset{v}{^}^{B M} (0, S_{0}) = max (Ψ (S_{0}), \frac{e ^{- r Δ t}}{M} m = 1 \sum M \tilde{v}_{1}^{B M, GP R} (\tilde{x}^{0, m}))

\tilde{x}_{i}^{0, m} = S_{0}^{i} e^{(r - η_{i} - \frac{1}{2} σ_{i}^{2}) Δ t + Δ t σ_{i} Σ_{i} G^{0, m}},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic processes and financial applications · Reservoir Engineering and Simulation Methods

Full text

Variance Reduction Applied to Machine Learning for Pricing Bermudan/American Options in High Dimension

Ludovic Goudenège Fédération de Mathématiques de CentraleSupélec - CNRS FR3487, France - [email protected]

Andrea Molent Dipartimento di Scienze Economiche e Statistiche, Università degli Studi di Udine, Italy - [email protected]

Antonino Zanette 111Corresponding author. Dipartimento di Scienze Economiche e Statistiche, Università degli Studi di Udine, Italy - [email protected]

Abstract

In this paper we propose an efficient method to compute the price of multi-asset American options, based on Machine Learning, Monte Carlo simulations and variance reduction technique. Specifically, the options we consider are written on a basket of assets, each of them following a Black-Scholes dynamics. In the wake of Ludkovski’s approach [33], we implement here a backward dynamic programming algorithm which considers a finite number of uniformly distributed exercise dates. On these dates, the option value is computed as the maximum between the exercise value and the continuation value, which is obtained by means of Gaussian process regression technique and Monte Carlo simulations. Such a method performs well for low dimension baskets but it is not accurate for very high dimension baskets. In order to improve the dimension range, we employ the European option price as a control variate, which allows us to treat very large baskets and moreover to reduce the variance of price estimators. Numerical tests show that the proposed algorithm is fast and reliable, and it can handle also American options on very large baskets of assets, overcoming the problem of the curse of dimensionality.

Keywords:

Finance; Gaussian process regression; Control variate; American options; Monte Carlo methods.

1 Introduction

In this paper we consider one of the most compelling problems among the still open issues in the field of computational finance: pricing and hedging American options in high dimension. From a practical point of view, the efficient numerical evaluation of American options which consider as underlying a baskets of $d$ assets is very challenging because of the so-called “curse of dimensionality”, which avoids the direct application of standard numerical schemes such as finite difference or tree methods. Specifically, this curse of dimensionality means that the computational cost and the memory requirement increase exponentially with the dimension of the problem.

Several new ideas have appeared in this research area, which can be divided into five groups. The first type of approach consists in employing a recombinant tree in order to obtain a discretization of the underlying diffusion. An example of this mode is given by the stochastic mesh method of Broadie and Glasserman [9], the quantization algorithms of Bally, et al. [4], the stochastic grid method of Jain and Oosterlee [22]. The second idea makes use of regression on a truncated basis of $L^{2}$ in order to compute the conditional expectations. This is done in Longstaff and Schwartz [32] and in Tsisiklis and Van Roy [41]. The third concept consists in exploiting the representation formulas for the conditional expectation using Malliavin calculus. This has been done by Lions and Reigner [31], Bouchard and Touzi [8], Bally et al. [3] Caramellino and Zanette [11] and Abbas-Turki and Lapeyre[1]. Another group of ideas relies on duality-based approaches for Bermudan option pricing, which are proposed by Rogers [38], Haugh and Kogan [21], Andersen and Broadie [2], Schoenmakers et al. [39] and Lelong [28], which can be used to construct bounds on the option value. Finally, the last group consists of methods that employ Machine Learning techniques to learn the continuation value or the stopping rules. This has been proposed by Becker et al. [7], Kohler et al. [26] and Ludkowski [33].

European prices can be used as control variate while pricing American options, as done, for example, by Bally et al. [3] and by Caramellino and Zanette [11]. Since multi-asset products are considered, efficiently computing European prices is not trivial and many authors developed valid methods in this field. Some of them focused on computing lower and upper bounds, such as Deelstra et al. [14], Carmona and Durrleman [12], Caldana et al. [10]. Other approaches for basket options are based on the approximation of the sum of the log-normal distributions with a simple distribution by matching some moments, as done by Levy [29], Milevsky and Posner [35, 36], Zhou and Wang [42], Korn and Zeytun [27]. Moreover, an approximation approach is also proposed by Li and Wu [30] for options on several mean-reverting assets. Recently, Glau et al. [17] and Glau et al. [18] consider Chebyshev based methods for pricing. Deep Learning techniques are nowadays widely used in solving large differential equations, which is intimately related to option pricing: recent progresses in this field have been achieved by Han et al. [20], E et al. [15] and Beck et al. [6]. Finally, efficient Monte Carlo approaches are developed by Jourdain and Lelong [23] and more recently by Bayer et al. [5].

In this paper, we propose a new method that combines Machine Learning, Monte Carlo simulations and variance reduction control variate technique. In particular, the use of a control variate makes the method more stable and extends its applicability range to high very large baskets. Moreover, the variance of price estimator is significantly reduced.

First of all, we implement a version of the Ludkovski’s algorithm [33]. Such an algorithm proceeds backward over time by computing the price function on a set of prearranged points which represents possible values of the underlying. In particular, at each time step, it uses a set of Monte Carlo simulations together with Gaussian Process Regression (GPR) to approximate the continuation value at these points. The option price is then obtained as the maximum between the continuation value and the intrinsic value of the option. We term such an algorithm GPR Monte Carlo (GPR-MC). The GPR-MC algorithm works very well for small baskets (in his paper, Ludkovski considers up to 5 dimensional basket), but it does not for large ones. In this paper, we show that, if one considers the European price as a control variate, the algorithm improves significantly and the variance of the price estimator is reduced. We term GPR Monte Carlo Control Variate (GPR-MC-CV) this new algorithm. Moreover, in order to compute the European prices, we suggest to use a semi-analytical formula, named GPR-EI formula, introduced by Goudenège et al. in [19], which proves to be efficient when many repeated computations of European prices have to be performed, or alternatively, Quasi-Monte Carlo simulations. Finally, we investigate the benefits brought by control variate technique to the GPR-Tree and GPR-EI approaches introduced by Goudenège et al. [19]. The paper is organized as follows. In Section 2 we present American options for the Black-Scholes $d$ -dimensional model. In Section 3 we briefly review Gaussian Process Regression, we present the GPR-EI formula, the GPR-MC method and the GPR-MC-CV method. Furthermore, we also investigate the use of control variate technique for the GPR-Tree and GPR-EI methods. In Section 4 we report some numerical results about pricing and variance reduction. Finally, Section 5 draws the conclusions.

2 American options in the multi-dimensional Black-Scholes model

An American option with maturity $T$ is a derivative instrument whose holder can exercise the intrinsic optionality at any moment, from inception up to maturity. Let $\mathbf{S}=(\mathbf{S}_{t})_{t\in[0,T]}$ denote the $d$ -dimensional underlying process. Such a stochastic process is assumed to randomly evolve according to the multidimensional Black-Scholes model: under the risk neutral measure, such a model is given by the following equation

[TABLE]

with $\mathbf{S}_{0}=\left(s_{0,1},\dots,s_{0,d}\right)^{\top}\in{\mathbb{R}}_{+}^{d}$ the spot price, $r$ the (constant) spot interest rate, $\boldsymbol{\eta}=(\eta_{1},\dots,\eta_{d})^{\top}$ the vector of (constant) dividend rates, $\boldsymbol{\sigma}=\left(\sigma_{1},\dots,\sigma_{d}\right)^{\top}$ the vector of (constant) volatilities, $\mathbf{W}$ a $d$ -dimensional correlated Brownian motion and $\rho_{ij}$ the instantaneous correlation coefficient between $W^{i}_{t}$ and $W^{j}_{t}.$ Moreover, let $\Psi(\mathbf{S}_{T})$ denote the cash-flow associated with the option. Thus, the price at time $t$ of an American option having maturity $T$ and payoff function $\Psi\,:\,{\mathbb{R}}_{+}^{d}\to{\mathbb{R}}$ is then

[TABLE]

where ${\cal T}_{t,T}$ stands for the set of all the stopping times taking values on $[t,T]$ and ${\mathbb{E}}_{t,\mathbf{x}}\left[\cdot\right]$ is the expectation given all the information at time $t$ and assuming $\mathbf{S}_{t}=\mathbf{x}$ .

For simulation purposes, the $d-$ dimensional Black-Scholes model can be written alternatively using the Cholesky decomposition. Specifically, for $i\in\{1,\dots,d\}$ we can write

[TABLE]

where $\mathbf{B}$ is a d-dimensional Brownian motion and $\Sigma_{i}$ is the $i$ -th row of the matrix $\Sigma$ defined as a square root of the correlation matrix $\Gamma$ , given by

[TABLE]

3 Machine Learning for American options in the multi-dimensional Black-Scholes model

3.1 Gaussian Process Regression

In this Section, we present a brief review of Gaussian Process Regression and for a comprehensive treatment we refer to Rasmussen and Williams [37].

Gaussian Process Regression (GPR), also known as Kriging (see Matheron [34], Journel and Huijbregts [24]), is a class of non-parametric kernel-based probabilistic models which represents the input data as the random observations of a Gaussian stochastic process. The most important advantage of this approach in relation to other parametric regression techniques is that it is possible to effectively exploit a complex dataset which may consist of points sampled randomly in a multidimensional space.

In general, a Gaussian process $\mathcal{G}$ is a collection of random variables defined on a common probability space ${\displaystyle(\Omega,\mathcal{F},P)}$ , any finite number of which have consistent joint Gaussian distributions. We are interested in Gaussian processes for which the random variables in $\mathcal{G}$ are indexed by a point $\mathbf{x\in}\mathbb{R}^{d}$ , $d\in\mathbb{N}$ . Therefore, for all $\mathbf{x}\in\mathbb{R}^{d}$ , $\mathcal{G}\left(\mathbf{x}\right):\Omega\rightarrow\mathbb{R}$ is a Gaussian random variable and if $X$ = $\left\{\mathbf{x}_{p},p=1,\dots,P\right\}\subset\mathbb{R}^{d}$ then $\left(\mathcal{G}\left(\mathbf{x}_{1}\right),\dots,\mathcal{G}\left(\mathbf{x}_{P}\right)\right)^{\top}$ is a random Gaussian vector. Moreover, a Gaussian process is fully specified by its mean function $\mu\left(\mathbf{x}\right):\mathbb{R}^{d}\rightarrow\mathbb{R}$ (which is usually assumed to be zero) and by its covariance function $k\left(\mathbf{x},\mathbf{x}^{\prime}\right):\mathbb{R}^{d}\times\mathbb{R}^{d}\rightarrow\mathbb{R}$ .

Now, let us consider a training set $\mathcal{D}$ of $P$ observations (the input data), $\mathcal{D}=\left\{\left(\mathbf{x}_{p},y_{p}\right),p=1,\dots,P\right\}$ where $X=\left\{\mathbf{x}_{p},p=1,\dots,P\right\}\subset\mathbb{R}^{d}$ denotes the set of input vectors and $Y=\left\{y_{p},p=1,\dots,P\right\}\subset\mathbb{R}$ denotes the set of scalar outputs. These observations are modeled as the realization of the sum of a Gaussian process and a noise source. Specifically,

[TABLE]

where $\left\{f_{p}=\mathcal{G}\left(\mathbf{x}_{p}\right),p=1,\dots,P\right\}$ is a Gaussian process and $\left\{\varepsilon_{p},p=1,\dots,P\right\}$ are i.i.d. random variables such that $\varepsilon_{p}\sim\mathcal{N}\left(0,\sigma_{P}^{2}\right)$ . Moreover, the distribution of $\mathbf{f}=\left(f_{1}\dots f_{P}\right)^{\top}$ is assumed to be given by

[TABLE]

where $K\left(X,X\right)$ is a $P\times P$ matrix with $K\left(X,X\right)_{p_{1},p_{2}}=k\left(\mathbf{x}_{p_{1}},\mathbf{x}_{p_{2}}\right)$ for $p_{1},p_{2}=1,\dots,P$ with $k:\mathbb{R}^{d}\times\mathbb{R}^{d}\rightarrow\mathbb{R}$ the so called kernel function. Thus

[TABLE]

where $I_{P}$ is the $P\times P$ identity matrix.

Now, in addition, let us consider a test set $\tilde{X}$ of $M$ points $\left\{\tilde{\mathbf{x}}_{m},m=1,\dots,M\right\}$ . The realizations $\tilde{f}_{m}=\mathcal{G}\left(\tilde{\mathbf{x}}_{m}\right)$ are not known but rather we want to estimate them by exploiting the observed realizations of $\mathcal{G}$ in $\mathcal{D}$ . The a priori joint distribution of $\mathbf{y}$ and $\mathbf{\tilde{f}}=\left(\tilde{f}_{1},\dots,\tilde{f}_{M}\right)^{\top}$ is given by

[TABLE]

where $K\left(\tilde{X},\tilde{X}\right)$ is a $M\times M$ matrix given by $K\left(\tilde{X},\tilde{X}\right)_{m_{1},m_{2}}=k\left(\mathbf{\tilde{x}}_{m_{1}},\mathbf{\tilde{x}}_{m_{2}}\right)$ for $m_{1},m_{2}=1,\dots,M$ , $K\left(X,\tilde{X}\right)$ is a $P\times M$ matrix given by $K\left(X,\tilde{X}\right)_{p,m}=k\left(\mathbf{x}_{p},\mathbf{\tilde{x}}_{m}\right)$ for $p=1,\dots,P$ , $m=1,\dots,M$ and $K\left(\tilde{X},X\right)$ is a $M\times P$ matrix given by $K\left(\tilde{X},X\right)_{m,p}=k\left(\mathbf{\tilde{x}}_{m},\mathbf{x}_{p}\right)$ for $m=1,\dots,M$ , $p=1,\dots,P$ .

Since we know the values for the training set, we can consider the conditional distribution of $\mathbf{\tilde{f}}$ given $\mathbf{y}$ . It is possible to prove that $\mathbf{\tilde{f}}|\tilde{X},\mathbf{y},X$ follows a Gaussian distribution given by

[TABLE]

where

[TABLE]

and

[TABLE]

Therefore, a natural choice consists in predicting the values $\mathbf{\tilde{f}}$ through $\mathbb{E}\left[\mathbf{\tilde{f}}|\tilde{X},\mathbf{y},X\right]$ . Moreover, by using equation (3.6), one can define a function $f^{GPR}:\mathbb{R}^{d}\rightarrow\mathbb{R}$ that approximates the function $\mathbf{x}_{p}\mapsto y_{p}$ by setting

[TABLE]

where $\boldsymbol{\omega}=\left(\omega_{1},\dots,\omega_{1}\right)^{\top}$ is a vector of weights determined by

[TABLE]

The computation in (3.6) requires the knowledge of the covariance function $K$ and of the noise variance $\sigma_{P}^{2}$ . A commonly used covariance function is the Matern 3/2 kernel $k_{Ma}:\mathbb{R}^{d}\times\mathbb{R}^{d}\rightarrow\mathbb{R}$ , which is given by

[TABLE]

where $\sigma_{f}^{2}$ is called the signal variance and $\sigma_{l}$ is called the length-scale. Another possible choice is the Squared Exponential kernel $k_{SE}:\mathbb{R}^{d}\times\mathbb{R}^{d}\rightarrow\mathbb{R}$ , which is given by

[TABLE]

In general, the choice of kernel function is performed by using a log-likelihood criterion. The parameters $\sigma_{f}^{2}$ , $\sigma_{l}$ of the kernel function and $\sigma_{P}^{2}$ of the noise are called hyperparameters and need to be estimated. A common approach is to consider the maximum likelihood estimates which can be obtained by maximizing the log-likelihood function of the training data, that is by maximizing the following function:

[TABLE]

The development of the GPR model can be divided in the training step and the evaluation step (also called testing step). The training step only requires the knowledge of the training set $\mathcal{D}$ and it consists in estimating the hyperparameters and computing the vector of weights $\boldsymbol{\omega}$ . The evaluation step can be computed only after the training step has been accomplished and it consists in obtaining the predictions via the computation of $K\left(\tilde{X},X\right)\boldsymbol{\omega}$ . We stress out that the training step is independent of the test set $\tilde{X}$ . Thus one can store the values computed during the training step and perform the evaluation step many times with a small computational cost, which is $\mathcal{O}\left(P\cdot M\right)$ .

Remark 1.

We observe that the computation time depends only marginally on the size $d$ of the space where the points lie, as the value of $d$ only impacts in the time taken to calculate distances between the points which appears in the covariance matrix K, that is $\left\|\mathbf{x}-\mathbf{x}^{\prime}\right\|_{2}$ .

3.2 Machine Learning Exact Integration for European options

In order to improve the GPR-MC approach, we employ the European option price as a control variate. Here, we propose to compute such a price by means of the semi-analytical formula introduced by Goudenège et al. [19], that we term GPR-EI formula. This computation is based on two steps. First of all, the payoff function is approximated by means of GPR. Then, the European price is computed as the discounted expected value of the final cash flow, that is a multidimensional integral of the payoff function with respect to the log-underlying process density. Such an integral can be computed by means of a closed formula when replacing the true payoff function with its GPR approximation.

Let us consider a set $Z=\left\{\mathbf{z}^{q},q=1,\dots,Q\right\}$ consisting of $Q$ points in $\mathbb{R}^{d}$ quasi-randomly distributed according to the law of the vector $\left(\sigma_{1}W_{T}^{1},\dots,\sigma_{d}W_{T}^{d}\right)^{\top}$ . In particular, we define

[TABLE]

where $\Sigma_{i}$ is i-th row of the matrix $\Sigma$ and $\mathbf{h}^{q}$ is the q-th point of the Halton sequence in $\mathbb{R}^{d}$ (other low-discrepancy sequence can be considered, such as Solob’s or Faure’s ones). Let $u:Z\rightarrow\mathbb{R}$ be the function defined by

[TABLE]

In a nutshell, the main idea is to approximate the function $u$ by training the GPR method on the set $Z$ . In particular, we employ the Squared Exponential kernel defined in (3.12). Equation (3.9) allows one to approximate the function $u\left(\cdot\right)$ by

[TABLE]

where $\omega_{1},\dots,\omega_{P}$ are weights. The continuation value can be computed by integrating the function $u^{GPR}$ against a $d$ -dimensional probability density. The use of the Squared Exponential kernel allows one to easily perform such a calculation by means of a closed formula. Specifically, the GPR-EI method relies on the following Proposition.

Proposition 1.

Let us consider an European option with payoff function $\Psi$ , inception $t=0$ , maturity $T$ , and multidimensional underlying following the dynamics in (2.1) with spot price $\mathbf{S}_{0}$ . The price of such an option at $t=0$ can be approximated by

[TABLE]

where $\sigma_{f}$ , $\sigma_{l}$ , and $\omega_{1},\dots,\omega_{Q}$ are certain constants determined by the GPR approximation of the function $\mathbf{z}\mapsto u\left(\mathbf{z}\right)$ considering $Z$ as the predictor set, and $\Pi=\left(\Pi_{i,j}\right)$ is the $d\times d$ covariance matrix of the vector $\left(\sigma_{1}W_{T}^{1},\dots,\sigma_{d}W_{T}^{d}\right)^{\top}$ , that is $\Pi_{i,j}=\rho_{i,j}\sigma_{i}\sigma_{j}$ .

The proof of this Proposition is very similar to the one reported in [19].

Despite the GPR-EI formula (3.17) is adapted to compute the option price supposing the spot price to be $\mathbf{S}_{0}$ and the time to maturity to be $T$ , it works quite well also for spots close to $\mathbf{S}_{0}$ and time to maturity smaller than $T$ . The following Proposition states how to do that.

Proposition 2.

Let us consider and European option with payoff function $\Psi$ , inception $0<\tilde{t}<T$ , maturity $T$ , and multidimensional underlying following the dynamics in (2.1). Let $\tilde{\mathbf{S}}$ be the vector of the spot prices at time $\tilde{t}$ and define $\tilde{\mathbf{z}}\in\mathbb{R}^{d}$ such that

[TABLE]

The price of such an option at $\tilde{t}$ can be approximated by

[TABLE]

where $\sigma_{f}$ , $\sigma_{l}$ , and $\omega_{1},\dots,\omega_{Q}$ and $\Pi=\left(\Pi_{i,j}\right)$ are defined according to Proposition 1.

The proof of Proposition 2 derives directly from Proposition 1 by considering $\tilde{\mathbf{S}}$ in place of $\mathbf{S}_{0}$ . The hyperparameters $\sigma_{f}$ , $\sigma_{l}$ , and the weights $\omega_{1},\dots,\omega_{Q}$ need to be computed only once and then we can use formulas (3.17) and (3.19) to compute the European prices. The resolution of the linear systems within the exponential factors and the computation of the matrix determinant in (3.17) and (3.19) can be done quite fast by computing the Cholesky decomposition of the matrices $(T-\tilde{t})\cdot\Pi+\sigma_{l}^{2}I_{d}$ for each of the few possible values of $t$ , that is $t=0,t_{1},\dots,t_{N-1}$ . For this reason, turns out to be faster than repeated Monte Carlo simulations to compute the many European prices to be used as control variate.

3.3 Machine Learning Control Variate algorithm for American options

3.3.1 The GPR Monte Carlo Method

Let us introduce the GPR Monte Carlo approach. We approximate the price of an American option with the price of a Bermudan option on the same basket. Specifically, let $N$ be the number of time steps and $\Delta t=T/N$ the time increment. The discrete exercise dates are $t_{n}=n\,\Delta t$ , as $n=1,\ldots,N$ . If $\mathbf{x}$ represents the vector of the underlying prices at the exercise date $t_{n}$ , then the price of the Bermudan option $v^{BM}$ is given by

[TABLE]

First of all, by knowing the function $v^{BM}\left(t_{n+1},\cdot\right)$ , one can compute $v^{BM}\left(t_{n},\cdot\right)$ by approximation of the expectation in (3.20). In order to do that, we consider a set $X^{n}$ of $P$ points whose coordinates represent certain possible values for the underlyings at time $t_{n}$ :

[TABLE]

Suppose now we want to compute $v^{BM}\left(t_{n},\cdot\right)$ but only for $\mathbf{x}^{n,p}\in X^{n}$ . This goal can be achieved by means of a one step Monte Carlo simulation. In particular, for each $\mathbf{x}^{n,p}\in X^{n}$ , we simulate a set of points $\tilde{X}^{n}_{p}$

[TABLE]

of $M$ possible values for $\mathbf{S}_{t_{n+1}}$ according to the law of $\mathbf{S}_{t_{n+1}}\left|\mathbf{S}_{t_{n}}=\mathbf{x}\right.$ . In particular, for $i=1,\dots,d$ , $n=1,\dots,N$ , $p=1,\dots,P$ , $m=1,\dots,M$ , we define

[TABLE]

where $\mathbf{G}^{n,p,m}\sim\mathcal{N}\left(0,I_{d}\right)$ is a standard Gaussian random vector and $\Sigma_{i}$ is the $i$ -th row of the matrix $\Sigma$ , just as in (2.3). Then, the option price can be approximated for each $\mathbf{x}^{n,p}\in X^{n}$ by

[TABLE]

if the quantities $v^{BM}\left(t_{n+1},\mathbf{\tilde{x}}^{n,p,m}\right)$ are known for all of these simulated points $\mathbf{\tilde{x}}^{n,p,m}$ . If we proceed backward, the function ${v}^{BM}\left(t,\cdot\right)$ is known for $t=T$ since it is equal to the payoff function $\Psi\left(\cdot\right)$ and thanks to (3.24) it is known, through an approximation, also for $t=t_{N-1}$ and $\mathbf{x}^{N-1,p}\in X^{N-1}$ . In order to assess ${v}^{BM}\left(t_{N-2},\mathbf{x}^{N-2,p}\right)$ for all $\mathbf{x}^{N-2,p}\in X^{N-2}$ , and thus going on up to $t=0$ , it is necessary to evaluate the function ${v}^{BM}\left(t_{N-2},\cdot\right)$ for all the points in $\tilde{X}^{N-2}=\bigcup_{p=1}^{P}\tilde{X}^{N-2,p}$ . This cannot be done directly since we know $\hat{v}^{BM}\left(t_{N-1},\cdot\right)$ only for the points in $X^{N-1}$ and not for all those in $\tilde{X}^{N-2}$ . To overcome this issue, we compute the approximation of the function $\hat{v}^{BM}\left(t_{N-1},\cdot\right)$ by means of the GPR technique. In particular the set $X^{N-1}$ serves as the predictor set and $\left\{\hat{v}^{BM}\left(t_{N-1},\mathbf{x}^{N-1,p}\right),p=1,\dots,P\right\}$ as the response set.

More generally, let ${v}^{BM,GPR}_{n}\left(\cdot\right)$ be the GPR approximation of $\hat{v}^{BM}\left(t_{n},\cdot\right)$ trained by considering $X^{n}$ as the predictor set and $\left\{\hat{v}^{BM}\left(t_{n},\mathbf{x}^{n,p}\right),p=1,\dots,P\right\}$ as the response set, where $\hat{v}^{BM}$ is defined as in (3.24). Then, we can proceed backward by computing

[TABLE]

and by computing $v^{BM,GPR}_{n-1}$ , that is the GPR approximation of $\hat{v}^{BM}\left(t_{n-1},\cdot\right)$ . Finally, the option price at time $t=0$ is computed through

[TABLE]

where the points $\mathbf{\tilde{x}}^{0,1},\dots,\mathbf{\tilde{x}}^{0,M}$ are random simulations of $\mathbf{S}_{t_{1}}$ given by

[TABLE]

where $\mathbf{G}^{0,m}\sim\mathcal{N}\left(0,I_{d}\right)$ is a standard Gaussian random vector for any $m\in\left\{1,\dots,M\right\}$ .

The choice of the sets $X^{n},n=1\dots,N-1$ is a sensitive question. Similarly to what proposed by Ludkovski [33], here we use a deterministic space-filling sequence based on the Halton sequence. Specifically, let $\mathbf{h}^{p}$ be the $p$ -th point of the Halton quasi-random sequence in $\mathbb{R}^{d}$ and $\Phi^{-1}$ the inverse cumulative distribution of a standard normal distribution. We define the points $\mathbf{x}^{n,p}$ as follows:

[TABLE]

for $i=1,\dots,d$ , $n=1,\dots,N-1$ , and $p=1,\dots,P$ . This choice for the sets $X^{n}$ proves to be the most effective, since the points used to train the GPR algorithm at time $t_{n}$ are sampled according to the density function of the process $\mathbf{S}_{t_{n}}$ .

3.3.2 The GPR Monte Carlo Control Variate Method

Let us present the GPR Monte Carlo Control Variate method (GPR-MC-CV), that is our proposed algorithm.

The control variate technique is commonly used to reduce the variance of Monte Carlo estimators, but it can also give its contribution in American pricing. Following Bally et al. [3] and Caramellino and Zanette [11], we employ the European price as a control variate for the American price. Let us consider an American and an European option with the same payoff function $\Psi$ and maturity $T$ , and let $v^{AM},v^{EU}$ denote their prices respectively. For a fixed time $t$ and underlying stocks $\mathbf{x}$ , we define the American-European price gap as:

[TABLE]

Then

[TABLE]

and it is straightforward to see that

[TABLE]

where $\mathcal{T}_{t,T}$ stands for the set of all stopping times taking values in $\left[t,T\right]$ and $\hat{\Psi}$ is defined by

[TABLE]

We stress out that $\hat{\Psi}\left(T,\mathbf{x}\right)=0$ and the function $\hat{\Psi}$ depends on the time variable also. Therefore, in order to numerically evaluate $v^{AM}\left(0,\mathbf{S}_{0}\right)$ , one can arrange a dynamic programming principle, based on Bermudan approximation, actually equal to the one in Section 3.3.1 by replacing $\Psi$ with $\hat{\Psi}$ . Once the initial price gap $v\left(0,\mathbf{S}_{0}\right)$ has been calculated, one can retrieve the American price by computing

[TABLE]

The sketch of the GPR-MC-CV algorithm is presented here.

Preprocessing: compute $\mathbf{x}^{n,p}$ and $\mathbf{\tilde{x}}^{n,p,m}$ by using equations (3.28) and (3.23),

$v^{EU}\left(t_{n},\mathbf{x}^{n,p}\right)$ and $\hat{\Psi}\left(t_{n},\mathbf{x}^{n,p}\right)$ by using (3.17), (3.19) and (3.32)

Step $N-1$ : shaping of $v^{GPR}_{N-1}\left(\cdot\right)$ :

$\hookrightarrow$ For $p=1,\dots,P$ compute $\hat{v}\left(t_{N-1},\mathbf{x}^{N-1,p}\right)=\hat{\Psi}\left(\mathbf{x}^{N-1,p}\right)$

$\hookrightarrow$ Define the training set $\mathcal{D}=\left\{\left(\mathbf{x}^{p},\hat{v}\left(t_{N-1},\mathbf{x}^{N-1,p}\right)\right),p=1,\dots,P\right\}$

$\hookrightarrow$ Train GPR on $\mathcal{D}$ to obtain $v^{GPR}_{N-1}\left(\cdot\right)$

Step $N-2$ : shaping of $v^{GPR}_{N-2}\left(\cdot\right)$ :

$\hookrightarrow$ For $p=1,\dots,P$ compute

$\hat{v}\left(t_{N-2},\mathbf{x}^{N-2,p}\right)=\max\left(\hat{\Psi}\left(\mathbf{x}^{N-2,p}\right),\frac{e^{-r\Delta t}}{M}\sum_{m=1}^{M}v^{GPR}_{N-1}\left(\mathbf{\tilde{x}}^{N-2,p,m}\right)\right)$

$\hookrightarrow$ Define the training set $\mathcal{D}=\left\{\left(\mathbf{x}^{p},\hat{v}\left(t_{N-2},\mathbf{x}^{p}\right)\right),p=1,\dots,P\right\}$

$\hookrightarrow$ Train GPR on $\mathcal{D}$ to obtain $v^{GPR}_{N-2}\left(\cdot\right)$

$\quad\begin{array}[]{l}\vdots\end{array}$ $\leftarrow$ Steps $n=N-3,\ldots,1$ $\left[\begin{array}[]{l}\mbox{replace$ N-2 $with$ n $and$ N-1 $with$ n+1 $; }\\ \end{array}\right]$

Step [math]: computation of the price:

$\hat{v}\left(0,\mathbf{S}_{0}\right)=\max\left(\Psi\left(\mathbf{S}_{0}\right),\frac{e^{-r\Delta t}}{M}\sum_{m=1}^{M}v^{GPR}_{1}\left(\mathbf{\tilde{x}}^{0,m}\right)\right)$

$v^{BM}\left(0,\mathbf{S}_{0}\right)=\hat{v}\left(0,\mathbf{S}_{0}\right)+v^{EU}\left(0,\mathbf{S}_{0}\right)$

Remark 2.

We remark that when using a quasi-Monte Carlo sequence it is important to consider leaping. This technique consists in considering only some uniformly subsampled points of the original sequence, which improves convergence. However, the leap values, must be chosen with care. In fact, many values lead to sequences that do not touch on large sub-hyper-rectangles of the unit hypercube, failing to be a uniform quasi-random point set (see Kocis and Whiten [25]). A common rule for choosing the leap values for the Halton sequence consists in setting the value to $q-1$ , where $q$ is a prime number that has not been used to generate the sequence.

Remark 3.

We observe that the Monte Carlo evaluation of the continuation value can be easily parallelized since the summations in (3.25), are independent of each other and can be calculated separately. Thus, this feature allows one to significantly reduce the computational time.

Remark 4.

As observed by Ludkovsi [33], the main computational cost is due to the training of the GPR model, which is proportional to the cube of the observation amount. In our case, this training has to be performed one time to compute the European prices with a cost $\mathcal{O}\left(Q^{3}\right)$ (with $Q$ the number of points employed in European price computation), and $N-2$ times within the algorithm to approximate the American-European gap at a give time, thus $\mathcal{O}\left(N\cdot P^{3}\right)$ (with $N$ the number of time steps and $P$ the number of points used to train the GPR models at each time step). On the other hand, the cost of the Monte Carlo step depends on both the number of evaluations to be performed and to the number of points employed: the cost for such a step is $\mathcal{O}(N\cdot P\cdot M)$ (with $M$ the number of Monte Carlo simulations employed in estimating the continuation gap value). Finally, we observe that if we compute the European prices by using $M^{\prime}$ Monte Carlo simulations instead of by using the GPR-EI formula, then the cost $\mathcal{O}\left(Q^{3}\right)$ is replaced by $\mathcal{O}(N\cdot P\cdot M^{\prime})$ .

3.3.3 The Control Variate for GPR-Tree and GRP-EI

Although the control variable technique was initially conceived as a variance reduction techniques for Monte Carlo methods, it can also be a valid support in other contexts. We investigate the benefits brought by this technique to the GPR-Tree and GPR-EI techniques introduced by Goudenège et al. [19] for pricing American options in high dimension. In particular, as proposed for the GPR-MC method, we use the European price as a control variate and we employ GPR-Tree (or GPR-EI) to compute the American-European price gap. Let us give a brief introduction of these two numerical approaches. We refer the interested reader to [19] for more details.

The GPR-Tree method is similar to the GPR-MC method here proposed. The main difference consists in the use of a tree step in place of random simulations to compute the continuation value. In particular, for each time step $t_{n}$ and for each point $\mathbf{x}^{p}$ , $2^{d}$ future values are generated according to the tree method proposed by Ekvall [16], in place of Monte Carlo simulations. Such a method is particularly efficient when the dimension $d$ is low (that is, indicatively, it does not exceed 10).

The GPR-EI method differs from both the GPR-MC and GPR-Tree methods for three reasons. First of all, the predictors employed in the GPR step are related to the logarithms of the underlying value. Then, the continuation value at these points is computed through a closed formula which comes from an exact integration. Finally, the GPR-EI method employs the Squared Exponential kernel, which is given by

[TABLE]

for $\mathbf{x},\mathbf{x}^{\prime}\in\mathbb{R}^{d}$ , where $d$ is the dimension of the regression problem.

4 Numerical Results

In this Section we report some numerical results in order to investigate the effectiveness of the proposed Machine Learning algorithm for pricing American options in the multi-dimensional Black-Scholes model.

First of all, we compare the GPR-MC and GPR-MC-CV methods considering Geometric and Arithmetic basket put options and then we focus on a Call on the Maximum option. Moreover, we study the benefits of using the control variable also for GPR-Tree and GPR-EI methods. Finally, we investigate the variance of the price estimators about the two methods. We stress out that the GPR-Tree method is interesting only for low dimension options: when $d$ exceeds $10$ , the method still works, but computational times grow exponentially.

4.1 Geometric and Arithmetic Basket Put Options

In this test we focus on two payoff that depend on the mean of the underlyings. Specifically, we consider the following payoff examples:

•

Geometric basket Put

[TABLE]

•

Arithmetic basket Put

[TABLE]

We consider both the GPR-MC and the GPR-MC-CV method in order to investigates the benefits induced by the control variate technique. We consider the same parameters as in [19]: $T=1$ , $S_{i}=100$ , $K=100$ , $r=0.05$ , equal (null) dividend rates $\eta_{i}=0.0$ , equal volatilities $\sigma_{i}=0.2$ , equal correlations $\rho_{ij}=0.2$ and $N=10$ exercise dates. Moreover, we consider $P=250,500$ or $1000$ points, $M=10^{3},10^{4}$ or $10^{5}$ Monte Carlo simulations and $Q=10000$ points for the computation of the European prices with the GPR-EI formula. As opposed to the other input parameters, we vary the dimension $d$ , considering $d=2,\,5,\,10,\,20,\,40$ and $100$ . The algorithm has been implemented in MATLAB and computations have been preformed on a server which employs a $2.40$ GHz Intel*®* Xenon*®* processor (Gold 6148, Skylake) and 20 GB of RAM.

We present now the numerical results for the two payoff examples. First of all, let us present the European results, obtained by means of the GPR-EI formula. Table 1 reports the prices, changing the dimension $d$ and the number of employed points $Q$ . Moreover, we also report a Benchmark price computed by Monte Carlo simulation considering $10^{6}$ samples ( $95\%$ confidence intervals are $\pm 0.01$ for all the benchmark values.). As we can see with only $1000$ points we can obtain accurate results in any considered dimension.

Let us now focus on the American results. As far as the Geometric basket Put is considered, it is possible to reduce the problem of pricing in the $d$ -dimensional model to a one dimensional American Put option in the Black-Scholes model with opportune parameters. The price of such a one dimensional American option can be computed in a easy way, for example by using the CRR algorithm with $1000$ steps (see Cox et al. [13]). Therefore in this case we have a reliable benchmark to test the algorithm. Moreover, when $d$ is smaller than $10$ we can also compute the price by means of a multi-dimensional binomial tree (see Ekvall et al. [16]). In particular, the number of steps employed for the binomial tree is equal to $200$ when $d=2$ and to $50$ when $d=5$ . For values of $d$ larger than $5$ , prices cannot be approximated via such a tree, because the memory required for the calculations would be too large. Results are reported in Tables 3 and 3. We observe that both the two algorithms are very accurate in low dimension, despite we are approximating an American option with a Bermudan one. When larger baskets are considered, say $d\geq 40$ , the prices obtained with the GPR-MC are less accurate and less stable while changing the number of points $P$ and the number of Monte Carlo simulations $M$ . The computer processing time of the GPR-MC-CV method are a little higher than those of the GPR-MC because European prices need to be computed.

We also stress out that the computer processing time increase little with the size of the problem. This is due to the fact that the dimension affects significantly only the computational time of the Monte Carlo step while the GPR step is only minimally distressed (see Remark 1).

Table 5 and 5 report the results for the GPR-Tree and GPR-EI methods employing or not the control variate technique. By comparing the results of the two Tables, we observe that the option prices for $d\leq 10$ are very similar: in this case variate control technique is not crucial to improve convergence. As opposed to that, GPR-EI benefits sensitively from control variate technique when high values of $d$ are considered.

As opposed to the Geometric basket Put option, we have no method to obtain a fully reliable benchmark when dealing with an Arithmetic basket Put option. However, for small values of $d$ , a reference price can be obtained by means of a multidimensional tree method (see Ekvall et al. [16]), just as shown for the Geometric case. Results are reported in Tables 7 and 7. The conclusions that we can draw in this case are similar to those for the Geometric case: both the two methods are accurate in low dimension, while the control variate method is more effective in high dimension.

Table 9 and 9 report the results for the GPR-Tree and GPR-EI methods employing or not the control variate technique. Just as for the Geometric put option we observe that the option prices for $d\leq 10$ are very similar: in this case variate control technique is not crucial to improve convergence. As opposed to that, control variate technique has an impact on GPR-EI results when high values of $d$ are considered. Anyway, in this case, due to the lack of a benchmark price, it is difficult to draw clear cut conclusions.

4.2 Call on the Maximum option

Let us consider a Call on the Maximum of $d$ -assets American option, whose payoff is given by:

[TABLE]

The Call on the Maximum setting is particularly interesting for investigating scalability of our approaches in the dimension $d$ of the problem. As observed by Ludkovski [33], as opposed to basket Put options, the stopping region of a Call on the Maximum consists of several disconnected pieces and this makes the pricing problem particularly challenging. As done in the previous Section, we consider both the GPR-MC and the GPR-MC-CV method in order to investigates the benefits induced by the use of this technique. We consider the same parameters as those employed by Becker et al. [7]: $T=3$ , $S_{i}=100$ , $K=100$ , $r=0.05$ , equal dividend rates $\eta_{i}=0.1$ , equal volatilities $\sigma_{i}=0.2$ , equal (null) correlations $\rho_{ij}=0.0$ and $N=9$ exercise dates. Moreover, we consider $P=250,500$ or $1000$ points, $M=10^{3},10^{4}$ or $10^{5}$ Monte Carlo simulations. As opposed to the other input parameters, we vary the dimension $d$ , considering $d=2,\,5,\,10,\,20,\,30,\,50$ and $100$ . In this particular case, because of the long maturity and unbounded payoff, the GPR-EI formula is not very accurate when considering high dimension and initial points far from the spot $\mathbf{S}_{0}$ , and so we prefer computing the European price by means of Quasi-Monte Carlo simulation with $10^{6}$ random simulations.

First of all, let us present the European results, obtained by means of the GPR-EI formula. Table 10 reports the prices, changing the dimension $d$ and the number of employed points $Q$ . Moreover, we also report a Benchmark price computed by Monte Carlo simulation considering $10^{6}$ samples ( $95\%$ confidence intervals are $\pm 0.01$ for all the benchmark values).

The aforementioned testing set has also been considered by Becker et al. [7] and therefore we report their results as reference prices. Furthermore, for small values of $d$ , we can approximate the price obtained by means of a multidimensional tree method. Results, which are reported in Tables 12 and 12, are quite meaningful. Both the two methods perform fine in low dimension, but when large baskets are considered outcomes are strongly different. As far as this particular dataset is considered, the GPR-MC approach gives several null results and others very high, which means that the GPR regression is not able to extrapolate the price surface correctly. In particular, this happens when $d\geq 50$ . Increasing the number $P$ of points fixes things when $d=50$ and for $P=1000$ results are likely, although outside the confidence interval proposed by Becker et al. [7]. Anyway, when $d=100$ we always obtain null value, showing all the limits of the GPR-MC approach. As opposed to the GPR-MC, the GPR-MC-CV method performs very well for all the considered dimensions and almost all the values obtained with $P=1000$ and $M=10^{5}$ are within the confidence intervals proposed by Becker et al. [7].

Finally, Tables 14 and 14 report the results for the GPR-Tree and the GPR-EI method obtained by using or not the control variate technique. These two methods seems to be not very effective for the particular Bermudan option considered here. As far as the GPR-EI method is concerned, variates control technique improves the results. Such a improvement is not evident with respect to the GPR-Tree method.

4.3 Variance Reduction

We conclude our numerical investigations by showing the effect of introducing a control variate on the variance of the estimated prices. In particular, we consider the same Geometric Put option as in Section 4.1 and we price the same option $100$ different times, changing the seed of the Monte Carlo generator. This allows us to estimate the variance of the price estimator and to make comparisons. Results are available in Tables 16 and 16, that report the estimated standard deviations and their $95\%$ confidence intervals, computed according to the method suggested by Sheskin [40]. It is evident that the the standard deviation (and thus the variance) of the prices obtained with the GPR-MC-CV method is several time lower than the one computed with the GPR-MC method. This is also confirmed for all the considered combination of $P$ and $M$ , by the Hartley’s $F_{max}$ test (see Sheskin [40]) with a $99\%$ confidence level.

5 Conclusions

In this paper we have proposed a new approach to price American options on baskets of assets, each of them following a Black-Scholes dynamics. The method employs Machine Learning technique, Monte Carlo method and variance reduction technique that exploits the European option price as a control variate. The European prices are computed by means of a semy-analitical formula or Quasi-Monte Carlo simulations. Numerical results show that the method is reliable and fast for baskets including up to $100$ assets. The use of a control variate improves the algorithm accuracy and reduces the variance of the estimated prices. In certain cases, also the GPR-Tree and GRP-EI methods benefit from the use of a control variate. The computation time is small and shortly growing with respect to the dimension of the basket. Moreover, the algorithm is partially parallelizable and therefore the computing time can be significantly reduced. Machine Learning seems to be a very promising tool for American option pricing in high dimension, overcoming the problem of the curse of dimensionality.

Bibliography42

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Abbas-Turki, L. A., and Lapeyre, B. American options by Malliavin calculus and nonparametric variance and bias reduction methods. SIAM Journal on Financial Mathematics 3 , 1 (2012), 479–510.
2[2] Andersen, L., and Broadie, M. Primal-dual simulation algorithm for pricing multidimensional American options. Management Science 50 , 9 (2004), 1222–1234.
3[3] Bally, V., Caramellino, L., and Zanette, A. Pricing and hedging American options by Monte Carlo methods using a Malliavin calculus approach. Monte Carlo Methods and Applications 11 , 2 (2005), 97–133.
4[4] Bally, V., Pagès, G., and Printems, J. First-order schemes in the numerical quantization method. Mathematical Finance 13 , 1 (2003), 1–16.
5[5] Bayer, C., Siebenmorgen, M., and Tempone, R. Smoothing the payoff for efficient computation of basket option prices. Quantitative Finance 18 , 3 (2018), 491–505.
6[6] Beck, C., E, W., and Jentzen, A. Machine Learning approximation algorithms for high-dimensional fully nonlinear partial differential equations and second-order backward stochastic differential equations. Journal of Nonlinear Science 29 , 4 (2019), 1563–1619.
7[7] Becker, S., Cheridito, P., and Jentzen, A. Deep optimal stopping. Journal of Machine Learning Research 20 , 74 (2019), 1–25.
8[8] Bouchard, B., and Touzi, N. Discrete-time approximation and Monte-Carlo simulation of backward stochastic differential equations. Stochastic Processes and their Applications 111 , 2 (2004), 175–206.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Variance Reduction Applied to Machine Learning for Pricing Bermudan/American Options in High Dimension

1 Introduction

2 American options in the multi-dimensional Black-Scholes model

3 Machine Learning for American options in the multi-dimensional Black-Scholes model

3.1 Gaussian Process Regression

Remark 1**.**

3.2 Machine Learning Exact Integration for European options

Proposition 1**.**

Proposition 2**.**

3.3 Machine Learning Control Variate algorithm for American options

3.3.1 The GPR Monte Carlo Method

3.3.2 The GPR Monte Carlo Control Variate Method

Remark 2**.**

Remark 3**.**

Remark 4**.**

3.3.3 The Control Variate for GPR-Tree and GRP-EI

4 Numerical Results

4.1 Geometric and Arithmetic Basket Put Options

4.2 Call on the Maximum option

4.3 Variance Reduction

5 Conclusions

Remark 1.

Proposition 1.

Proposition 2.

Remark 2.

Remark 3.

Remark 4.