Sample caching Markov chain Monte Carlo approach to boson sampling   simulation

Yong Liu; Min Xiong; Chunqing Wu; Dongyang Wang; Yingwen Liu,; Jiangfang Ding; Anqi Huang; Xiang Fu; Xiaogang Qiang; Ping Xu; Mingtang Deng,; Xuejun Yang; Junjie Wu

arXiv:1907.08077·quant-ph·April 28, 2020

Sample caching Markov chain Monte Carlo approach to boson sampling simulation

Yong Liu, Min Xiong, Chunqing Wu, Dongyang Wang, Yingwen Liu,, Jiangfang Ding, Anqi Huang, Xiang Fu, Xiaogang Qiang, Ping Xu, Mingtang Deng,, Xuejun Yang, Junjie Wu

PDF

TL;DR

This paper introduces a novel sample caching Markov chain Monte Carlo method that reduces sample correlation and loss, enabling more efficient classical simulation of boson sampling, which is crucial for quantum supremacy validation.

Contribution

The paper proposes a new sample caching MCMC approach that eliminates sample autocorrelation and loss, improving classical boson sampling simulation efficiency and applicability to various sampling tasks.

Findings

01

Reduces sample autocorrelation in MCMC sampling.

02

Prevents sample loss during simulation.

03

Enhances efficiency of boson sampling simulation.

Abstract

Boson sampling is a promising candidate for quantum supremacy. It requires to sample from a complicated distribution, and is trusted to be intractable on classical computers. Among the various classical sampling methods, the Markov chain Monte Carlo method is an important approach to the simulation and validation of boson sampling. This method however suffers from the severe sample loss issue caused by the autocorrelation of the sample sequence. Addressing this, we propose the sample caching Markov chain Monte Carlo method that eliminates the correlations among the samples, and prevents the sample loss at the meantime, allowing more efficient simulation of boson sampling. Moreover, our method can be used as a general sampling framework that can benefit a wide range of sampling tasks, and is particularly suitable for applications where a large number of samples are taken.

Tables8

Table 1. Table 1: The results of sample caching and jump sampling. The results of the two methods are compared for simulating boson sampling instances with different number of photons (p.) and modes (m.). r o subscript 𝑟 𝑜 r_{o} is the first-order autocorrelation of the original sample sequence generated by MCMC. r ( 200 ) superscript 𝑟 200 r^{(200)} is the first-order autocorrelation of the sequence obtained by jump sampling with the jumping step of 200. r 500 subscript 𝑟 500 r_{500} , r 1000 subscript 𝑟 1000 r_{1000} , r 2000 subscript 𝑟 2000 r_{2000} and r 4000 subscript 𝑟 4000 r_{4000} are the first-order autocorrelation of the sequences obtained through sample caching with the sizes of caches to be 500 500 500 , 1000 1000 1000 , 2000 2000 2000 and 4000 4000 4000 respectively.

Scale	$r_{o}$	$r^{(200)}$	$r_{500}$	$r_{1000}$	$r_{2000}$	$r_{4000}$
9p.81m.	0.9115	0.0102	0.0261	0.0119	0.0067	0.0035
15p.225m.	0.9490	0.0128	0.0451	0.0205	0.0102	0.0057
21p.441m.	0.9633	0.0030	0.0622	0.0311	0.0162	0.0081
25p.625m.	0.9673	-0.0183	0.0642	0.0328	0.0167	0.0089
30p.900m.	0.9702	-0.0832	0.0684	0.0309	0.0095	-0.0162

Table 2. Table SI: ε 𝜀 \varepsilon and the first-order autocorrelation under different cache size. For the n = 3 , m = 9 formulae-sequence 𝑛 3 𝑚 9 n=3,m=9 case (left), K 𝐾 K is set to be 29 according to Fig. S3 . For the n = 4 , m = 16 formulae-sequence 𝑛 4 𝑚 16 n=4,m=16 case (right), K 𝐾 K is set to be 33. For each sequence 1,000,000 samples are taken. k ¯ ¯ 𝑘 \bar{k} is the average absolute distance in the original sequence of the adjacent samples of the final sequence, N 1 subscript 𝑁 1 N_{1} is the number of adjacent samples in the final sequence those are also adjacent in the original sequence, and R 1 subscript 𝑅 1 R_{1} is the corresponding ratio over the whole sample sequence, which is of the theoretical value as 1 L 1 𝐿 \frac{1}{L} . F K subscript 𝐹 𝐾 F_{K} is the frequency of adjacent samples in the final sequence those are of a distance less than K 𝐾 K in the original sequence, and ε 𝜀 \varepsilon represents the ratio. Finally, r 1 subscript 𝑟 1 r_{1} is the first order autocorrelation.

3p.9m.							4p.16m.
L	$\bar{K}$	$N_{1}$	$R_{1}$	$F_{K}$	$ε$	$r_{1}$	L	$\bar{K}$	$N_{1}$	$R_{1}$	$F_{K}$	$ε$	$r_{1}$ .
10	9.9912	99592	9.96%	953263	95.33%	0.2580	10	10.0059	100132	10.01%	968989	96.90%	0.3215
20	19.9961	49948	4.99%	774238	77.42%	0.1514	20	19.9819	50304	5.03%	816477	81.65%	0.1949
30	30.0184	33599	3.36%	625093	62.51%	0.1095	30	30.0365	33105	3.31%	673629	67.36%	0.1386
40	39.9672	25031	2.50%	519809	51.98%	0.0843	40	40.0001	24880	2.49%	566109	56.61%	0.1070
50	49.9564	19909	1.99%	443412	44.34%	0.0691	50	49.9722	19771	1.98%	486719	48.67%	0.0854
			…							…
100	99.9055	9913	0.99%	253312	25.33%	0.0371	100	100.0127	10081	1.01%	282310	28.23%	0.0445
200	199.8007	5018	0.50%	135236	13.52%	0.0189	200	199.7608	4989	0.50%	152125	15.21%	0.0234
			…							…
500	499.5682	1992	0.20%	56410	5.64%	0.0062	500	499.5963	1971	0.20%	64398	6.44%	0.0096
			…							…
1000	999.6380	994	0.10%	28471	2.85%	0.0031	1000	999.9884	967	0.10%	32528	3.25%	0.0051
			…							…
10000	9998.9470	107	0.01%	2969	0.30%	0.0009	10000	10002.9656	87	0.01%	3356	0.34%	0.0010

Table 3. Table SII: Examples for the value assigning of output patterns. V B D subscript 𝑉 𝐵 𝐷 V_{BD} is the vale assigned by transmit the binary values into decimal, V S O subscript 𝑉 𝑆 𝑂 V_{SO} is the order of the patterns, and V l o g P subscript 𝑉 𝑙 𝑜 𝑔 𝑃 V_{logP} is the logarithm of the probability of the patterns.

No.	Pattern	Probability	$V_{B D}$	$V_{S O}$	$V_{- l o g P}$	No.	Pattern	Probability	$V_{B D}$	$V_{S O}$	$V_{- l o g P}$
1	(0,0,0,1,1,1)	$6.24 \times 10^{- 2}$	7	1	1.204	11	(1,0,0,0,1,1)	$8.8 \times 10^{- 3}$	35	11	2.055
2	(0,0,1,0,1,1)	$0.71 \times 10^{- 3}$	11	2	3.144	12	(1,0,0,1,0,1)	$5.15 \times 10^{- 2}$	37	12	1.287
3	(0,0,1,1,0,1)	$1.41 \times 10^{- 2}$	13	3	1.848	13	(1,0,0,1,1,0)	$3.82 \times 10^{- 3}$	38	13	2.417
4	(0,0,1,1,1,0)	$5.45 \times 10^{- 3}$	14	4	2.262	14	(1,0,1,0,0,1)	$1.46 \times 10^{- 2}$	41	14	1.834
5	(0,1,0,0,1,1)	$1.12 \times 10^{- 2}$	19	5	1.949	15	(1,0,1,0,1,0)	$8.44 \times 10^{- 3}$	42	15	2.073
6	(0,1,0,1,0,1)	$2.33 \times 10^{- 2}$	21	6	1.631	16	(1,0,1,1,0,0)	$2.67 \times 10^{- 3}$	44	16	2.572
7	(0,1,0,1,1,0)	$1.96 \times 10^{- 2}$	22	7	1.705	17	(1,1,0,0,0,1)	$6.24 \times 10^{- 3}$	49	17	2.204
8	(0,1,1,0,0,1)	$7.05 \times 10^{- 3}$	25	8	2.151	18	(1,1,0,0,1,0)	$1.49 \times 10^{- 2}$	50	18	1.824
9	(0,1,1,0,1,0)	$2.81 \times 10^{- 2}$	26	9	1.551	19	(1,1,0,1,0,0)	$3.16 \times 10^{- 3}$	52	19	2.499
10	(0,1,1,1,0,0)	$5.04 \times 10^{- 3}$	28	10	2.297	20	(1,1,1,0,0,0)	$2.02 \times 10^{- 2}$	56	20	1.694

Table 4. Table SIII: Performance parameters of the computing node of Tianhe-2 supercomputer and those of the local cluster.

Item		Parameters of a node
		Tianhe-2		Local Cluster
		With Accelerators	Without Accelerators	Local Cluster
Peak Performance		3.43Teraflops	422.4Gigaflops	201.6Gigaflops
Processors:	CPU	Intel Xeon E5 $\times$ 2 (24 cores)	Intel Xeon E5 $\times$ 2 (24 cores)	Intel Xeon E5 $\times$ 2 (12 cores)
Processors:	Accelerators	Intel Xeon Phi $\times$ 3 (171 cores)	$∖$	$∖$
Memory Storage Capacity		72GB	64GB	16GB
Interconnect Network		TH Express-2	TH Express-2	InfiniBand

Table 5. Table SIV: Simulation results using SC-MCMC. The scales reflect the number of photons (p.) and modes (m.) in the boson sampling scheme. T t o t a l subscript 𝑇 𝑡 𝑜 𝑡 𝑎 𝑙 T_{total} is the time used for the whole sampling process, T 1 S a m p l e subscript 𝑇 1 𝑆 𝑎 𝑚 𝑝 𝑙 𝑒 T_{1Sample} is the average time for one sample, T p e r subscript 𝑇 𝑝 𝑒 𝑟 T_{per} is the time used on the calculation of permanents, and T 1 P e r subscript 𝑇 1 𝑃 𝑒 𝑟 T_{1Per} is the average time for one permanent. R a t e 𝑅 𝑎 𝑡 𝑒 Rate is the sampling rate when using N 𝑁 N nodes on the specified platform. r 1 subscript 𝑟 1 r_{1} is the first-order autocorrelation of the sequence when using a sampling cache with size of 4,000. The execution on Tianhe-2 only uses the CPUs, while the Intel Xeon Phi accelerators are not applied. When the number of photons are less than 17, the main cost of the calculation is the start-up of the calculation (note that the time for one permanent is in the order of 10 − 4 superscript 10 4 10^{-4} seconds), rather than the permanents. After that ( n ≥ 17 𝑛 17 n\geq 17 ), the time used in the simulation well confirm to the rule that the execution time doubles when the number of photons increases by 1, which means the quantity of computation of permanent becomes the main part of the simulation.

Scale	Platform	$N$	$S$	Rate(Hz)	$T_{t o t a l}$ (s)	$T_{1 S a m p l e}$ (s)	$T_{p e r}$ (s)	$T_{1 P e r}$ (s)	$%_{P e r}$	$r_{1}$
20p.400m.	Tianhe-2	4	500,000	152.03	3288.88	0.00658	3185.82	0.00637	96.87%	0.0097
25p.625m.	Tianhe-2	32	200,000	22.51	8884.56	0.04442	8804.99	0.04402	99.10%	0.0089
30p.900m.	Tianhe-2	64	20,000	1.01	19836.72	0.99184	19825.83	0.99129	99.95%	-0.0162
3p.9m.	Cluster	1	1,000,000	4263.29	234.56	0.00023	226.51	0.00023	96.57%	0.0018
4p.16m.	Cluster	1	1,000,000	4163.90	240.16	0.00024	229.83	0.00023	95.70%	-0.0005
5p.25m.	Cluster	1	1,000,000	4082.18	244.97	0.00024	231.99	0.00023	94.70%	0.0005
6p.36m.	Cluster	1	1,000,000	3919.72	255.12	0.00026	238.02	0.00024	93.30%	0.0036
7p.49m.	Cluster	1	1,000,000	3795.73	263.45	0.00026	242.39	0.00024	92.01%	0.0014
8p.64m.	Cluster	8	1,000,000	3491.04	286.45	0.00029	257.62	0.00026	89.94%	0.0023
9p.81m.	Cluster	8	1,000,000	3429.28	291.61	0.00029	272.66	0.00027	93.50%	0.0035
10p.100m.	Cluster	8	1,000,000	3195.79	312.91	0.00031	290.86	0.00029	92.95%	0.0045
11p.121m.	Cluster	32	1,000,000	2780.55	347.64	0.00035	296.36	0.00030	85.25%	0.0035
12p.144m.	Cluster	32	1,000,000	2897.39	357.68	0.00036	326.69	0.00033	91.34%	0.0045
13p.169m.	Cluster	32	1,000,000	2818.12	413.30	0.00041	371.23	0.00037	89.82%	0.0058
14p.196m.	Cluster	32	1,000,000	2813.13	537.99	0.00054	489.10	0.00049	90.91%	0.0041
15p.225m.	Cluster	32	1,000,000	2133.86	789.08	0.00079	731.82	0.00073	92.74%	0.0057
16p.256m.	Cluster	32	1,000,000	800.30	1252.05	0.00125	1183.67	0.00118	94.54%	0.0064
17p.289m.	Cluster	32	1,000,000	317.46	3156.84	0.00316	3074.90	0.00307	97.40%	0.0063
18p.324m.	Cluster	32	1,000,000	159.97	6251.32	0.00625	6156.73	0.00616	98.49%	0.0072
19p.361m.	Cluster	32	1,000,000	78.94	12667.81	0.01267	12558.42	0.01256	99.14%	0.0049
20p.400m.	Cluster	32	1,000,000	37.98	26326.21	0.02633	26199.45	0.02620	99.52%	0.0067
21p.441m.	Cluster	32	1,000,000	18.09	55293.32	0.05529	55146.15	0.05515	99.73%	0.0081

Table 6. Table SV: The increment of required photon number for Q A ( n , η ) = 0 𝑄 𝐴 𝑛 𝜂 0 QA(n,\eta)=0 for a network with m = n 2 𝑚 superscript 𝑛 2 m=n^{2} . The number of photons required for Q A ( n , η ) = 0 𝑄 𝐴 𝑛 𝜂 0 QA(n,\eta)=0 increases according to the value of η 𝜂 \eta . N M I S subscript 𝑁 𝑀 𝐼 𝑆 N_{MIS} is the least number of required photons for Q A ( n , η ) = 0 𝑄 𝐴 𝑛 𝜂 0 QA(n,\eta)=0 according to the result obtained by MIS if the transmission probability realized in physical experiment is η 𝜂 \eta . N c t subscript 𝑁 subscript 𝑐 𝑡 N_{c_{t}} is correspond number via SC-MCMC.

$R_{q}$	$10$ GHz										$76 n^{- 1}$ MHz
$η$	0.55	0.60	0.65	0.70	0.75	0.80	0.85	0.90	0.95	1	0.60	0.65	0.70	0.75	0.80	0.85	0.90	0.95	1
$N_{M I S}$	15	12	10	8	8	7	7	6	6	6	44	32	26	22	19	17	16	15	14
$N_{t_{c}}$	45	29	22	18	16	14	13	12	11	11	69	49	39	33	29	26	24	22	20
$N_{t_{c}} - N_{M I S}$	30	17	12	10	8	7	6	6	5	5	25	17	13	11	10	9	8	7	6

Table 7. Table SVI: The increment of required photon number for Q A ( n , η ) = 0 𝑄 𝐴 𝑛 𝜂 0 QA(n,\eta)=0 for a network with m = 4 n 𝑚 4 𝑛 m=4n . The number of photons required for Q A ( n , η ) = 0 𝑄 𝐴 𝑛 𝜂 0 QA(n,\eta)=0 increases according to the value of η 𝜂 \eta . N M I S subscript 𝑁 𝑀 𝐼 𝑆 N_{MIS} is the least number of required photons for Q A ( n , η ) = 0 𝑄 𝐴 𝑛 𝜂 0 QA(n,\eta)=0 according to the result obtained by MIS if the transmission probability realized in physical experiment is η 𝜂 \eta . N c t subscript 𝑁 subscript 𝑐 𝑡 N_{c_{t}} is correspond number via SC-MCMC.

$R_{q}$	$10$ GHz							$76 n^{- 1}$ MHz
$η$	0.70	0.75	0.80	0.85	0.90	0.95	1	0.70	0.75	0.80	0.85	0.90	0.95	1
$N_{M I S}$	11	9	8	7	7	6	6	59	39	30	25	21	19	17
$N_{t_{c}}$	34	25	20	17	15	14	13	99	64	48	40	34	30	27
$N_{t_{c}} - N_{M I S}$	23	16	12	10	8	8	7	40	25	18	15	13	11	10

Table 8. Table SVII: The increment of the minimum value of η 𝜂 \eta , the transmission probability of a single photon, when the number of photons is limited under 100. For the curves, the minimum value is reached when the photon number is 100. The increment varies according to the shape of the network and the repetition rate of the photon source. η M I S subscript 𝜂 𝑀 𝐼 𝑆 \eta_{MIS} is the minimum value of η 𝜂 \eta according to the curves obtained by MIS, and η S C − M C M C subscript 𝜂 𝑆 𝐶 𝑀 𝐶 𝑀 𝐶 \eta_{SC-MCMC} is the correspond value via SC-MCMC.

Network	$m = n^{2}$		$m = 4 n$
Repetition Rate	$R_{q} = 10$ GHz	$R_{q}^{'} = 76 n^{- 1}$ MHz	$R_{q} = 10$ GHz	$R_{q}^{'} = 76 n^{- 1}$ MHz
$η_{M I S}$	48.81%	53.67%	60.41%	66.42%
$η_{S C - M C M C}$	51.32%	56.43%	63.52%	69.83%
$η_{S C - M C M C}$ - $η_{M I S}$	2.51%	2.76%	3.11%	3.41%

Equations69

P_{accept} = min (1, p (s^{'}) / p (s)),

P_{accept} = min (1, p (s^{'}) / p (s)),

r_{1} = \frac{\sum _{i} ( x _{i} - x ˉ ) ( x _{i + 1} - x ˉ )}{\sum _{i} ( x _{i} - x ˉ ) ^{2}},

r_{1} = \frac{\sum _{i} ( x _{i} - x ˉ ) ( x _{i + 1} - x ˉ )}{\sum _{i} ( x _{i} - x ˉ ) ^{2}},

p (k, L) = (\frac{L - 1}{L})^{k - 1} \cdot \frac{1}{L},

p (k, L) = (\frac{L - 1}{L})^{k - 1} \cdot \frac{1}{L},

P_{cr} \equiv k = 1 \sum K p (k, L) = 1 - (\frac{L - 1}{L})^{K} \approx \frac{K}{L} .

P_{cr} \equiv k = 1 \sum K p (k, L) = 1 - (\frac{L - 1}{L})^{K} \approx \frac{K}{L} .

Q A (n, η) = lo g (t_{c} / t_{q}),

Q A (n, η) = lo g (t_{c} / t_{q}),

Pr (S \to T) = \frac{∣ Per ( U ^{(S, T)} ) ∣ ^{2}}{\prod _{i} s _{i} ! \prod _{j} t _{j} !},

Pr (S \to T) = \frac{∣ Per ( U ^{(S, T)} ) ∣ ^{2}}{\prod _{i} s _{i} ! \prod _{j} t _{j} !},

Per (A) = σ \sum i = 1 \prod n a_{i σ_{i}},

Per (A) = σ \sum i = 1 \prod n a_{i σ_{i}},

r_{k} = \frac{\mathds E [ ( x _{t} - μ ) ( x _{t + k} - μ ) ]}{σ ^{2}},

r_{k} = \frac{\mathds E [ ( x _{t} - μ ) ( x _{t + k} - μ ) ]}{σ ^{2}},

r_{k} = \frac{\sum _{t = 1}^{n - k} ( x _{t} - μ _{s} ) ( x _{t + k} - μ _{s} )}{σ _{s}^{2}},

r_{k} = \frac{\sum _{t = 1}^{n - k} ( x _{t} - μ _{s} ) ( x _{t + k} - μ _{s} )}{σ _{s}^{2}},

d = \frac{\sum _{t = 1}^{T} ( x _{t} - x _{t + 1} ) ^{2}}{\sum _{t = 1}^{T} x _{t}^{2}} .

d = \frac{\sum _{t = 1}^{T} ( x _{t} - x _{t + 1} ) ^{2}}{\sum _{t = 1}^{T} x _{t}^{2}} .

\mathds E [(x_{t} - μ_{t}) (x_{j} - μ_{t})] / σ^{2}

\mathds E [(x_{t} - μ_{t}) (x_{j} - μ_{t})] / σ^{2}

=

=

=

=

p(s_{j}|s_{i})=p_{ij}=\left\{\begin{array}[]{ll}g(s_{j}|s_{i})\cdot\min\left(1,\frac{p(s_{j})}{p(s_{i})}\right),&i\neq j\\ 1-\sum_{l=1}^{i-1}p_{il}-\sum_{l=i+1}^{N}p_{il},&i=j,\end{array}\right.

p(s_{j}|s_{i})=p_{ij}=\left\{\begin{array}[]{ll}g(s_{j}|s_{i})\cdot\min\left(1,\frac{p(s_{j})}{p(s_{i})}\right),&i\neq j\\ 1-\sum_{l=1}^{i-1}p_{il}-\sum_{l=i+1}^{N}p_{il},&i=j,\end{array}\right.

d_{A} = j = 1 \sum N (\frac{1}{N} i = 1 \sum N p_{ij}^{(k)} - p (s_{j})),

d_{A} = j = 1 \sum N (\frac{1}{N} i = 1 \sum N p_{ij}^{(k)} - p (s_{j})),

p_{+} (k, L) =

p_{+} (k, L) =

=

p_{-} (k, L) =

p_{-} (k, L) =

=

\mathds E (k)

\mathds E (k)

= i = 1 \sum \infty i \cdot (\frac{L - 1}{L})^{i - 1} \cdot \frac{1}{L}

= L .

P_{cr} (L)

P_{cr} (L)

= \frac{K}{L} + O (\frac{1}{L ^{2}})

\approx \frac{K}{L} .

g(s_{j}|s_{i})=\left\{\begin{array}[]{ll}\frac{1}{n\cdot(m-n)},&\text{ patterns correspond to }s_{i}\text{ differs from that of }s_{j}\text{ in the position of one photon;}\\ 0,&\text{ else},\end{array}\right.

g(s_{j}|s_{i})=\left\{\begin{array}[]{ll}\frac{1}{n\cdot(m-n)},&\text{ patterns correspond to }s_{i}\text{ differs from that of }s_{j}\text{ in the position of one photon;}\\ 0,&\text{ else},\end{array}\right.

S = \frac{( \sum _{i} P _{i} Q _{i} ) ^{2}}{\sum _{i} P _{i} \sum _{i} Q _{i}},

S = \frac{( \sum _{i} P _{i} Q _{i} ) ^{2}}{\sum _{i} P _{i} \sum _{i} Q _{i}},

T (n) = 1.9925 \cdot n^{2} 2^{n} \times 1 0^{- 15},

T (n) = 1.9925 \cdot n^{2} 2^{n} \times 1 0^{- 15},

T (p) = \frac{1.9675 \times 1 0 ^{10}}{p ^{0.8782}} .

T (p) = \frac{1.9675 \times 1 0 ^{10}}{p ^{0.8782}} .

Q A (n, η) = lo g (\frac{t _{c}}{t _{q}}),

Q A (n, η) = lo g (\frac{t _{c}}{t _{q}}),

t_{q} (n, η) = (R_{q} \cdot η^{n} \cdot P_{C F})^{- 1},

t_{q} (n, η) = (R_{q} \cdot η^{n} \cdot P_{C F})^{- 1},

t_{q}^{m = n^{2}}

t_{q}^{m = n^{2}}

t_{q}^{m = 4 n}

t_{c} = T (n) = 1.9925 \cdot n^{2} 2^{n} \times 1 0^{- 15},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Sample Caching Markov Chain Monte Carlo Approach to Boson Sampling Simulation

Yong Liu