Validating multi-photon quantum interference with finite data

Fulvio Flamini; Mattia Walschaers; Nicolò Spagnolo; Nathan Wiebe; Andreas Buchleitner; Fabio Sciarrino

arXiv:1904.12318·quant-ph·January 27, 2023

TL;DR

This paper develops a comprehensive, operational framework for validating multi-photon quantum interference experiments, crucial for quantum information processing, by extending existing protocols and analyzing their performance with finite data and against classical simulations.

Contribution

It unifies and strengthens validation protocols for multi-photon interference, combining Bayesian hypothesis testing and the statistical benchmark, and analyzes their performance with finite data and against adversarial classical simulation algorithms.

Findings

1. Finite sample size impacts validation accuracy.
2. Bayesian and statistical benchmarks are effective for different scales.
3. Classical algorithms can mimic quantum data under certain conditions.

Equations

$$\frac{P(H_{Q}|\{x\})}{P(H_{A}|\{x\})}=\prod_{k=1}^{\mathcal{S}}\frac{p_{Q}(x_{k})}{p_{A}(x_{k})}=\chi_{\mathcal{S}}$$

$$C_{ij}=\langle\hat{n}_{i}\hat{n}_{j}\rangle-\langle\hat{n}_{i}\rangle\langle\hat{n}_{j}\rangle$$

$$\mathds{E}_{U}[\mathds{E}_{X}({\widetilde{C}_{ij}}^{2})]=\mathds{E}_{U}[C_{ij}^{2}]+\frac{\mathds{E}_{U}[\sigma_{ij}^{2}]}{\mathcal{S}}$$

$$P(e_{i}\to e_{i+1})=\min\left(1,\frac{p_{I}(e_{i+1})\,p_{D}(e_{i})}{p_{D}(e_{i+1})\,p_{I}(e_{i})}\right)$$

Full text

Fulvio Flamini

Dipartimento di Fisica, Sapienza Università di Roma, Piazzale Aldo Moro 5, I-00185 Roma, Italy

Institut für Theoretische Physik, Universität Innsbruck, Technikerstraße 21a, 6020 Innsbruck, Austria

Mattia Walschaers

Laboratoire Kastler Brossel, Sorbonne Université, CNRS, ENS-PSL Research University, Collège de France, 4 place Jussieu, 75005 Paris, France

Nicolò Spagnolo

Dipartimento di Fisica, Sapienza Università di Roma, Piazzale Aldo Moro 5, I-00185 Roma, Italy

Nathan Wiebe

Station Q Quantum Architectures and Computation Group, Microsoft Research, Redmond, WA, United States

Pacific Northwest National Laboratory, Richland, WA, United States

Department of Physics, University of Washington, Seattle, WA, United States

Andreas Buchleitner

Physikalisches Institut, Albert-Ludwigs-Universität Freiburg, Hermann-Herder-Strasse 3, 79104 Freiburg, Germany

Fabio Sciarrino

Dipartimento di Fisica, Sapienza Università di Roma, Piazzale Aldo Moro 5, I-00185 Roma, Italy

Abstract

Multi-particle interference is a key resource for quantum information processing, as exemplified by Boson Sampling. Hence, given its fragile nature, an essential desideratum is a solid and reliable framework for its validation. However, while several protocols have been introduced to this end, the approach is still fragmented and fails to build a big picture for future developments. In this work, we propose an operational approach to validation that encompasses and strengthens the state of the art for these protocols. To this end, we consider the Bayesian hypothesis testing and the statistical benchmark as most favorable protocols for small- and large-scale applications, respectively. We numerically investigate their operation with finite sample size, extending previous tests to larger dimensions, and against two adversarial algorithms for classical simulation: the Mean-Field sampler and the Metropolized Independent Sampler. To evidence the actual need for refined validation techniques, we show how the assessment of numerically simulated data depends on the available sample size, as well as on the internal hyper-parameters and other practically relevant constraints. Our analyses provide general insights into the challenge of validation, and can inspire the design of algorithms with a measurable quantum advantage.

I Introduction

A quantum computational advantage occurs when a quantum device starts outperforming its best classical counterpart on a given specialized task Harrow17 ; Arute19 . Intermediate models AA ; Terhal04 ; Bremner10 ; Boixo18 and platforms Inagaki16 ; McMahon16 ; Goto16 ; Lechner15 ; Johnson11 ; Puri17 have been proposed to achieve this regime, largely reducing the physical resources required by universal computation. The technological race towards quantum computational advantage nonetheless goes hand-in-hand with the development of classical protocols capable of discerning genuine quantum information processing Shin14 ; Fitzsimons17 ; Flamini18 ; Eisert19 ; Walschaers19 . The intertwined evolution of these two aspects has been highlighted in particular by Boson Sampling AA ; Brod19 , where several protocols have been introduced Mayer11 ; Aaronson14 ; Spagnolo14 ; Carolan14 ; Bentivegna14 ; Tichy10 ; Tichy14 ; Crespi15 ; Walschaers16 ; Liu16 ; Shchesnovich16 ; Wang16bubbles ; Dittel17 ; Dittel18 ; Viggianiello18 ; Agresti19 ; Flamini19tsne and experimentally tested Bentivegna14 ; Spagnolo14 ; Carolan14 ; Carolan15 ; Crespi16 ; Wang16beating ; Wang16ten-photon ; He17 ; Wang17 ; Viggianiello17tvd ; Viggianiello18 ; Wang18 ; Giordani18 ; Agresti19 ; Paesani19 ; Wang1920photons to rule out non-quantum processes. Boson Sampling, in its original formulation AA , consists of sampling from a probability distribution that can be related to the evolution of indistinguishable photons in a linear-optical interferometer. Recent analyses suggested reasonable thresholds in the number of photons $n$ to surpass classical algorithms Neville17 ; Clifford18 ; Wu16 .

While the sampling task itself has been thoroughly analyzed in computational complexity theory, we still lack a comparable understanding when it comes to its validation. However, it is clear from a practical perspective that any computational problem designed to demonstrate quantum advantage needs to be formulated together with a set of validation protocols which account for the physical ramifications and resources required for its implementation. For instance, while small-scale examples can be validated by direct solution of the Schrödinger equation and using statistical measures such as cross-entropy Boixo18 , this approach is prohibitively expensive for debugging a faulty Boson Sampler. Moreover, for Boson Sampling a deterministic certification is impossible Tichy10 by the very definition of the problem Aaronson14 . Hence, it is crucial to develop debugging tools, as well as tests to exclude undesired hypotheses on the system producing the output, that are computationally affordable and experimentally feasible. Furthermore, due to random fluctuations inherent to any finite-size problem, a validation cannot be considered reliable until sufficient physical resources are spent to obtain reasonable experimental uncertainties. Ultimately, no computational problem can provide evidence of quantum advantage unless quantitative validation criteria can be stated.

In this work, we investigate the problem of validating multi-photon quantum interference in realistic scenarios with finite data. The paper is structured as follows: first, we discuss possible ambiguities in the validation of Boson Sampling, which play a crucial role in large-size experiments. Then, building upon state-of-the-art validation protocols, we address the above considerations with a more quantitative analysis. We describe a practical approach to validation that makes the most of the limited physical resources available. Specifically, we study the use of the statistical benchmark Walschaers16 and the Bayesian hypothesis testing Bentivegna14 to validate $n$ -photon interference for large and small $n$ , respectively. We numerically investigate their operation against classical algorithms to simulate quantum interference, with a particular focus on the number of measurements. The reported analysis strengthens the need for a well-defined approach to validation, both to demonstrate quantum advantage and to assist applications that involve multi-photon states.

II Validation of Boson Sampling: framework

Our aim, in the context of Boson Sampling, is the unambiguous identification of a quantum advantage in a realistic scenario. We focus on the task of validation, or verification, whose aim is to check whether measured experimental data are compatible with what can be expected from a given physical model. Validation generally requires fewer resources and is, thus, more appropriate for practical applications than full certification, which is exponentially hard in $n$ for Boson Sampling Aaronson14 ; Hangleiter19 . In both cases, these claims must follow a well-defined protocol to distill experimental evidence that is accepted by the community under jointly agreed criteria Markov18 (Fig. 1). As we discuss below and in Sec. III, we propose an application-oriented approach to validation that takes into consideration the limited physical resources, be they related to the evaluation of permanents Valiant79 or to finite sample size Hangleiter19 . In fact, without such well-defined approaches, obstacles or ambiguities may arise in large-scale experiments, as we highlight in the following. For instance, not all validation protocols are computationally efficient, which is a strong limitation for future multi-photon applications or high-rate real-time monitoring. Also, a theoretically scalable validation protocol may still be experimentally impractical due to large instrumental overheads or large prefactors that enter the scaling law.

Given two validation protocols $\mathcal{V}_{1}$ and $\mathcal{V}_{2}$ to rule out the same physical hypothesis or model, which conclusion can be drawn if they agree for a data set of given size and unexpectedly disagree when we add more data? In principle we can accept or reject a data set when we reach a certain level of confidence, but which action is to be taken if this threshold is not reached after a large number of measurement events (which hereafter we refer to as the “sample size”)? Shall we proceed until we pass that level, shall we reject it or shall we make a guess on the available data? Finally, what if the classical algorithm becomes more effective in simulating Boson Sampling for larger data sets, as for Markov chains Neville17 , or for longer processing times, as for adversarial machine learning algorithms adversarial-ML that could exploit specific vulnerabilities of validation protocols?

However artificial some of the above questions may seem, such a skeptical approach was indeed already adopted Tichy14 and addressed Dittel17 ; Dittel18 ; Crespi15 ; Viggianiello18 ; Flamini19tsne ; Crespi16 ; Carolan15 ; Walschaers16 with the Mean-Field sampler (see Sec. V.1): all these considerations are necessary to strengthen the claim of quantum advantage. Under the above premise, we therefore identify the following crucial features to be assessed in any decision on acceptance or rejection:

1. Sample size $\mathcal{S}$. The strength of a validation protocol is affected by the limited number $\mathcal{S}$ of collected events, as compared to the total number of distinct $n$-photon output events. While this limitation is not relevant for small-scale implementations, due to ($i$) the low dimension of the Hilbert space, ($ii$) a high level of control and ($iii$) reduced losses, it represents one of the main bottlenecks for the actually targeted large-scale instances Brugger18MSc . It is thus desirable to assess the robustness and the resilience of a protocol under such incomplete-sampling effects, to quantify the impact of always strictly finite experimental resources on the actual applicability range of the protocol. We therefore propose to define a (minimal) threshold sample size $\mathcal{S}$ which must be available for validation. Given a set of $\mathcal{S}$ events, a validation protocol must be capable of giving a reliable answer within a certain confidence level.

2. Available sampling time $\mathcal{T}$. While the sampling rate is nearly constant for current quantum and classical approaches Clifford18 , de facto making the time $\mathcal{T}$ not relevant, it cannot be excluded that future algorithms may process data and output all events at once. The very quality of the simulation, i.e. the similarity to quantum Boson Sampling in a given metric, could also improve with processing time Neville17 ; adversarial-ML . Ultimately, $\mathcal{T}$ must be treated as an independent parameter with respect to $\mathcal{S}$, while at the same time it should be adapted to the sample size required for a reliable validation.

3. Unitary $\mathcal{U}$. Unitary evolutions should be drawn Haar-randomly by a third agent at the start of the competition, to avoid any preprocessing. This agent, the validator ($\mathcal{V}$), uses specific validation protocols to decide whether a sample is compatible with quantum operation.
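To make the incomplete-sampling issue in the first item concrete, the following minimal sketch counts the distinct $n$-photon output events in $m$ modes and compares them with a sample size. The function names are hypothetical and the counting formulas are the standard combinatorial ones for bosonic occupations; they are not taken from the text above.

```python
from math import comb


def output_space_size(n: int, m: int) -> int:
    """Number of ways to place n indistinguishable photons in m modes."""
    return comb(m + n - 1, n)


def collision_free_size(n: int, m: int) -> int:
    """Number of output events with at most one photon per mode."""
    return comb(m, n)


def max_sampled_fraction(n: int, m: int, sample_size: int) -> float:
    """Upper bound on the fraction of distinct events a sample can cover."""
    return min(1.0, sample_size / output_space_size(n, m))
```

For $n=3$ and $m=n^{2}=9$ the output space has 165 events (84 of them collision-free), so a few hundred events can already cover it. For $n=12$ and $m=144$, even $10^{6}$ events cover a negligible fraction.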

In the setting thus defined, a data set is said to be validated according to the following rule (Fig. 1a):

Boson Sampling is validated if, upon collecting $\mathcal{S}$ events in time $\mathcal{T}$ from some random unitary $\mathcal{U}$, the data set is accepted by all selected validators $\mathcal{V}$.

Given a unitary and a set of validation protocols, we are then left with the choice of $\mathcal{S}$ and $\mathcal{T}$, which need to be plausible by current technological standards. Since they demand sampling $\mathcal{S}$ events in time $\mathcal{T}$, these thresholds in fact limit the size of the problem ($n$, $m$) for an experimental implementation. As for the time $\mathcal{T}$, one possibility, feasible for quantum experiments, could be for instance one hour. Within this time, a quantum device will probably output events at a nearly constant rate, while a classical computer can output them at any rate allowed by its clock cycle time. The choice of the sample size $\mathcal{S}$ is instead more intricate, since too high a value conflicts with the limited $\mathcal{T}$, while too low a value implies an unreliable validation $\mathcal{V}$. With these or further considerations jointV , classical and quantum samplers should agree upon a combination of ($n$, $m$, $\mathcal{S}$, $\mathcal{T}$) that allows them to validate their operation.

III Validation with finite sample size

In this section, we investigate a convenient approach to validation that distinguishes between two regimes: up to $n\sim 30$ (Sec. III.1) and beyond $n\sim 30$ (Sec. III.2). In each subsection, we first summarize the main ideas behind the corresponding protocol. Then, we discuss its performance for various ($n$, $m$), highlighting strengths and limitations, by numerically simulating experiments with finite sample size and distinguishable or indistinguishable photons.

III.1 Bayesian tests for small-scale experiments

The Bayesian approach to Boson Sampling validation ( $\mathcal{V}_{B}$ ), introduced in Ref. Bentivegna14 , aims to identify the more likely of two alternative hypotheses, which model the multi-photon states under consideration. In particular, $\mathcal{V}_{B}$ tests the Boson Sampling hypothesis ( $H_{Q}$ ), which assumes fully indistinguishable $n$ -photon states, against an alternative hypothesis ( $H_{A}$ ) for the source that produces the measurement outcomes $\{x\}$ . Equal probabilities are assigned to the two hypotheses prior to the experiment. Let us denote with $p_{Q}(x_{k})$ $\left(p_{A}(x_{k})\right)$ the scattering probability associated with the output state $x_{k}$ for $H_{Q}$ ( $H_{A}$ ). The intuition is that, if $H_{Q}$ is the more suitable model for the experiment, it is more likely to collect events for which $p_{Q}(x_{k})>p_{A}(x_{k})$ . The idea is made quantitative by considering the confidence $P(\{x\}|H_{hypo})=\prod_{k=1}^{\mathcal{S}}p_{hypo}(x_{k})$ we assign to each hypothesis. By applying Bayes’ theorem, after $\mathcal{S}$ events we have

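$$\frac{P(H_{Q}|\{x\})}{P(H_{A}|\{x\})}=\prod_{k=1}^{\mathcal{S}}\frac{p_{Q}(x_{k})}{p_{A}(x_{k})}=\chi_{\mathcal{S}},\tag{1}$$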

and our confidence in the hypothesis $H_{Q}$ becomes $P(H_{Q}|\{x\})=\frac{\chi_{\mathcal{S}}}{1+\chi_{\mathcal{S}}}$ .
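As a minimal sketch of this update, assuming the per-event probabilities $p_{Q}(x_{k})$ and $p_{A}(x_{k})$ have already been computed and are strictly positive, the confidence $\chi_{\mathcal{S}}/(1+\chi_{\mathcal{S}})$ can be accumulated in log space to avoid underflow for large $\mathcal{S}$. The function name is hypothetical.

```python
import math


def bayesian_confidence(p_q, p_a):
    """Return P(H_Q | {x}) for equal priors.

    p_q, p_a: sequences with the probability of each observed event under
    H_Q and H_A (assumed strictly positive, one entry per event).
    """
    if len(p_q) != len(p_a):
        raise ValueError("p_q and p_a must have one entry per event")
    # log(chi_S) is the sum of the per-event log-likelihood ratios.
    log_chi = sum(math.log(q) - math.log(a) for q, a in zip(p_q, p_a))
    # chi / (1 + chi), written so that exp() never overflows.
    if log_chi >= 0:
        return 1.0 / (1.0 + math.exp(-log_chi))
    chi = math.exp(log_chi)
    return chi / (1.0 + chi)
```

With no events the function returns the prior value 0.5. Two events that are each twice as likely under $H_{Q}$ give $\chi_{\mathcal{S}}=4$ and a confidence of 0.8.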

This test requires the evaluation of permanents of $n\times n$ scattering matrices for $p_{Q}(x_{k})$ Valiant79 ; Wu16 , which sets an upper limit to the number of photons that can be studied in practical applications Wang16beating ; Wang17 ; Wang16ten-photon ; He17 ; Neville17 ; Viggianiello17tvd ; Paesani19 ; Wang1920photons . Indeed, it is foreseeable that real-time monitoring or feedback-loop stabilization of quantum optics experiments will only have access to portable platforms with limited computational power. However, an interesting advantage of this validation protocol is its broad versatility, due to the absence of assumptions on the alternative distributions. Importantly, when applied to validate Boson Sampling against distinguishable photons, it requires very few measurements ( $\mathcal{S}\sim 20$ ) for a reliable assessment. In Fig. 2, for instance, we numerically investigate its application as a function of sample size, extending previous simulations from $n=3$ Bentivegna14 to $n=(3,6,9,12)$ and $m=n^{2}$ .

Data for distinguishable ( $H_{C}$ ) and indistinguishable ( $H_{Q}$ ) photons were generated using exact algorithms, respectively by Aaronson and Arkhipov Aaronson14 and by Clifford and Clifford Clifford18 . The analysis shows that the validation protocol becomes even more effective for increasing $n$ , as it is able to output a reliable verdict after only $\sim 20$ events. However, as mentioned, its power comes at the cost of being computationally inefficient in $n$ . Also, it is not possible to preprocess $\mathcal{V}_{B}$ and store information for later reuse, since its confidence depends on the specific $\mathcal{U}$ and sampled events, according to $p_{Q}(x_{k})$ . Hence, in the regime $n\sim 25-35$ Wang1920photons ; Neville17 it rapidly becomes harder to perform a validation in real time. Eventually, since classical supercomputers cannot assist quantum experiments in everyday applications, $\mathcal{V}_{B}$ becomes prohibitive from $n\sim 35$ .
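The permanent-based probabilities entering the Bayesian test can be illustrated with a short sketch. It assumes collision-free input and output configurations (one photon per listed mode) and the convention that `unitary[o][i]` is the amplitude from input mode `i` to output mode `o`; both are assumptions made here. The brute-force permanent is a stand-in that is only usable for small $n$; faster formulas such as Ryser's still scale exponentially.

```python
from itertools import permutations


def permanent(matrix):
    """Brute-force permanent of a square matrix (factorial cost)."""
    n = len(matrix)
    total = 0
    for perm in permutations(range(n)):
        prod = 1
        for row, col in enumerate(perm):
            prod *= matrix[row][col]
        total += prod
    return total


def scattering_submatrix(unitary, inputs, outputs):
    """n x n block selected by the occupied input and output modes."""
    return [[unitary[o][i] for i in inputs] for o in outputs]


def prob_indistinguishable(unitary, inputs, outputs):
    """p_Q: squared modulus of the permanent of the scattering submatrix."""
    return abs(permanent(scattering_submatrix(unitary, inputs, outputs))) ** 2


def prob_distinguishable(unitary, inputs, outputs):
    """Distinguishable photons: permanent of the squared moduli."""
    sub = scattering_submatrix(unitary, inputs, outputs)
    return permanent([[abs(x) ** 2 for x in row] for row in sub])
```

For a balanced beam splitter with one photon per input, the coincidence event has probability 0 for indistinguishable photons (the Hong-Ou-Mandel effect) and 1/2 for distinguishable ones, so a single coincidence already strongly disfavours $H_{Q}$.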

III.2 Statistical benchmark for large-scale experiments

In the previous section we described how the Bayesian test is effective in validating small- and mid-scale experiments with very few measurement events. However, the evaluation of permanents hinders its application for large $n$ , be it due to too large scattering matrices or to the need for speed in real-time evaluations. To overcome this limitation, further validation protocols have been proposed in the last few years, to find a convenient compromise between predictive power and physical resources. All these approaches have their own strengths and limitations, and tackle the problem from different angles Walschaers19 , e.g. using suppression laws Tichy10 ; Tichy14 ; Crespi15 ; Dittel17 ; Dittel18 , machine learning Agresti19 ; Flamini19tsne or statistical properties related to multi-particle interference Walschaers16 . In this section we will focus on the latter protocol, which arguably represents the most promising solution for the reasons we outline below.

Statistical benchmark with finite sample size. Validation based on the statistical benchmark ( $\mathcal{V}_{S}$ ) looks at statistical features of the C-dataset, the set of two-mode correlators

$$C_{ij}=\langle\hat{n}_{i}\hat{n}_{j}\rangle-\langle\hat{n}_{i}\rangle\langle\hat{n}_{j}\rangle,\tag{2}$$

where $(i,j)$ are distinct output ports and $\hat{n}_{i}$ is the bosonic number operator. Two statistical features that are effective to discriminate states with indistinguishable and distinguishable photons are its normalized mean NM (the mean divided by $n/m^{2}$ ) and its coefficient of variation CV (the standard deviation divided by the mean). For any unitary transformation and input state we can retrieve a point in the plane (NM, CV), where alternative models tend to cluster in separate clouds located via random matrix theory (Fig. 3a) Walschaers16 . Validation based on $\mathcal{V}_{S}$ would then consist in (i) collecting a suitable number $\mathcal{S}$ of events, (ii) evaluating the experimental point (NM, CV) associated to the $C_{ij}$ and (iii) identifying the cluster that the point is assigned to. For $\mathcal{S}$ sufficiently large, the point will be attributable with large confidence to only one of the models, thus ruling out the others (Fig. 3b).
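Steps (i) and (ii) can be sketched as follows, assuming each measured event is given as a list of photon numbers per output mode. The function names are hypothetical, and the use of the population standard deviation (rather than the sample one) is an assumption of this sketch.

```python
import math
from itertools import combinations


def c_dataset(samples):
    """Estimate C_ij = <n_i n_j> - <n_i><n_j> for all pairs i < j.

    samples: list of events, each a list of m photon numbers.
    """
    size = len(samples)
    m = len(samples[0])
    mean_n = [sum(ev[i] for ev in samples) / size for i in range(m)]
    correlators = []
    for i, j in combinations(range(m), 2):
        mean_nn = sum(ev[i] * ev[j] for ev in samples) / size
        correlators.append(mean_nn - mean_n[i] * mean_n[j])
    return correlators


def nm_cv(samples, n):
    """Return (NM, CV) of the C-dataset for n photons.

    NM is the mean divided by n / m^2; CV is the standard deviation
    divided by the mean (undefined if the mean vanishes).
    """
    m = len(samples[0])
    c = c_dataset(samples)
    mean = sum(c) / len(c)
    std = math.sqrt(sum((x - mean) ** 2 for x in c) / len(c))
    return mean / (n / m ** 2), std / mean
```

The resulting pair (NM, CV) is the experimental point to be compared with the clouds predicted for each model.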

$\mathcal{V}_{S}$ represents the state of the art for validation protocols that do not require the evaluation of permanents. Indeed, this approach has several advantages Giordani18 : ( $a$ ) it is computationally efficient (one only needs to compute two-point correlators), ( $b$ ) it can reveal deviations from the expected behaviour (manifest in the NM-CV plane), ( $c$ ) it makes more reliable predictions for larger $n$ (clouds become more separate), ( $d$ ) it is sample-efficient (clouds separate relatively early, after few measurements events). However, despite points ( $c$ , $d$ ) above, in actual conditions the experimental point is not always easy to validate. In fact, as mentioned in point ( $b$ ), hardware imperfections and partial distinguishability make the point move away from the average route shown in Fig. 3a. These issues can be addressed and mitigated by numerically generating, for a fixed sample size $\mathcal{S}$ , clouds from unitary transformations that take these aspects into account. As suggested in Ref. Giordani18 , and more closely investigated in Fig. 3b,c, a convenient approach is to employ machine learning to assign experimental points to one of the two clouds, with a certain confidence level. Specifically, one can train a classifier with numerically generated data Aaronson14 ; Clifford18 for a certain ( $n$ , $m$ , $\mathcal{S}$ ), that can even include error models, and then deploy it for all applications in that regime. In this sense, $\mathcal{S}$ can be seen as the label of the model that can classify (validate) data for a given ( $n$ , $m$ ). This intuition can be extended to a classifier that is trained on data from multiple $\mathcal{S}$ (see Fig. 3c), which is likely more practical. For a fixed $\mathcal{S}$ , the computational resources to sample events from a distribution given by $n$ distinguishable (indistinguishable) photons scale polynomially Aaronson14 (exponentially Clifford18 ) in $n$ . 
However, once trained, this classifier can be considered as an off-the-shelf tool that is readily applicable to validate multi-photon interference with no additional computational overhead, which is ideal for large-size experiments. In Sec. V.2, we also discuss how such a classifier can even be combined with other protocols, which search the data for different distinctive structures, to boost its accuracy.
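The train-once, deploy-many idea can be sketched with a nearest-centroid rule in the (NM, CV) plane. This is a deliberately simple stand-in chosen here for illustration, not the classifier used in the paper, and the function names are hypothetical. In practice the training clouds would come from numerically generated data for the chosen ($n$, $m$, $\mathcal{S}$).

```python
import math


def centroid(points):
    """Mean position of a list of (NM, CV) points."""
    k = len(points)
    return (sum(p[0] for p in points) / k, sum(p[1] for p in points) / k)


def train(clouds):
    """clouds: dict mapping a model label to its list of (NM, CV) points."""
    return {label: centroid(points) for label, points in clouds.items()}


def classify(model, point):
    """Assign a point to the label of the closest cloud centroid."""
    return min(model, key=lambda label: math.dist(model[label], point))
```

Once `train` has been run, `classify` costs only one distance evaluation per model, which reflects the off-the-shelf character described above.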

Finite-size effects in validation protocols. So far, we qualitatively discussed the role of a limited sample size for the validation of multi-photon quantum interference. To provide a more quantitative analysis of finite-size effects for the task of validation, and in particular for $\mathcal{V}_{S}$ , in the following we study the scaling of the parameters involved in the above validation protocol with $\mathcal{S}$ . The goal of this section is to elaborate on a standard test which should be implemented in all validation protocols, to guarantee their experimental feasibility.

Let us start by considering a fixed unitary circuit $U$ , for which we calculate the correlators $C_{ij}$ from Eq. (2). Such evaluation in principle assumes the possibility to collect an arbitrary number of measurement events. In practical applications, however, sample sizes will always be limited. Hence, finite-size effects play a role in the estimation of the above correlators. According to the central limit theorem, the correlator retrieved from the experimental data can be represented as $\widetilde{C}_{ij}=C_{ij}+X_{ij}$ , where $X_{ij}$ is a random number normally distributed with zero mean and variance $\sigma_{ij}^{2}\,\mathcal{S}^{-1}$ . The $\sigma_{ij}^{2}$ depend on the unitary evolution $U$ and should either be evaluated from the data or be estimated using random matrix theory. Now, to infer, from noisy C-datasets Walschaers16 , the centre of the cloud of points in the NM-CV plane, we need to average not only over the Haar measure, but also over $X_{ij}$ .

Consequently, we have to assess the impact of finite-size effects on the estimate of the moments (NM, CV). First, since the noise induced by the finite sample size averages out, namely $\mathds{E}_{X}(\widetilde{C}_{ij})={C}_{ij}$ , we have that $\widetilde{NM}=NM$ . The estimation of CV is a bit more subtle because we need to evaluate the mean of ${\widetilde{C}_{ij}}^{2}$ . Since $\mathds{E}_{X}({\widetilde{C}_{ij}}^{2})={C^{2}_{ij}}+\sigma_{ij}^{2}\,\mathcal{S}^{-1}$ , then

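$$\mathds{E}_{U}[\mathds{E}_{X}({\widetilde{C}_{ij}}^{2})]=\mathds{E}_{U}[C_{ij}^{2}]+\frac{\mathds{E}_{U}[\sigma_{ij}^{2}]}{\mathcal{S}}\tag{3}$$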

and, hence, $|\widetilde{CV}|>|CV|$ . Note that $\mathds{E}_{U}[\mathds{E}_{X}({\widetilde{C}^{2}_{ij}})]$ and $\mathds{E}_{X}[\mathds{E}_{U}({\widetilde{C}_{ij}}^{2})]$ cannot be easily compared, since the latter involves averaging the distribution of $X_{ij}$ over the unitary group. However, using the properties of the normal distribution under convex combinations, we can deduce that both orders of averaging yield approximately the same result (and the same scaling in $\mathcal{S}$ ), in particular once $\mathcal{S}$ is large and the distribution is concentrated close to its mean. Numerical simulations for $3\leq n\leq 15$ and $m=n^{2}$ indeed confirm its validity (Fig. 4). Specifically, we observe that, upon averaging over different Haar-random unitaries with $\mathcal{S}$ events per realization, the deviation of the experimentally-measured ${\widetilde{C}_{ij}}^{2}$ from the analytically predicted values decreases as fast as $1/\mathcal{S}$ . Hence, their estimation from finite-size data sets shows no exponential overhead that would hinder a practical application of the validation protocol.
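The finite-size shift of the second moment used in this argument can be checked in one line. The sketch below assumes only what is stated above, namely that $\widetilde{C}_{ij}=C_{ij}+X_{ij}$ with $X_{ij}$ of zero mean and variance $\sigma_{ij}^{2}\,\mathcal{S}^{-1}$:

```latex
\mathds{E}_{X}\big({\widetilde{C}_{ij}}^{2}\big)
  = \mathds{E}_{X}\big[(C_{ij}+X_{ij})^{2}\big]
  = C_{ij}^{2} + 2\,C_{ij}\,\mathds{E}_{X}(X_{ij}) + \mathds{E}_{X}\big(X_{ij}^{2}\big)
  = C_{ij}^{2} + \frac{\sigma_{ij}^{2}}{\mathcal{S}} .
```

The cross term vanishes because the noise has zero mean, so the only bias is the variance term, which decays as $1/\mathcal{S}$ in agreement with the scaling observed numerically.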

IV Discussion

Validation of multi-photon quantum interference is expected to play an increasing role as the dimensionality of photonic applications increases, both in the number of photons and modes. To this end, and as notably emphasized by the race towards quantum advantage via Boson Sampling, it is necessary to define a set of requirements for a validation protocol to be meaningful. Ultimately, these requirements should allow one to establish strong experimental evidence of quantum advantage that is accepted by the community within a jointly agreed framework.

In the present work, we implement such a program and describe a set of critical points that experimenters will need to agree upon in order to validate the operation of a quantum device. With the goal of building a solid framework for validation, we then discuss a practical approach that applies the most suitable state-of-the-art protocols in realistic scenarios. We report numerical analyses on the application of two key validation protocols, the Bayesian hypothesis testing and the statistical benchmark, with finite-size data, providing compelling evidence in support of this approach.

A clear and illustrative example for the above considerations is provided in Section V.1, where we numerically studied the competition between a recent classical simulation algorithm and the statistical benchmark, respectively to counterfeit and to validate Boson Sampling, while they process an increasing number of measured output events. The analysis quantifies the general intuition that there must be a trade-off between speed and quality in approximate simulations of Boson Sampling. We also provide a formal analysis on the performance of the validation protocol with finite-size samples, showing that the estimation of relevant quantities converges fast to the predicted values. We expect that similar features will be crucial for larger-scale demonstrations and, as such, a key prerequisite to be investigated in all validation protocols.

Finally, in Section V.2 we introduce a novel approach to validation that can bring together the strengths of multiple protocols. This approach uses a meta-algorithm (AdaBoost) to combine protocols based on machine learning into a single validator with boosted accuracy. This strategy becomes more advantageous for a larger number of such protocols with comparable performance, as well as with very noisy data.

Acknowledgements

This work was supported by the ERC Advanced Grant CAPABLE (Composite integrated photonic platform by femtosecond laser micromachining; grant agreement no. 742745); by the QuantERA ERA-NET Cofund in Quantum Technologies Project HiPhoP (High dimensional quantum Photonic Platform, Project ID 731473) and by project PRIN 2017 “Taming complexity via QUantum Strategies a Hybrid Integrated Photonic approach” (QUSHIP) Id. 2017SRNBRK. A.B. acknowledges support by the Georg H. Endress foundation. M.W. is funded through Research Fellowship WA 3969/2-1 of the German Research Foundation (DFG). This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No 801110 and the Austrian Federal Ministry of Education, Science and Research (BMBWF). It reflects only the author’s view and the Agency is not responsible for any use that may be made of the information it contains.

V Appendix

V.1 Classical simulation and the role of sample size

To shed some light on the critical aspects of validation, and as a benchmark of the state of the art in this context, we now provide a qualitative analysis inspired by the Metropolized independent sampling ( $\mathcal{M}$ ), a recent algorithm to classically simulate Boson Sampling Neville17 . The idea behind $\mathcal{M}$ is reminiscent of the Mean-Field sampler ( $\mathcal{MF}$ ) Tichy14 , an adversarial classical algorithm that was capable of hacking one of the first validation protocols Carolan14 using limited classical resources. In the race towards quantum computational supremacy, the introduction of $\mathcal{MF}$ has prompted the development of more sophisticated techniques to tackle classical simulations. For instance, besides the Bayesian test (see inset in Fig. 2), the statistical benchmark is also highly effective to validate Boson Sampling against $\mathcal{MF}$ (see Fig. 5a). For our purposes, the key difference between the two algorithms is that, while for $\mathcal{MF}$ the quality of the simulation does not really change over time, $\mathcal{M}$ samples from a distribution that gets closer to $\mathcal{Q}$ the more events are evaluated (i.e. for a larger $\mathcal{S}$ ).

The goal of $\mathcal{M}$ is to generate a sequence of $n$ -photon events $\{e_{i}\}$ from a Markov chain that mimics the statistics of an ideal Boson Sampling experiment. Given a sampled event $e_{i}$ , a new candidate event $e_{i+1}$ is efficiently picked according to the probability distribution of distinguishable photons $p_{D}$ , and accepted with probability

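$$P(e_{i}\rightarrow e_{i+1})=\min\left(1,\frac{p_{I}(e_{i+1})\,p_{D}(e_{i})}{p_{I}(e_{i})\,p_{D}(e_{i+1})}\right),$$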

where $p_{I}(e_{i})$ is the output probability corresponding to event $e_{i}$ for indistinguishable photons. While the approach remains computationally hard, since it requires the evaluation of permanents Scheel08 ; Valiant79 , the advantage is that only a limited number of them needs to be evaluated to output a new event, rather than the full distribution as in a brute-force approach. Ultimately, after a certain number of steps in the chain, $\mathcal{M}$ is guaranteed to sample close to the ideal Boson Sampling distribution $p_{I}$ Brooks98 . Hence, the sample size $\mathcal{S}$ not only plays a key role in improving the reliability of validation protocols, as shown in Sec. III, but can also be crucial for the quality of the outcome of a classical simulation. This is a relevant point to keep in mind, even though $\mathcal{M}$ has since been surpassed by an algorithm that is both provably faster and exact Clifford18 . Indeed, novel classical algorithms might be developed in the future Liu19 that depend on $\mathcal{S}$ more efficiently.
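As an illustration of the chain described above, the following is a minimal sketch of a Metropolised independence sampler on a toy discrete state space. It is not the implementation of Ref. Neville17: the lists `p_target` and `p_proposal` are hypothetical stand-ins for $p_{I}$ and $p_{D}$ (no permanents are evaluated), the acceptance rule is the standard ratio $p_{I}(e')\,p_{D}(e)/[p_{I}(e)\,p_{D}(e')]$ for an independent proposal, and the `burn_in` and `thinning` arguments anticipate the hyper-parameters $B_{n}$ and $T_{n}$ discussed in the next paragraph.

```python
import random


def metropolised_independence_sampler(p_target, p_proposal, n_samples,
                                      burn_in=100, thinning=10, seed=0):
    """Return n_samples states whose statistics approach p_target.

    p_target   -- target weights (toy stand-in for p_I), may be unnormalised
    p_proposal -- proposal probabilities (toy stand-in for p_D), all > 0
    """
    rng = random.Random(seed)
    states = list(range(len(p_proposal)))

    def propose():
        # Independent proposal: it ignores the current state of the chain.
        return rng.choices(states, weights=p_proposal, k=1)[0]

    current = propose()
    samples = []
    # Total chain length B + S * T, as in the time estimate of the text.
    total_steps = burn_in + n_samples * thinning
    for step in range(1, total_steps + 1):
        candidate = propose()
        # Acceptance ratio p_I(e') p_D(e) / (p_I(e) p_D(e')).
        num = p_target[candidate] * p_proposal[current]
        den = p_target[current] * p_proposal[candidate]
        if den == 0 or rng.random() < min(1.0, num / den):
            current = candidate
        # After the burn-in, keep one state every `thinning` steps.
        if step > burn_in and (step - burn_in) % thinning == 0:
            samples.append(current)
    return samples


# Toy example: four states, uniform proposal, non-uniform target.
p_I = [0.5, 0.3, 0.15, 0.05]
p_D = [0.25, 0.25, 0.25, 0.25]
samples = metropolised_independence_sampler(p_I, p_D, n_samples=5000)
freqs = [samples.count(s) / len(samples) for s in range(len(p_I))]
```

In this toy setting `freqs` should approach `p_I` as the number of retained samples grows; in an actual simulation each evaluation of the target weight would require a permanent, which is why the number of chain steps sets the computational cost.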

The aim of our present analysis is to investigate the role of the sample size in a validation of the samples generated by $\mathcal{M}$ , via $\mathcal{V}_{S}$ . Indeed, a crucial issue in a hypothetical competition between $\mathcal{M}$ and $\mathcal{V}_{S}$ concerns the number of events $\mathcal{S}$ available to accept or reject a data set. On the one hand, larger sets give $\mathcal{V}_{S}$ more information to identify fingerprints of quantum interference; on the other hand, $\mathcal{M}$ approaches the target distribution $p_{I}$ as more steps are made along the chain. However, in order to output a large number of events in time $\mathcal{T}$ , $\mathcal{M}$ requires physical and computational resources that set a limit to the tractable dimension of the problem. We are then interested in the intermediate regime, the one relevant for experiments, to determine whether convergence is reached fast enough to mislead $\mathcal{V}_{S}$ . In the specific case of $\mathcal{M}$ , we then need to look at the scaling in $n$ of its hyper-parameters: burn-in (the number $B_{n}$ of events to be discarded at the beginning of the chain) and thinning (the number $T_{n}$ of steps to skip to reduce correlations between successive events). Eventually, the time required to classically simulate Boson Sampling will scale as $\mathcal{T}=\tau_{p}\left(B_{n}+\mathcal{S}\;T_{n}\right)$ , where $\tau_{p}$ is the time to evaluate a single scattering amplitude according to Eq. (4). Considering the estimate provided by the supercomputer Tianhe-2 Wu16 , and for fixed ( $\mathcal{T}$ , $\mathcal{S}$ ), we find the constraint $B_{n}=\alpha\,n^{-2}\,2^{-n}\;\mathcal{T}-\mathcal{S}\;T_{n}$ , which corresponds to $\tau_{p}=n^{2}\,2^{n}/\alpha$ , where $\alpha\sim c^{0.8782}\,10^{11}$ and $c$ is the number of processing nodes. If we assume $T_{n}=100$ Neville17 for all $n$ and $\mathcal{V}$ , we get an estimate of the maximum $B_{n}$ allowed by ( $\mathcal{T}$ , $\mathcal{S}$ ). The key issue is that this estimate does not guarantee that $\mathcal{M}$ reaches the target distribution fast enough, since the maximum allowed $B_{n}$ decreases exponentially with $n$ . Moreover, the minimum $B_{n}$ required for convergence is expected to increase with $n$ , since on average the Markov chain needs to explore more states before picking a good one.
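The constraint on the burn-in can be evaluated directly. The short sketch below implements $B_{n}=\alpha\,n^{-2}\,2^{-n}\,\mathcal{T}-\mathcal{S}\,T_{n}$ with $\alpha=c^{0.8782}\,10^{11}$ as quoted in the text; the function name, the argument names and the assumption that $\mathcal{T}$ is expressed in seconds are ours, since the text does not state the unit explicitly.

```python
def max_burn_in(n, total_time, sample_size, thinning=100, nodes=1):
    """Largest burn-in B_n compatible with a budget (T, S).

    n           -- number of photons
    total_time  -- time budget T (assumed here to be in seconds)
    sample_size -- number of output events S
    thinning    -- thinning T_n (100 for all n, as assumed in the text)
    nodes       -- number of processing nodes c
    """
    alpha = nodes ** 0.8782 * 1e11
    # Number of scattering amplitudes that fit in the time budget, T / tau_p.
    amplitudes_in_budget = alpha * total_time / (n ** 2 * 2 ** n)
    return amplitudes_in_budget - sample_size * thinning


# The allowed burn-in shrinks quickly with n; a negative value means that
# the budget (T, S) cannot even accommodate the thinned chain.
for n in (10, 20, 30):
    print(n, max_burn_in(n, total_time=3600.0, sample_size=1000))
```

A negative return value signals that, for the chosen ( $\mathcal{T}$ , $\mathcal{S}$ ), no burn-in at all is affordable, which is the regime where $\mathcal{M}$ cannot be expected to have converged.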

To better clarify the above considerations, we simulate a competition between $\mathcal{M}$ and $\mathcal{V}_{S}$ for $n=10$ photons in $m=100$ modes in Fig. 6. Data for distinguishable and indistinguishable photons were generated with exact algorithms, respectively by Aaronson and Arkhipov Aaronson14 and by Clifford and Clifford Clifford18 . The analysis proceeds through five main steps: 1) randomly pick a unitary transformation $\mathcal{U}$ according to the Haar measure; 2) simulate the generation of $\mathcal{S}$ $n$ -particle output events; 3) extract the $C$ -dataset from these $\mathcal{S}$ events; 4) evaluate the corresponding (NM, CV) point and plot it in Fig. 6a; 5) repeat steps 1-4 200 times, to simulate as many different experiments. Upon completion, we evaluate the average and variance of $P_{\mathcal{M}}$ and plot them in Fig. 6b. With this analysis, we get a quantitative intuition on how the confidence of a validation changes with $\mathcal{S}$ , as does the quality of the classical simulation. Similar behaviour is also found for other choices of $n$ and $m$ . In particular, we observe how a stronger thinning (up to $T_{10}=100$ , as in Ref. Neville17 ) is reflected in the quality of the simulation, where $\mathcal{M}$ behaves very similarly to the ideal Boson Sampler for small as well as for large sample sizes. Conversely, a faster $\mathcal{M}$ that trades quality for speed by computing fewer permanents ( $T_{10}=10,30$ ) is more easily detectable by $\mathcal{V}_{S}$ . Constraints due to a speed vs. quality compromise (Fig. 3b,c,d) define a generic scenario for a classical simulation which is run with a specific choice of $\mathcal{T}$ and $\mathcal{S}$ .
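Steps 1-5 can be sketched for the one hypothesis that is cheap to sample, namely distinguishable photons, where each photon exits independently with probability $|U_{jk}|^{2}$. The code below is only an illustration under stated assumptions: the sizes ( $n=3$ , $m=8$ , $\mathcal{S}=500$ , 5 repetitions) are reduced for speed, the $C$ -dataset is taken to be the set of two-mode correlators $C_{ij}=\langle n_{i}n_{j}\rangle-\langle n_{i}\rangle\langle n_{j}\rangle$ , and the conventions for NM (mean rescaled by $m^{2}/n$ ) and CV (standard deviation over mean) are assumptions that should be checked against the definitions used for $\mathcal{V}_{S}$ . All function names are hypothetical.

```python
import random
import statistics


def haar_unitary_columns(m, rng):
    """Columns of a Haar-random m x m unitary (Gram-Schmidt on Gaussian vectors)."""
    cols = []
    for _ in range(m):
        v = [complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(m)]
        for u in cols:  # remove the components along the previous columns
            overlap = sum(a.conjugate() * b for a, b in zip(u, v))
            v = [b - overlap * a for a, b in zip(u, v)]
        norm = sum(abs(b) ** 2 for b in v) ** 0.5
        cols.append([b / norm for b in v])
    return cols  # cols[k][j]: amplitude from input mode k to output mode j


def sample_distinguishable(cols, inputs, n_events, rng):
    """Each photon leaves input k through output j with probability |U_jk|^2."""
    m = len(cols)
    weights = {src: [abs(a) ** 2 for a in cols[src]] for src in inputs}
    events = []
    for _ in range(n_events):
        counts = [0] * m  # mode occupation numbers of one output event
        for src in inputs:
            counts[rng.choices(range(m), weights=weights[src], k=1)[0]] += 1
        events.append(counts)
    return events


def c_dataset(events):
    """Two-mode correlators C_ij = <n_i n_j> - <n_i><n_j> for all i < j."""
    m, s = len(events[0]), len(events)
    mean = [sum(e[i] for e in events) / s for i in range(m)]
    return [sum(e[i] * e[j] for e in events) / s - mean[i] * mean[j]
            for i in range(m) for j in range(i + 1, m)]


def nm_cv(data, n, m):
    """Normalised mean and coefficient of variation (conventions assumed)."""
    mu = statistics.fmean(data)
    return mu * m ** 2 / n, statistics.pstdev(data) / mu


rng = random.Random(1)
n, m, S = 3, 8, 500
points = []
for _ in range(5):  # step 5: the text repeats this 200 times
    cols = haar_unitary_columns(m, rng)                       # step 1
    events = sample_distinguishable(cols, range(n), S, rng)   # step 2
    points.append(nm_cv(c_dataset(events), n, m))             # steps 3-4
```

Each entry of `points` is one (NM, CV) pair, i.e. one simulated experiment. Replacing `sample_distinguishable` with an exact sampler for indistinguishable photons, or with $\mathcal{M}$ , would give the other clouds of points, at the cost of evaluating permanents.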

V.2 Combining and boosting validation protocols

So far, all validation protocols have always been applied separately and independently. Certainly, this fact shows the multifaceted nature of this line of research, where effective solutions have been developed using very different strategies. Yet, it also reflects its somewhat fragmented condition, since each protocol does not benefit from potential insights provided by the others. This limitation becomes relevant in realistic scenarios with noise and finite data sets, since each validation protocol suits some task better than the others, with different degrees of sample efficiency and resilience.

In this section, we present a novel, synergistic approach to validation, which aims at combining the strengths of these protocols to form a joint, enhanced validator. Specifically, we focus on validation protocols that make use of machine learning, and propose to combine them with a meta-algorithm (AdaBoost Freund97 ) that attempts an adaptive boosting of their individual performance. The output of AdaBoost is a weighted sum of the predictions of these learning algorithms (‘weak learners’), which are asked, sequentially, to pay more attention to the instances that were incorrectly classified by the previous learners. As long as the performance of each learner is slightly better than chance, the classifier resulting from AdaBoost provably converges to a better validation protocol.
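The reweighting mechanism can be made concrete with a minimal sketch of discrete AdaBoost. This is not the pipeline used for $\mathcal{V}_{S}$ and $\mathcal{V}_{V}$ : as a simplifying assumption, the weak learners here are a fixed pool of threshold classifiers on a one-dimensional toy data set, and at each round the algorithm selects the pool member with the lowest weighted error instead of retraining a learner on the reweighted data. All names are hypothetical.

```python
import math


def adaboost(classifiers, X, y, rounds=10):
    """Discrete AdaBoost over a fixed pool of classifiers returning +1 or -1."""
    n = len(X)
    w = [1.0 / n] * n  # instance weights, uniform at the start
    preds = [[clf(x) for x in X] for clf in classifiers]
    ensemble = []      # pairs (alpha, index of the selected classifier)
    for _ in range(rounds):
        # Weighted error of every classifier under the current weights.
        errors = [sum(wi for wi, p, yi in zip(w, pr, y) if p != yi)
                  for pr in preds]
        best = min(range(len(classifiers)), key=errors.__getitem__)
        if errors[best] >= 0.5:
            break  # no classifier is better than chance any more
        eps = max(errors[best], 1e-12)
        alpha = 0.5 * math.log((1 - eps) / eps)
        ensemble.append((alpha, best))
        # Increase the weight of misclassified instances, then renormalise.
        w = [wi * math.exp(-alpha * yi * p)
             for wi, yi, p in zip(w, y, preds[best])]
        total = sum(w)
        w = [wi / total for wi in w]

    def predict(x):
        score = sum(a * classifiers[k](x) for a, k in ensemble)
        return 1 if score >= 0 else -1

    return predict


def stump(threshold, left_label):
    """Weak learner: left_label below the threshold, -left_label above it."""
    return lambda x: left_label if x < threshold else -left_label


X = list(range(10))
y = [1, 1, 1, -1, -1, -1, 1, 1, 1, -1]
pool = [stump(t + 0.5, s) for t in range(9) for s in (1, -1)]

boosted = adaboost(pool, X, y, rounds=3)
boosted_acc = sum(boosted(x) == yi for x, yi in zip(X, y)) / len(X)
best_single = max(sum(c(x) == yi for x, yi in zip(X, y)) / len(X) for c in pool)
```

On this toy set no single threshold classifier labels more than 7 of the 10 points correctly, whereas the weighted vote of three of them classifies all points correctly, which is the kind of gain one hopes for when the individual validators are genuinely weak.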

We numerically test this approach by combining two validation protocols that employ machine learning: the statistical benchmark $\mathcal{V}_{S}$ Walschaers16 (equipped with a simple neural network classifier trained on numerically generated data, as in Fig. 3b,c) and the visual assessment $\mathcal{V}_{V}$ Flamini19tsne , which uses dimensionality reduction algorithms and convolutional neural networks. Here we do not consider the Bayesian approach, since, in its current formulation, it does not fit the framework of machine learning. A schematic description of our proof-of-concept analysis, which we carry out for $n=10$ and $m=100$ , is shown in Fig. 7.

Since $\mathcal{V}_{S}$ requires fewer events than $\mathcal{V}_{V}$ to validate ideal, noiseless experiments Aaronson14 ; Clifford18 , to perform this test we trained $\mathcal{V}_{S}$ on data sets with a tunable amount of noise, purposely assembled to be hard to validate. To this end, samples ( $\mathcal{S}=2\times 10^{3}$ ) for 500 Haar-random unitary transformations were constructed by sampling with a certain probability $p$ (or $1-p$ ) from a Boson Sampler with fully indistinguishable (or distinguishable) photons. This probability $p$ was then varied in time, to simulate, for instance, a periodic drift in the synchronization of the input photons. As expected with these settings, we find that AdaBoost maintains the original accuracy of $\mathcal{V}_{S}$ and $\mathcal{V}_{V}$ when applied to, respectively, batches of $\mathcal{V}_{S}$ and $\mathcal{V}_{V}$ that are already highly accurate. This is mainly due to the complexity of these classifiers, which are already strong learners and, hence, hard to enhance by AdaBoost. Analogous results are found with mixed batches of $\mathcal{V}_{S}$ and $\mathcal{V}_{V}$ , for which AdaBoost returns a joint classifier that practically focuses on the most accurate one in the set. A different result is obtained, instead, by combining several weak $\mathcal{V}_{V}$ , for which we purposely spoil the training of the convolutional neural network (accuracy $A\sim 51\%$ instead of $A\sim 98\%$ ) by reducing the number of training epochs. In this case, AdaBoost does in fact enhance the accuracy of $\mathcal{V}_{V}$ up to $A\sim 57\%$ .
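The construction of the noisy data sets can be sketched as a time-dependent mixture of two samplers. The cosine form of $p(t)$ , the period and the placeholder samplers below are assumptions made purely for illustration: the text only states that $p$ varies in time, for instance periodically, and an actual test would plug in exact samplers for indistinguishable and distinguishable photons.

```python
import math
import random


def drifting_mixture(sample_indist, sample_dist, n_events, period, seed=0):
    """Draw events from two samplers with a periodically drifting weight p(t).

    sample_indist, sample_dist -- callables taking an RNG, returning one event
    period                     -- number of events per drift cycle (assumed)
    """
    rng = random.Random(seed)
    events = []
    for t in range(n_events):
        # Assumed drift model: p oscillates between 1 and 0 with the period.
        p = 0.5 * (1 + math.cos(2 * math.pi * t / period))
        source = sample_indist if rng.random() < p else sample_dist
        events.append(source(rng))
    return events


# Placeholder samplers that only tag the origin of each event.
events = drifting_mixture(lambda rng: "I", lambda rng: "D",
                          n_events=2000, period=200)
fraction_indist = events.count("I") / len(events)
```

Averaged over an integer number of cycles, this drift model yields about half of the events from each sampler, while the local composition of the data set changes continuously, which is what makes such data harder to validate than a sample with fixed $p$ .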

In the future, we expect that this approach will prove useful in non-ideal conditions with experimental noise, where validation protocols do not operate in the ideal settings for which they were conceived. Furthermore, the above analyses can show larger boosts if applied to actual experiments that involve structured (non-Haar-random) interferometers, for which protocols such as $\mathcal{V}_{S}$ and $\mathcal{V}_{V}$ can have lower accuracies and different behaviors. Finally, still in non-ideal settings, more favorable boosts can be obtained if new validation protocols are developed that are as sample-efficient as $\mathcal{V}_{S}$ .

References

(1)

Harrow A W and Montanaro A 2017 Nature 549 203–209

(2)

Arute F, et al. 2019 Nature 574 505–510

(3)

Aaronson S and Arkhipov A 2011 The computational complexity of linear optics Proceedings of the 43rd annual ACM symposium on Theory of Computing (ACM Press) pp 333–342

(4)

Terhal B M and DiVincenzo D P 2004 Quantum Inf. Comput. 4 134–145

(5)

Bremner M J, Jozsa R and Shepherd D J 2010 Proc. R. Soc. Lond. A 467 459–472

(6)

Boixo S, Isakov S V, Smelyanskiy V N, Babbush R, Ding N, Jiang Z, Bremner M J, Martinis J M and Neven H 2018 Nat. Phys. 14 595–600

(7)

Inagaki T, Haribara Y, Igarashi K, Sonobe T, Tamate S, Honjo T, Marandi A, McMahon P L, Umeki T, Enbutsu K, Tadanaga O, Takenouchi H, Aihara K, Kawarabayashi K I, Inoue K, Utsunomiya S and Takesue H 2016 Science aah4243

(8)

McMahon P L, Marandi A, Haribara Y, Hamerly R, Langrock C, Tamate S, Inagaki T, Takesue H, Utsunomiya S, Aihara K, Byer R L, Fejer M M, Mabuchi H and Yamamoto Y 2016 Science aah5178

(9)

Goto H 2016 Sci. Rep. 6 21686

(10)

Lechner W, Hauke P and Zoller P 2015 Sci. Adv. 1 e1500838

(11)

Johnson M W, Amin M H S, Gildert S, Lanting T, Hamze F, Dickson N, Harris R, Berkley A J, Johansson J, Bunyk P, Chapple E M, Enderud C, Hilton J P, Karimi K, Ladizinsky E, Ladizinsky N, Oh T, Perminov I, Rich C, Thom M C, Tolkacheva E, Truncik C J S, Uchaikin S, Wang J, Wilson B and Rose G 2011 Nature 473 194–198

(12)

Puri S, Andersen C K, Grimsmo A L and Blais A 2017 Nat. Commun. 8 15785

(13)

Fitzsimons J F and Kashefi E 2017 Phys. Rev. A 96 012303

(14)

Eisert J, Hangleiter D, Walk N, Roth I, Markham D, Parekh R, Chabaud U and Kashefi E. preprint at arXiv:1910.06343v2

(15)

Flamini F, Spagnolo N and Sciarrino F 2018 Rep. Prog. Phys. 82 016001

(16)

Walschaers M 2020 J. Phys. B 53 4

(17)

Shin S W, Smith G, Smolin J A and Vazirani U 2014 How ‘quantum’ is the D-Wave machine? preprint at arXiv:1401.7087

(18)

Brod D J, Galvão E F, Crespi A, Osellame R, Spagnolo N, Sciarrino F 2019 Advanced Photonics 1 034001

(19)

Mayer K, Tichy M C, Mintert F, Konrad T and Buchleitner A 2011 Phys. Rev. A 83 062307

(20)

Aaronson S and Arkhipov A 2014 Quantum Information & Computation 14 1383–1423

(21)

Liu K, Lund A P, Gu Y J and Ralph T C 2016 J. Opt. Soc. Am. B 33 1835–1841

(22)

Wang S T and Duan L M Certification of boson sampling devices with coarse-grained measurements. preprint at arXiv:1601.02627

(23)

Shchesnovich V S 2016 Phys. Rev. Lett. 116 123601

(24)

Tichy M C, Tiersch M, De Melo F, Mintert F and Buchleitner A 2010 Phys. Rev. Lett. 104 220405

(25)

Tichy M C, Mayer K, Buchleitner A and Molmer K 2014 Phys. Rev. Lett. 113 020502

(26)

Dittel C, Keil R and Weihs G 2017 Quantum Sci. Technol. 2 1

(27)

Dittel C, Dufour G, Walschaers M, Weihs G, Buchleitner A and Keil R 2018 Phys. Rev. Lett. 120 240404

(28)

Crespi A 2015 Phys. Rev. A 91 013811

(29)

Flamini F, Spagnolo N and Sciarrino F 2019 Quantum Sci. Technol. 4 024008

(30)

Walschaers M, Kuipers J, Urbina J D, Mayer K, Tichy M C, Richter K and Buchleitner A 2016 New J. Phys. 18 032001

(31)

Bentivegna M, Spagnolo N, Vitelli C, Brod D J, Crespi A, Flamini F, Ramponi R, Mataloni P, Osellame R, Galvão E F and Sciarrino F 2014 Int. J. Quantum Inform. 12 1560028

(32)

Carolan J, Meinecke J D A, Shadbolt P J, Russell N J, Ismail N, Worhoff K, Rudolph T, Thompson M G, O’Brien J L, Matthews J C F and Laing A 2014 Nat. Photonics 8 621–626

(33)

Agresti I, Viggianiello N, Flamini F, Spagnolo N, Crespi A, Osellame R, Wiebe N and Sciarrino F 2019 Phys. Rev. X 9 011013

(34)

Spagnolo N, Vitelli C, Bentivegna M, Brod D J, Crespi A, Flamini F, Giacomini S, Milani G, Ramponi R, Mataloni P, Osellame R, Galvão E F and Sciarrino F 2014 Nat. Photonics 8 615–620

(35)

Viggianiello N, Flamini F, Innocenti L, Cozzolino D, Bentivegna M, Spagnolo N, Crespi A, Brod D J, Galvão E F, Osellame R and Sciarrino F 2018 New J. Phys. 20 033017

(36)

Carolan J, Harrold C, Sparrow C, Martin-Lopez E, Russell N J, Silverstone J W, Shadbolt P J, Matsuda N, Oguma M, Itoh M, Marshall G D, Thompson M G, Matthews J C F, Hashimoto T, O’Brien J L and Laing A 2015 Science 349 711

(37)

Crespi A, Osellame R, Ramponi R, Bentivegna M, Flamini F, Spagnolo N, Viggianiello N, Innocenti L, Mataloni P and Sciarrino F 2016 Nature Commun. 7 10469

(38)

Wang H, Li W, Jiang X, He Y M, Li Y H, Ding X, Chen M C, Qin J, Peng C Z, Schneider C, Kamp M, Zhang W J, Li H, You L X, Wang Z, Dowling J P, Höfling S, Lu C Y and Pan J W 2018 Phys. Rev. Lett. 120 230502

(39)

Giordani T, Flamini F, Pompili M, Viggianiello N, Spagnolo N, Crespi A, Osellame R, Wiebe N, Walschaers M, Buchleitner A and Sciarrino F 2018 Nat. Photonics 12 173–178

(40)

He Y, Ding X, Su Z E, Huang H L, Qin J, Wang C, Unsleber S, Chen C, Wang H, He Y M, Wang X L, Zhang W J, Chen S J, Schneider C, Kamp M, You L X, Wang Z, Höfling S, Lu C Y and Pan J W 2017 Phys. Rev. Lett. 118 190501

(41)

Wang H, He Y, Li Y H, Su Z E, Li B, Huang H L, Ding X, Chen M C, Liu C, Qin J, Li J P, He Y M, Schneider C, Kamp M, Peng C Z, Höfling S, Lu C Y and Pan J W 2017 Nat. Photonics 11 361–365

(42)

Wang X L, Chen L K, Li W, Huang H L, Liu C, Chen C, Luo Y, Su Z E, Wu D, Li Z D, Lu H, Hu Y, Jiang X, Peng C Z, Li L, Liu N L, Chen Y A, Lu C Y and Pan J W 2016 Phys. Rev. Lett. 117 21

(43)

Wang L J, Zou K H, Sun W, Mao Y, Zhu Y X, Yin H L, Chen Q, Zhao Y, Zhang F, Chen T Y and Pan J W 2017 Phys. Rev. A 95 012301

(44)

Viggianiello N, Flamini F, Bentivegna M, Spagnolo N, Crespi A, Brod D J, Galvão E F, Osellame R and Sciarrino F 2018 Sci. Bull. 63 22

(45)

Paesani S, Ding Y, Santagati R, Chakhmakhchyan L, Vigliar C, Rottwitt K, Oxenløwe F, Wang J, Thompson M and Laing A 2019 Nat. Phys. 15

(46)

Wang H, Qin J, Ding X, Chen M C, Chen S, You X, He Y M, Jiang X, Wang Z, You L, Renema J J, Höfling S, Lu C Y and Pan J W 2019 Phys. Rev. Lett. 123 250503

(47)

Neville A, Sparrow C, Clifford R, Johnston E, Birchall P M, Montanaro A and Laing A 2017 Nat. Phys. 13 1153–1157

(48)

Clifford P and Clifford R 2018 The classical complexity of boson sampling Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms SODA ’18 (Philadelphia, PA, USA: Society for Industrial and Applied Mathematics) pp 146–155

(49)

Wu J, Liu Y, Zhang B, Jin X, Wang Y, Wang H and Yang X 2018 Natl. Sci. Rev. 5 715–720

(50)

Hangleiter D, Kliesch M, Eisert J and Gogolin C 2019 Phys. Rev. Lett. 122 210502

(51)

Markov I L, Fatima A, Isakov S and Boixo S Quantum supremacy is both closer and farther than it appears. preprint at arXiv:1807.10749

(52)

Valiant L 1979 Theoretical Computer Science 8 189–201

(53)

Liu Y, Xiong M, Wu C, Wang D, Liu Y, Ding J, Huang A, Fu X, Qiang X, Xu P, Deng M, Yang X and Wu J. preprint at arXiv:1907.08077

(54)

Lowd D and Meek C 2005 Adversarial learning KDD (ACM - Association for Computing Machinery) pp 641–647

(55)

Brugger J 2018 Quantum annealing and disorder: A statistical analysis Master’s thesis Albert-Ludwigs-Universität Freiburg

(56)

Given that the validators have a non-zero probability of false positives or negatives, the criteria for validation need to be generalized. In this context, a device is said to be validated if the probability that a valid device passes (or fails) the tests is greater by a fixed fraction than the probabilities that would be predicted by all available alternative hypotheses.

(57)

Scheel S 2008 Acta Physica Slovaca 58 675

(58)

Brooks S P and Roberts G O 1998 Stat. Comput. 8 319–335

(59)

Freund Y and Schapire R E 1997 J. Comput. Syst. Sci. 55(1) 119–139

(60)

van der Maaten L J P and Hinton G E 2008 J. Mach. Learn. Res. 9 2579–2605
