A New Proof of Nonsignalling Multiprover Parallel Repetition Theorem
Himanshu Tyagi, Shun Watanabe

TL;DR
This paper provides a new information-theoretic proof of the nonsignalling multiprover parallel repetition theorem, enhancing understanding of hardness of approximation in computational complexity.
Contribution
It introduces a novel proof technique based on a recent method for strong converse results, replacing de Finetti decompositions with a change of measure approach.
Findings
New proof simplifies understanding of the theorem
Extends techniques from multiuser information theory
Potentially impacts hardness of approximation results
Abstract
We present an information theoretic proof of the nonsignalling multiprover parallel repetition theorem, a recent extension of its two-prover variant that underlies many hardness of approximation results. The original proofs used de Finetti type decomposition for strategies. We present a new proof that is based on a technique we introduced recently for proving strong converse results in multiuser information theory and entails a change of measure after replacing hard information constraints with soft ones.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A New Proof of Nonsignalling Multiprover Parallel Repetition
Theorem
Himanshu Tyagi*†*
Shun Watanabe*‡*
Abstract
We present an information theoretic proof of the nonsignalling multiprover parallel repetition theorem, a recent extension of its two-prover variant that underlies many hardness of approximation results. The original proofs used de Finetti type decomposition for strategies. We present a new proof that is based on a technique we introduced recently for proving strong converse results in multiuser information theory and entails a change of measure after replacing hard information constraints with soft ones.
††footnotetext: *†*Department of Electrical Communication Engineering, Indian Institute of Science, Bangalore 560012, India. Email: [email protected]. *‡*Department of Computer and Information Sciences, Tokyo University of Agriculture and Technology, Tokyo 184-8588, Japan. Email: [email protected].
I Introduction
The parallel repetition theorem is an important tool in theoretical computer science which is used to prove hardness of approximation results. It shows roughly that if distributed provers can satisfy a random predicate with probability without coordinating, then they can satisfy independent copies of the same predicate only with probability going to [math] exponentially in . Such a theorem for two-prover case was shown in [12], with a simplified proof given in [6]. The precise form of the statement of such a theorem relies on the structure of the query distribution, the predicate, and the class of strategies allowed for the provers. In particular, it was noted in [6] that in most applications we only need a parallel repetition theorem for nonsignalling strategies, a class of correlation that subsumes even quantum correlations.
While the validity of a multiprover parallel repetition theorem for the standard setting is unclear, recently such a theorem has been proved for the nonsignalling setting [8] (see, also, [1, 2]). The proof uses de Finetti type decomposition of strategies and a linear programming interpretation of the value function. In this paper, we provide a new proof of the same result that is completely “information theoretic”. Our proof draws on the connection between the parallel repetition setting and that of multiuser rate-distortion theory. In particular, we rely on a change of measure approach for proving strong converse results in multiuser distributed coding problems. This approach was introduced for centralized coding problems in [5], and was recently sophisticated and extended to distributed coding problems in [13] using a relaxation technique introduced in [9]; see [13] for detailed account. In the change of measure approach, we first replace the hard information constraints involving conditional independence by their soft counterparts involving bounds on KL divergences. Next, we change measure to that obtained by conditioning on the “winning” event. The -fold problem is related to a single instance of the problem using a tensorization property of the resulting value function.
This paper is part review – we recall the formulation and results for two provers in Section II, followed by those for the multiprover setting in Section III. Our main contribution is a new proof of the multiprover parallel repetition theorem (Theorem 4) given in Section IV. The final section contains brief concluding remarks.
Notation. Given random variable , for a subset of , we abbreviate the random variable as . Similarly, for a tupple , denote . For other notations, we basically follow [4].
II Two-Prover Parallel Repetition Theorem
We begin by reviewing the two-prover setting. A two-prover game consists of a verifier and two-provers and . The verifier samples a query according to a fixed joint distribution on finite alphabet , and sends and to and , respectively. Upon receiving the queries, and send responses and , respectively, where depends only on . They may use any mappings , , of to get ; for finite sets and , denote by the set of all mappings from . The provers win the game if for a prespecified predicate . We will represent the game by the pair . The goal of the provers is to choose mappings that maximize the winning probability. This maximum winning probability is termed the value of the game and is given by
[TABLE]
In parallel repetitions of the game, the verifier samples sequences of queries and according to the product distribution . The provers now respond with sequences and where depends only on , . They win the game if predicates for each coordinate are satisfied, namely the predicate for the parallel repetition game is given by
[TABLE]
where denotes the AND function. The value of is defined similarly as follows:
[TABLE]
As a simple attempt towards winning the parallel repetition game, provers may simply apply strategies for single instance of the game across each coordinate. In fact, they may use a different strategy for each coordinate. Clearly, any such attempt will have value less than . But can they do better by using other functions that take into account the entire vector and do not have a product structure across coordinates? At a high level, a parallel repetition theorem says that the answer is no: The exponential decay of value with is unavoidable.
The first instance of parallel repetition theorem was shown by Raz [12] (see [6] for simpler proof).
Theorem 1** ([12]).**
There exists a function satisfying if such that for any game ,
[TABLE]
The statement above holds for any game with the same universal function and universal exponent that depends only on the cardinality of the response set .
An important aspect of the setting above, which will be a prime focus here, is the role of randomness in response strategies. A simple derandomization argument shows that the value of games will not change if the pair is generated randomly using shared randomness that is independent of the query. Such strategies with shared randomness available to the provers can be described by channels
[TABLE]
where is a distribution on and given by is the deterministic strategy induced by functions .
In physics, strategies of the form (1) are said to satisfy the hidden variable theory, a classical physics principle which says that if all the hidden variables are revealed then the state of the world will be deterministic. We denote the set of all such strategies by . With this new notation at our disposal and using the observation above that shared randomness does not improve the value of a game, we can express alternatively as
[TABLE]
Note that since strategies using shared randomness can perform at best as deterministic strategies, the same must be true for strategies using independent private randomness at the provers. Thus, yet another alternative form of is given by
[TABLE]
namely we can consider maximization over Markov chains with marginal of fixed to .
It is important to examine the limitation posed by restricting to strategies in . In fact, a contentious debate in physics revolving around statistical modeling of quantum measurements was finally settled in the second half of the previous century through quantitative distinction between correlations allowed in hidden variable theory and more general nonsignalling correlation.
For our setting, we can define the class of nonsignaling strategies as follows.
Definition 1** (Nonsignalling strategies).**
Let be the set of all strategies satisfying
[TABLE]
for every and . Equivalently, we can express these conditions as , namely the Markov relations and hold.
Note that these strategies include as a special case the “long Markov strategies” satisfying . This latter class performs as well as . In fact, it is easy to verify that strategies in satisfy (3), which yields
[TABLE]
In typical applications of parallel repetition theorem in complexity theory, it suffices to use a version of the theorem for nonsignalling strategies. In any case, the next question is of independent interest: Does parallel repetition theorem hold if we allow the broader class of nonsignalling strategies?
Specifically, denote by the maximum probability of satisfying using nonsignalling strategies, i.e.,
[TABLE]
By (4), . In fact, the inequality can be strict for some games as illustrated by the next example.
Example 1**.**
Consider the following CHSH type Bell test experiment [3]. For , let be the uniform distribution on , and let predicate be given by . It can be seen that the winning probability of any deterministic strategy is upper bounded by , whereby . This bound is attained by the deterministic strategy for all .
Next, consider the strategy given by
[TABLE]
This particular correlation is termed the Pepescu-Rohrlich box, PR box for short, since it appeared in [10]. It can be verified that this strategy satisfies the nonsignalling condition (3). But provers can win the game with probability by using this strategy. Therefore, , strictly more than .
Holenstein proved that the following version of parallel repetition theorem for nonsignaling strategies.
Theorem 2** ([6]).**
There exists a function satisfying if such that for any game ,
[TABLE]
Note that now the exponent of parallel repetition theorem doesn’t even depend on the cardinality of response set. Also, we remark that the proof of Theorem 2 in [6] is much simpler than the simplified proof of Theorem 1 in the same paper.
III Multiprover Parallel Repetition Theorem
Moving to the multiprover setting, a multiprover game consists of a verifier and provers . Denoting and , the verifier samples a query according to a fixed joint distribution and sends to for in . Upon receiving the queries, each prover sends a response , , to the verifier. The provers win the game if for a given predicate .
As in the previous section, the provers’ strategy can be described by a channel . The set of all strategies that can be described as convex combination of deterministic, local strategies is denoted by . Namely, is the set of all strategies of the form
[TABLE]
where is measure on and . The value of the game that can be attained by strategies satisfying hidden variable theory is given by
[TABLE]
The parallel repetition game is defined analogously to the two-player setting.
For the multi-prover setting, a nonsignaling strategy is a channel such that the following condition is satisfied:
[TABLE]
for all and all subsets of . Denoting the set of all nonsignaling strategies by , the value of the game that can be attained by nonsignaling strategies is given by
[TABLE]
A general parallel repetition theorem for strategies in is not known. As we have mentioned at the end of the previous section, proving parallel repetition theorem for strategies in is relatively easier than that for strategies in ; the former is known to hold under the condition that query distribution has full support [1, 2]. Remarkably, without the full support condition, a counterexample appeared in [7] for a parallel repetion theorem for . Specifically, it was shown that the following three-prover game satisfies for all (this example appeared first in [1]):
Example 2** (Anticorrelation game).**
For , let , . Let query distribution be uniform on all strings with Hamming weight . The required game is given by predicate
[TABLE]
i.e., the responses are identical at the two locations where queries are .
This example rules out a parallel repetition theorem for in general. In other words, is not sufficient to claim the exponential decay of winning probability in parallel repetition games. In fact, even preceeding this counterexample, a parallel repetition theorem, i.e., exponential decay, was shown to hold if the value of the single game for a broader class of strategies, called sub-nonsignalling strategies, is strictly less than [8].
Sub-nonsignaling strategies , which we define next, need not be conditional distributions and are only required to be subnormalized, namely we only need it to be nonnegative and satisfying . Both total variation distances and KL diveregence can be applied to such subnormalized distribution. We remark that the marginal and the conditional distribution , respectively, for a subnormalized distribution are defined as and . While , too, is a subnormalized distribution, will be a (normalized) distribution.
Definition 2** (Sub-nonsignalling strategies).**
The set of sub-nonsignalling strategies consists of subnormalized such that, for each subsets of , there exists a channel satisfying:
[TABLE]
for all .
Note that nonsignalling strategies are those for which the inequality condition above is replaced with identity. Heuristically, sub-nonsignalling strategies maybe regarded as the class of strategies close to nonsignalling stratetigies in statistical distance. Another heuristic was suggested in [8] which interpretted sub-nonsignalling strategies as nonsignalling strategies with addition dependent power to randomly abstain from responding. In fact, we can find a sub-nonsignalling strategy close to a distribution for which all conditional distributions are close to some conditional distributions .111Lemma 3 is a multiprover extension of [6, Lemma 9.5] which showed that in the two-prover setting we can find a nonsignalling .
Lemma 3** ([8, Lemma 5.2]).**
Let be a query distribution on , and let be a probability distribution on . Suppose that for each there exist a conditional distribution such that
[TABLE]
Then, there exists a sub-nonsignaling such that
[TABLE]
By definition, the value of the game that is attained by sub-nonsignaling strategies satisfy . For two-prover games, was shown in [8] to coincide with . However, equality may not hold for multiprover games, in general. Indeed, the game in Example 2 has and . Interestingly, when the query distribution has full support, there exists a constant such that, for , (. [8])
[TABLE]
Before we state the parallel repetition theorem for sub-nonsignalling strategies, we switch to a slightly more general formulation where in the parallel repition game, instead of winning all the games, we are interested in quantifying the probability that the provers win more than a fraction of the game. This formulation is closer to the rate-distortion theory formulation of information theory and appeared, for instance, in [11]. Specifically, for , consider
[TABLE]
where . Since is the indicator for a win in the th coordinate, denotes the total number of wins. Analogously, is defined by restricting the maximum in (7) to nonsignalling strategies; our original definition coincides with .
We now recall the multiprover parallel repetition theorem from [8].
Theorem 4**.**
Let be a multiprover game with . For any with , we have
[TABLE]
where the constant depends only on .
For multiprover games with full support query distributions, Theorem 4 together with (6) implies the parallel repetition theorem for nonsignaling strategies, shown first in [2].
The proof of the multiprover parallel repetition theorem for nonsignalign strategies and full support query distribution in [2] entails extending the proof approach for the two-prover setting in [6]. An alternative proof was provided in [1] by using a technique based on de Finetti theorem. At a high level, this technique allows us to restrict attention to convex combinations of product strategies. In [8], the parallel repetition theorem for sub-nonsignaling strategies, namely Theorem 4, was proved by using another variant of de Finetti theorem.
In the next section, we provide an alternative proof of Theorem 4. Our proof is based on a technique recently developed by the authors in [13] to prove strong converse theorems for multi-user information theory problems. A crucial observation is that the parallel repetition theorem can be regarded as an exponential strong converse of a multi-user rate-distortion problem with no communication. In contrast to the proof in [8] that uses a structural decomposition of strategies, our proof is completely “information theoretic”.
IV A New Proof of Theorem 4
Our goal is to derive an upper bound for the maximum probability of the event of winning more than fraction of games. Following [13], we start with a change of measure (query distribution and provers’ strategy) by conditioning on . The “distance” between the new distribution and the original distribution are bounded in terms of the exponent of the probability of . However, since we have conditioned the strategy on the winning event, the information structure may break down – we are only guaranteed to be “close” to a distribution satisfying our original information constraints. Nonetheless, to complete the proof we need an appropriate single-letterization argument to relate this new game to one instance of the original game.
To enable this, our proof looks at the expected number of wins instead of the probability of winning. For a given multiprover game and , define
[TABLE]
Note that the maximum is over the set of distributions, which we call -approximate nonsignaling distributions, that satisfy the information structure only approximately. In particular, we have replaced the hard information constraints required by nonsignalling strategies by their soft counterparts expressed by bounds on KL diveregence. Below we shall see two properties of : it tensorizes and can be bounded above roughly by . We note that a linear programming based notion of approximate nonsignaling strategies was used in [6, 2, 1, 8]. Our divergence based notion of approximation is amenable to tensorization and facilitates an information theoretic proof.
We can now apply our proof recipe outlined earlier. Under the changed measure obtained by conditioning on , the expected number of wins is more than . Also, this new measure satisfies the soft information constraint bound with equal to the exponent of probability of . Thus, must be more than . Using the properties of mentioned earlier, we can bound it above roughly by , which shows that must be roughly bounded above by . The required bound for exponent is obtained by the contrapositive statement.
Formal arguments follow. We begin with the tensorization property.
Lemma 5**.**
For a given multiprover game , and , we have
[TABLE]
Proof.
The inequality holds by definition. For the other direction, fix a -approximate nonsignalling distribution . We have
[TABLE]
where is distributed uniformly on . Furthermore,
[TABLE]
where the first inequality follows from [13, Proposition 1] and the second and the third inequalities hold since conditioning decreases entropy. Thus, is a -approximate nonsignalling distribution and the claim follows by (8). ∎
Next, we relate and using Lemma 3.
Lemma 6**.**
For a given multiprover game and , we have
[TABLE]
where the constant depends only on .
Proof.
Consider a -approximate nonsignalling distribution . For any , since and , by using Pinsker’s inequality [4] and the triangle inequality, we get
[TABLE]
Next, by applying Lemma 3 with , there exists a sub-nonsignaling strategy such that
[TABLE]
Finally, since is bounded by , we have
[TABLE]
where the final inequality uses the fact that is sub-nonsignalling. We obtain the claimed bound with since was an arbitrary -approximate nonsignalling distribution. ∎
We have all the tools for the proof of Theorem 4 in place.
Proof of Theorem 4
If , we can find a sub-nonsignalling strategy such that for some . Denoting
[TABLE]
we change the measure by conditioning on the event as follows:222Although is only a subnormalized distribution, the changed measure is a distribution.
[TABLE]
Then, by a simple calculation, we have
[TABLE]
Furthermore, for each , denoting by the dominating conditional distribution for the sub-nonsignaling strategy (cf. (5)), we have
[TABLE]
where the second inequality follows from the sub-nonsignaling condition (5) and the third inequality uses the fact that is a conditional distribution. The above bound implies that the changed measure is -approximate nonsignalig distribution. Furthermore, since holds with probability under the changed measure , we have
[TABLE]
which together with Lemma 5 and Lemma 6 implies
[TABLE]
By considering contraposition, if
[TABLE]
then we have . Thus, by setting , implies (9), and we have the claim of the theorem. ∎
V Discussion
A multiprover parallel repetition theorem for standard strategies, i.e., strategies satisfying the hidden variable theory, is not available. In fact, our initial attempt in this work was to provide an alternative proof of the two-prover parallel repetition theorem for the standard strategies. We tried to prove a counterpart of the tensorization property, Lemma 5, for standard strategies. However, our preliminary attempt failed, mainly because it was difficult to identify a suitable soft constraint for the long Markov chain in (2). Nonetheless, we do believe that our measure change approach can be used to obtain a parallel repetition theorem for standard strategies, perhaps by proving an approximate tensorization property of the value function with suitable soft constraints.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. Arnon-Friedman, R. Renner, and T. Vidick, “Non-signaling parallel repetition using de Finetti reductions,” IEEE Trans. Inf. Theory , vol. 62, no. 3, pp. 1440–1457, November 2016.
- 2[2] H. Buhrman, S. Fehr, and C. Schaffner, “On the parallel repetition of multi-player games: The no-signaling case,” in Leibniz International Proceedings in Informatics , 2014, pp. 24–35.
- 3[3] J. F. Clauser, M. A. Horne, A. Shimony, and R. A. Holt, “Proposed experiment to test local hidden-variable theories,” Physical Review Letters , vol. 23, no. 15, pp. 880–884, October 1969.
- 4[4] I. Csiszár and J. Körner, Information theory: Coding theorems for discrete memoryless channels. 2nd edition . Cambridge University Press, 2011.
- 5[5] W. Gu and M. Effros, “A strong converse for a collection of network source coding problem,” Proc. IEEE International Symposium on Information Theory , pp. 2316–2320, 2009.
- 6[6] T. Holenstein, “Parallel repetition: Simplifications and the no-signaling case,” Theory of Computing , vol. 5, pp. 141–172, 2009.
- 7[7] J. Homgren and L. Yang, “(A counterexample to) Parallel repetition for non-signaling multi-player games,” 2018, http://people.csail.mit.edu/holmgren/papers/ns-parrep.pdf.
- 8[8] C. Lancien and A. Winter, “Parallel repetition and concentration for (sub-)no-signalling games via a flexible constrained de Finetti reduction.” Chicago J. Theor. Comput. Sci. , no. 11, pp. 1–22, 2016.
