Optimal Separation and Strong Direct Sum for Randomized Query Complexity
Eric Blais, Joshua Brody

TL;DR
This paper proves new theorems on the query complexity of randomized algorithms, establishing bounds and direct sum properties, and demonstrates their implications for communication complexity, answering longstanding open questions.
Contribution
It introduces a bounded-error separation theorem and a strong direct sum theorem for randomized query complexity, advancing understanding of complexity bounds and their applications.
Findings
Existence of a total function with ( ext{R}(f) \, ext{log}(1/\u03b5)) query complexity
Optimal superlinear direct-sum theorem for randomized query complexity
Implications for communication complexity, confirming open questions
Abstract
We establish two results regarding the query complexity of bounded-error randomized algorithms. * Bounded-error separation theorem. There exists a total function whose -error randomized query complexity satisfies . * Strong direct sum theorem. For every function and every , the randomized query complexity of computing instances of simultaneously satisfies . As a consequence of our two main results, we obtain an optimal superlinear direct-sum-type theorem for randomized query complexity: there exists a function for which . This answers an open question of Drucker (2012). Combining…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Optimal Separation and Strong Direct Sum for Randomized Query Complexity
Eric Blais
University of Waterloo
Joshua Brody
Swarthmore College
Abstract
We establish two results regarding the query complexity of bounded-error randomized algorithms.
Bounded-error separation theorem.
There exists a total function whose -error randomized query complexity satisfies .
Strong direct sum theorem.
For every function and every , the randomized query complexity of computing instances of simultaneously satisfies .
As a consequence of our two main results, we obtain an optimal superlinear direct-sum-type theorem for randomized query complexity: there exists a function for which . This answers an open question of Drucker (2012). Combining this result with the query-to-communication complexity lifting theorem of Göös, Pitassi, and Watson (2017), this also shows that there is a total function whose public-coin randomized communication complexity satisfies , answering a question of Feder, Kushilevitz, Naor, and Nisan (1995).
1 Introduction
We consider two fundamental questions related to the query complexity of functions in the bounded-error randomized setting: how the randomized query complexity of total functions scales with the allowable error (the separation problem), and how the query complexity of computing instances of a function scales with the complexity of computing only 1 instance of the same function (the direct sum problem). Standard folklore arguments give upper bounds on how much the randomized query complexity can depend on and on in these two problems; the results described below show that these well-known upper bounds are tight in general.
A randomized algorithm computes a function over a finite set with error if for every input , the algorithm outputs the value with probability at least . The query cost of is the maximum number of coordinates of that it queries, with the maximum taken over both the choice of input and the internal randomness of . The -error (worst-case) randomized query complexity of (also known as the randomized decision tree complexity of ) is the minimum query complexity of an algorithm that computes with error at most . We denote this complexity by , and we write to denote the -error randomized query complexity of .
Another natural measure for the query cost of a randomized algorithm is the expected number of coordinates of an input that it queries. Taking the maximum expected number of coordinates queried by over all inputs yields the average query cost of . The minimum average query complexity of an algorithm that computes a function with error at most is the average -error query complexity of , which we denote by . We again write . Note that corresponds to the standard notion of zero-error randomized query complexity of .
1.1 Our Results
Bounded-Error Separation Theorem for Query Complexity
One of the first tricks that one learns in the study of randomized algorithm is success amplification: it is possible to cheaply reduce the error of a randomized algorithm from to any by running the algorithm times and outputting the most frequent answer. In the context of randomized query complexity, this means that for every function ,
[TABLE]
When considering partial functions, it is easy to see that the success amplification trick is optimal, as there are partial functions for which this relationship is tight (see Section 2.2). However, in the case of total functions, for many natural functions such as the majority function, parity function, dictator function, etc., the stronger bound \mathrm{R}_{\epsilon}(f)=O\big{(}\mathrm{R}(f)\big{)} holds and until now it was not known whether the bound in (1) is tight for any total function. In fact, even separations between zero-error and -error randomized query complexity were not known until very recently, when Ambainis et al. [2] showed that there exists a total function for which . Similarly, other separations between randomized query complexity and other measures of complexity have also only been established very recently [23, 1, 3, 4, 2].
In this work, we give the first separation within the bounded-error randomized query complexity setting. Our separation shows that the bound in (1) is optimal in general.
Theorem 1**.**
For infinitely many values of and every , there exists a total function with randomized query complexity
[TABLE]
Note that by the trivial relation between average and worst-case randomized query complexity, Theorem 1 implies the existence of a function for which \mathrm{R}_{\varepsilon}(f)\geq\Omega\big{(}\mathrm{R}(f)\cdot\log\tfrac{1}{\varepsilon}\big{)} and \overline{\mathrm{R}}_{\varepsilon}(f)\geq\Omega\big{(}\overline{\mathrm{R}}(f)\cdot\log\tfrac{1}{\varepsilon}\big{)}, giving optimal separations in both the worst-case randomized query complexity and average query complexity settings.
Strong Direct Sum Theorem
The direct sum problem asks how the cost of computing a function scales with the number of instances of the function that we need to compute. This problem has received a considerable amount of attention in the context of query complexity [18, 7, 24, 25, 19, 8, 13], communication complexity [20, 14, 11, 5, 21, 6], and beyond.
Given a function and a parameter , define by setting f^{k}(x^{(1)},\ldots,x^{(k)})=\big{(}f(x^{(1)}),\ldots,f(x^{(k)})\big{)}. A simple union bound argument shows that the randomized query complexity of is bounded above by
[TABLE]
since we can run a randomized algorithm that computes with error at most on each of the instances. An analogous upper bound holds in the average query complexity setting as well.
Jain, Klauck, and Santha [19] first considered the problem of showing a direct sum theorem for randomized query complexity. They showed that for every function and for small enough constant , Note that in this inequality, the allowable error on the right-hand side of the equation is larger than the error parameter, in contrast to the upper bound where it is (much) smaller. Ben-David and Kothari [8] obtained an improved direct sum theorem holds, showing that holds for every function. This result is formally stronger since it implies the Jain–Klauck–Santha bound, but it also does not show that the error parameter on the right-hand-side of the inequality needs to be smaller than , as it is in the upper bound (2).
We show that the bound in (2) is tight in the average-case query complexity model.
Theorem 2**.**
For every function , every , and every ,
[TABLE]
We establish Theorem 2 by proving a corresponding strong direct sum theorem in the distributional setting, as we discuss in more details in Section 1.3. It remains open to determine whether a similar strong direct sum theorem holds in the worst-case randomized query complexity model. However, in that setting Shaltiel [25] has shown that a proof of such a direct sum theorem can’t be obtained via a corresponding theorem in the distributional setting, as a counterexample shows that direct sum theorems do not hold in this setting in general.
1.2 Applications
Superlinear Direct-Sum-Type Theorem for Query Complexity
Combining (1) and (2), we obtain a bound on the cost of computing instances of a function with bounded (constant) error and the cost of computing a single instance of the same function:
[TABLE]
Drucker [13, Open problem 2] asked if the superlinear dependence on in (3) is necessary for any total function . Theorems 1 and 2 give a positive answer to this question.
Corollary 3**.**
There exists a total function such that for all ,
[TABLE]
Note that Corollary 3 stands in contrast to the quantum query complexity setting, where such a superlinear dependence on is not required [10].
Superlinear Direct-Sum-Type Theorem for Communication Complexity
Let denote the minimum amount of communication required of a public-coin randomized protocol that computes a function with error at most . As in the query complexity model, the communication complexity of the function is bounded above by
[TABLE]
Feder, Kushilevitz, Naor, and Nisan [14] showed that this upper bound is not tight in general, as the equality function satisfies \mathrm{R}^{\mathrm{cc}}(\textsc{Eq}^{k})=O\big{(}k\cdot\mathrm{R}^{\mathrm{cc}}(\textsc{Eq})\big{)}.111In fact, Feder et al. showed that the private-coin randomized communication complexity of Eq satisfies the stronger relation \mathrm{R}^{\mathrm{cc,priv}}(\textsc{Eq}^{k})=o\big{(}k\cdot\mathrm{R}^{\mathrm{cc,priv}}(\textsc{Eq})\big{)}; their construction also directly establishes the result stated in the main text. They then asked whether holds for all functions or not [14, Open problem 2 in §7].
In the last few years, there has been much work on related direct sum questions. Molinaro, Woodruff, and Yaroslavtsev [21, 22] showed that in the one-way communication complexity model, the equality function does satisfy the superlinear direct sum bound \mathrm{R}^{\mathrm{cc},\to}(\textsc{Eq}^{k})=\Theta\big{(}k\log k\cdot\mathrm{R}^{\mathrm{cc},\to}(\textsc{Eq})\big{)}. In the two-way communication complexity model that we consider, Barak, Braverman, Chen, and Rao [6] showed that every function satisfies the direct sum R(f^{k})=\widetilde{\Omega}\big{(}\sqrt{k}\,R(f)\big{)}, and this bound remains the state of the art as far as we know. Using the connection between information complexity and amortized communication complexity of Braverman and Rao [9], Ganor, Kol, and Raz [15] also showed that there is a partial function whose distributional communication complexity is exponentially larger than its amortized distributional communication complexity, showing that a tight direct sum theorem cannot hold in general in this setting. None of these results, however, answer Feder et al.’s original question.
Corollary 3 combined with the randomized query-to-communication lifting theorem of Göös, Pitassi, and Watson [17] answers Feder et al.’s question by showing that there is a function for which the bound in (4) is tight.
Corollary 4**.**
There is a constant and a total function such that for all ,
[TABLE]
1.3 Proof Overviews
Bounded-Error Separation Theorem
The proof of Theorem 1 is established by following the general approach used to great effect by Ambainis et al. [2]: first, identify a partial function for which the query complexity separation holds, then design a variant of the Göös–Pitassi–Watson (GPW) pointer function [16] that “embeds” the partial function into a total function and preserves the same separation.
The first step in this plan is accomplished by observing that the partial gap identity function defined by
[TABLE]
satisfies \overline{\mathrm{R}}_{\varepsilon}(\textsc{GapID})=\Theta\big{(}\mathrm{R}(\textsc{GapID})\cdot\log\tfrac{1}{\varepsilon}\big{)} for every .
Ambainis et al. [2] also used (essentially) the same gap identity function to establish the separation \overline{\mathrm{R}}_{0}(f)=\widetilde{\Omega}\big{(}\mathrm{R}(f)^{2}\big{)}. In constructing a GPW pointer function analogue of the GapID function, however, Ambainis et al. lose a few logarithmic factors: their construction shows that there exists a total function with -error randomized query complexity that satisfies and . The polylogarithmic gap between those two bounds is not particularly important when comparing this query complexity to the zero-error randomized query complexity of the same function, but it makes it impossible to obtain any separation at all between and whenever . To prove Theorem 1, we need a new variant of the GPW pointer function whose analysis avoids any gap that is a non-constant function of .
At a high-level, GPW pointer functions are constructed by defining an array of cells, whose values are taken from some (typically fairly large) alphabet . The first logarithmic gap in Ambainis et al.’s upper and lower bounds occurs because the upper bound is measured in terms of the number of bits queried by the algorithm while the lower bound is in terms of the number of cells queried by an algorithm. To eliminate this gap, we must either reduce the size of the alphabet from to a constant size or modify the analysis so that both the upper and lower bounds are in terms of bit-query complexity. We do the latter, using the notion of resilient functions [12] to show that an algorithm must query a constant fraction of the bits of a cell to learn anything about the contents of that cell. Resilient functions were introduced by Chor et al. [12], who gave an essentially optimal construction using basic linear algebra and the probabilistic method. Sherstov recently created a gadget [26] resilient to approximate polynomial degree. This gadget is both similar in construction to [12] and in motivation to our work; it too removes some loss due to function inputs coming from large alphabets.
The second logarithmic gap in Ambainis et al.’s construction occurs because the location of the “special” cells that an algorithm seeks to discover in the GPW pointer function can be found by following a binary tree structure; the upper bound accounts for the cell queries an algorithm requires to follow this structure while the lower bound holds even if an algorithm finds these special cells in a single query. We bypass this problematic issue with a simple but powerful observation: in our setting, once we use resilient functions to encode the contents of each cell, there is no longer any requirement to keep the size of the alphabet for each cell in the GPW pointer function to be polylogarithmic in and so we can include a lot more information in each cell without affecting the query complexity gap. We use this flexibility to replace pointers to the root of a binary tree structure with direct pointers to all the special cells in its leaves.
The details of the proof of Theorem 1 are presented in Section 2.
Strong Direct Sum Theorem
Our proof of the strong direct sum theorem proceeds by establishing an analogous result in the setting of distributional query complexity. The -error distributional complexity of with respect to the distribution on , denoted by , is the minimum query complexity of a deterministic algorithm that computes the value correctly with probability at least when is drawn from .
The distributional complexity approach is also the one used in prior work on direct sum theorems for query complexity [19, 8]. The challenge with this approach, however, is that a strong direct sum theorem for distributional query complexity does not hold in general, as Shaltiel [25] demonstrated (see also §4 in [13]): there exists a function and a distribution on ’s domain for which \mathrm{D}^{\mu^{k}}_{\epsilon}(f^{k})=O\big{(}\epsilon k\mathrm{D}^{\mu}_{\epsilon}(f)\big{)}.
A similar barrier to strong direct sum theorems exists in the communication complexity setting. Molinaro, Woodruff, and Yaroslavstev [21, 22] bypassed this barrier by considering randomized protocols that are allowed to abort with some bounded probability. They were then able to show that the information complexity of such communication protocols (in both the one-way and two-way communication models) satisfies a strong direct sum property.
Following an analogous approach, we consider randomized algorithms that are allowed to abort (or, equivalently, to output some value that corresponds to “don’t know”) with some probability at most . The -error, -abort randomized query complexity of a function is denoted by . With a natural extension of Yao’s minimax principle, we can obtain bounds on this randomized query complexity by considering the corresponding -error, -abort distributional complexity of a function , which is the minimum query complexity of deterministic algorithms must err with probability at most and abort with probability at most when inputs are drawn from the distribution . We show that a strong direct sum theorem does hold in this setting.
Lemma 5**.**
There exists a constant such that for every function , every distribution on , and every ,
[TABLE]
The proof of Theorem 2 is then obtained from this lemma by showing that an analogue of Yao’s minimax principle holds for algorithms that can both err and abort. The full details of the proofs of Lemma 5 and Theorem 2 are presented in Section 3.
2 Bounded-Error Separation Theorem
We complete the proof of Theorem 1 in this section. In Section 2.1, we first define the pointer function PtrFcn at the heart of the proof. In Sections 2.2–2.4, we establish a lower bound on the query complexity of the PtrFcn function via reductions from the GapID function, and in Section 2.5, we provide a matching upper bound on this query complexity. We complete the proof of Theorem 1 in Section 2.6 by combining these results with the use of resilient functions.
2.1 Pointer Function
The total function at the heart of the proof of Theorem 1 is a variant of the Göös–Pitassi–Watson pointer function PtrFcn that we define below. Let denote the set .
Define to be the set of symbols that encode a value that we denote by , row pointers that we denote by , and one column pointer that we denote .
The function is defined as follows. First, we represent an input as an grid of cells. We say that a column is special for if for every . Then if and only if
- •
There is a unique column that is special for ;
- •
Within the special column , there is a unique cell called the special cell;
- •
for all and all ;
- •
For all , let . Then, we have
- –
(i.e., all cells pointed to by the special cell have value 0)
- –
(i.e., half the cells pointed to by the special cell point back to the special cell)
We call the cells linked cells; linked cells that point back to the special cell are good. In summary, if (i) there is a special column, (ii) within the special column, there is a special cell, (iii) all cells in the special column that are not the special cell have for all , (iv) each linked cell has value [math], and (v) exactly half of the linked cells are good.
The following simple claim will be useful in obtaining the query complexity lower bound for PtrFcn.
Claim 6**.**
Let be an -error randomized algorithm for PtrFcn. Let , and let be the special cell of . Then probes with probability at least .
Proof.
Let be the same input as except that . Then but differ only on the special cell. Whenever doesn’t probe the special cell, it must output the same value for and , so it errs on either or . By the error guarantee of and a union bound, the probability that doesn’t probe cell is at most . ∎
2.2 Lower Bound on the Query Complexity of GapID
We begin the proof of Theorem 1 by establishing a (simple, asymptotically optimal) lower bound on the average query complexity of the GapID function.
Lemma 7**.**
For every and every ,
Proof.
Fix any . We will show that . This suffices to complete the proof of the theorem since it implies that for any , .
Let be a randomized algorithm that computes GapID with error probability at most . Let be a random variable that denotes the set of coordinates queried by , and let denote the event that each coordinate of the input queried by the algorithm has the value [math]. Note that when the event occurs, has the same behavior on input as it does on the input . Since and has error probability at most , this means that for every input ,
[TABLE]
and so .
Define to be the uniform distribution on all inputs with . To err with probability at most on those inputs, the algorithm must satisfy for every in the support of . Combining this upper bound with the previous lower bound, we therefore have that
[TABLE]
For any value ,
[TABLE]
Therefore,
[TABLE]
Combining this inequality with (5), we obtain
[TABLE]
Rearranging the inequality yields and so the average query complexity of is bounded below by
[TABLE]
2.3 Lower Bound on the Query Complexity of BlueRed
We wish to relate the average query complexity of PtrFcn to that of the GapID function. We do this by relating both query complexities to that of another partial function that we call BlueRed.
Let , and call a symbol colored if it is not black. The input is an grid of entries from , with the promise that each column contains a unique colored entry, and either all colored entries are red, or half the colored entries are blue. Formally, we define as follows:
[TABLE]
The following reduction shows that the average query complexity of BlueRed is times as large as that of the GapID function.
Lemma 8**.**
For every ,
Proof.
Fix any algorithm that computes BlueRed with error at most and has expected query cost . We will use to construct an algorithm that computes GapID with error at most and expected cost .
Given an input , the algorithm constructs an instance of the BlueRed problem in the following way. First, it generates indices independently and uniformly at random. Then it defines
[TABLE]
Finally, the algorithm emulates the algorithm on input , querying the value of whenever queries the bit for some . This construction guarantees that computes GapID with error at most ; its query complexity corresponds to the number of red or blue entries that are queried by .
Let be the random variable that denotes the set of indices queried by , and let denote the set of columns whose red or blue entry is queried by . Without loss of generality, we may assume that does not query any entry of a column after it finds the colored entry within that column. We partition into two sets and , where denotes the set of columns whose colored entry is found within the first queries to that column and denotes the set of columns whose colored entry was found with more than queries to that column. Let be indicator variables where if and only if the th query made by is red or blue and is one of the first queries to column . Since each value is drawn uniformly at random from , each of these indicator variables has expected value . Therefore,
[TABLE]
Furthermore, by definition at least queries are made to each column in so the expected size of this set is bounded by \operatorname*{\mathrm{E}}\big{[}|C_{\mathrm{late}}|\big{]}\leq\frac{2}{n}\operatorname*{\mathrm{E}}\big{[}|Q|\big{]} and
[TABLE]
Thus, the expected query cost of is at most , as we wanted to show. ∎
2.4 Lower Bound on the Query Complexity of PtrFcn
Lemma 9**.**
For every ,
Proof.
Let be a randomized algorithm that computes PtrFcn with error at most and expected query cost . We use to construct a randomized algorithm that computes BlueRed with the same cost and error at most .
Let be an input for BlueRed. Each time queries a cell, queries the corresponding entry in . If the entry in is black, then returns . If the entry in is red, then returns . Finally, if the entry of is blue, then terminates the emulation and returns [math]. If reaches the end of the emulation without having been terminated, outputs the same result as .
The query complexity of is at most that of . It remains to show that errs with probability at most . There are two cases to consider.
The first case is when . Then maps directly to an input and hence errs with probability at most on .
The second case is when . Let be an arbitrary -input for PtrFcn such that (i) whenever , (ii) whenever , and (iii) the special entry and good entries of correspond to blue entries of . It might not be possible to completely emulate on input without knowing the exact set of blue entries. However, doesn’t need to fully emulate —it only needs to know how to map black and red entries. Once a blue entry is probed, halts and outputs [math]. In this way, we claim that on input probes the same cells as on input until it halts. Therefore its output is the same as unless probes the special cell or a good cell. Moreover, in this case, outputs correctly with certainty. Thus, by Claim 6, the error of is at most
[TABLE]
2.5 Upper Bound on the Query Complexity of PtrFcn
The proof of Theorem 1 also requires a tight upper bound on the (worst-case) randomized query complexity of PtrFcn. This argument is straightforward, and similar to the analysis of Ambainis et al. [2] for their analogous pointer function.
Lemma 10**.**
.
Proof.
The algorithm that computes the PtrFcn function is described in Algorithm 2. In this algorithm, the set corresponds to the set of potential special columns. The query complexity of PtrFcnSolver follows from the fact that each iteration of the inner while loop either eliminates one of the columns from the set of candidates or one of the cells in the columns in . The final check of the (lone remaining) potential special column at the end of the algorithm examines at most cells.
Whenever the PtrFcnSolver returns the value 1, then it in fact has observed a certificate that so the algorithm has perfect soundness.
Conversely, suppose . Exactly half of the columns are good, so contains such a cell with probability at least . Now, consider the for loop iteration when the first good cell is selected. Since is a good cell, it points back to the special cell, which in turn points to a linked cell in all columns except the special column. For any remaining , probes the linked cell in column , verifies the value equals [math], and removes it from . In this way, the remaining columns in save the special column are eliminated. Once we reduce to a single remaining candidate, we can probe all cells in this column and all linked cells using queries to verify that indeed . ∎
2.6 Completing the Proof of Theorem 1
The last ingredient that we need to complete the proof of Theorem 1 is the concept of resilient functions [12].
Definition 1**.**
The function is -resilient for some if for any set of coordinates and any assignment of values for the inputs , when the values are set uniformly at random then is uniformly distributed in .
We use the following existence result on resilient functions that was established by Chor et al. [12].
Theorem 11** (Chor et al. [12]).**
For every large enough , there is a function that is -resilient and satisfies .
We use resilient functions to bound the query complexity of functions via the following lemma.
Lemma 12**.**
Fix a finite set of cardinality for some integer and let be an -resilient function. Then for every function and every ,
[TABLE]
Proof.
The upper bounds follow immediately from the observation that if is a randomized algorithm that computes with -error, then we can define a algorithm that computes with the same error by simulating and querying the bits to observe the value to return to each query.
For the lower bounds, let be a randomized algorithm that computes with error at most . We define an algorithm for computing that simulates in the following way. For the first queries to a cell, answers the queries with uniformly random variables in . On a query to the -th bit of a cell, queries the value of the corresponding cell in . It then draws a value in uniformly at random among all values that agree with the bits output so far. The current query and all further queries to bits of that cell are then answered using . Once terminates, returns ’s output and terminates as well.
The correctness of follows directly from the correctness of . Furthermore, on any input for which makes queries, makes at most queries since distinct queries of are required for each query that eventually makes to . Thus both the average-case and worst-case query complexities of are bounded by times the corresponding query complexities of . ∎
We are now ready to complete the proof of the separation theorem.
Proof of Theorem 1.
Fix for any integer so that is a power of 2. Fix a -resilient function for some and define the function . By Lemmas 12 and 10,
[TABLE]
In particular, setting we obtain .
Using Lemma 12, 9, and 8, we obtain the chain of inequalities
[TABLE]
By Lemma 7, when this implies that
[TABLE]
Theorem 1 is obtained by noting that EncFcn is a function on variables. ∎
3 Strong Direct Sum Theorem
We establish Theorem 2 by proving a corresponding direct sum result in the distributional model and applying a Yao minimax principle for algorithms that err and abort with bounded probability.
We introduce the model of algorithms that can abort in Section 3.1, where we also relate this model to the average query complexity setting of randomized algorithms and establish a Yao minimax principle. In Section 3.2, we establish the main technical result, a strong direct sum theorem for distributional complexity. We complete the proof of Theorem 2 itself in Section 3.3 and the proofs of Corollaries 3 and 4 are completed in Section 3.4.
3.1 Algorithms That Can Abort
We consider randomized algorithms that are allowed to err and abort. In this setting, an algorithm outputs instead of giving a valid output when it chooses to abort. Let and denote the distributional and randomized query complexities of when the algorithm aborts with probablity at most and errs with probability at most .
Randomized query complexity in the setting where algorithms can abort with constant probability is asymptotically equivalent to the average randomized query complexity.
Proposition 13**.**
For every function , every and every ,
[TABLE]
Proof.
For the first inequality, let be a randomized algorithm that computes with error and has expected query complexity . Let be the randomized algorithm that simulates except that whenever tries to make more than queries, it aborts. The algorithm also computes with error at most , and it has worst-case query complexity . Furthermore, by Markov’s inequality, aborts with probability at most .
For the second inequality, let be a randomized algorithm with query complexity that computes with error probability at most and abort probability at most . Let be the randomized algorithm that simulates until that algorithm does not abort, then outputs the same value. The error probability of conditioned on it not aborting is at most , so the algorithm also errs with probability at most , and its expected query complexity is . ∎
Yao’s minimax principle can be adapted for the setting of algorithms that abort as follows.
Lemma 14**.**
For any such that , we have
[TABLE]
Proof.
We handle the initial inequality (i.e., the easy direction) first. Fix a -query randomized algorithm achieving . By the guarantee of , we have that for any input , aborts with probability at most and errs with probabiltiy at most . Let and be indicator variables for the events that aborts on and errs on respectively. Then, we have and similarly when the expectation is taken over the randomness of the algorithm . Next, fix any input distribution and let . It follows that
[TABLE]
Using Markov’s inequality twice, we have
[TABLE]
By a union bound, there exists a setting of the random string such that both and . Fixing this gives a -query deterministic algorithm that aborts with probability at most and errs with probability at most , hence .
For the second inequality, let . Consider a two-player, zero-sum game where player 1 selects a -query deterministic algortihm for , player 2 selects an input , and player 1 is paid if aborts, if errs, and [math] otherwise. Note that each mixed strategy for player 1 corresponds to a randomized algorithm and each mixed strategy for player 2 corresponds to an input distribution . By our choice of , it follows that for any mixed strategy for player 2, player 1 can obtain payoff . By the minimax theorem, it follows that there is a mixed strategy for player 1 (i.e., a -query randomized algorithm ) that provides the same payoff for every choice of player 2. Finally, note that aborts with probability at most and errs with probability at most ; otherwise, the payoff would be less than . We’ve shown a -query randomized algorithm that aborts w/probability at most and errs w/probability at most , hence . ∎
3.2 Strong Direct Sum for Distributional Complexity
We prove a slightly more precise variant of Lemma 5.
Lemma 15**.**
For every function , every distribution on , and every ,
[TABLE]
Proof.
Let be a deterministic algorithm with query complexity that computes with error probability at most and abort probability at most when the input is drawn from . Then conditioned on not aborting, it outputs the correct value of with probability at least and
[TABLE]
This implies that at least indices satisfy
[TABLE]
otherwise the product in the product in the previous inequality would be less than , contradicting the lower bound on this product.
For each , let denote the number of queries that makes to on input . The query complexity of guarantees that for each input , . Therefore, and at least indices satisfy
[TABLE]
Thus, some index satisfies both (6) and (7). Fix such an index . For inputs and , write to be the input obtained by replacing with in . With this notation, the two conditions (6) and (7) satisfied by can be rewritten as
[TABLE]
and
[TABLE]
The correctness of also guarantees that
[TABLE]
and
[TABLE]
Therefore, by Markov’s inequality, there exists an input such that
[TABLE]
Let be the deterministic algorithm that computes by simulating on the input with two additions:
If attempts to query more than bits of , aborts, and 2. 2.
When terminates, the algorithm first verifies that the output generated by satisfies . If so returns the value ; if not, aborts.
The algorithm has query complexity at most and, by the conditions satisfied by , it aborts with probability at most and errs with probability at most when . ∎
3.3 Proof of Theorem 2
We now complete the proof of Theorem 2. Fix . By Proposition 13 and the second inequality of Lemma 14,
[TABLE]
Let denote a distribution where the maximum is attained. By Lemma 15,
[TABLE]
Using the first inequality of Lemma 14 we then obtain
[TABLE]
Combining these inequalities and applying Proposition 13 once more yields
[TABLE]
Theorem 2 follows from the identity \overline{\mathrm{R}}_{\frac{\epsilon}{k}}(f)=\Theta\big{(}\overline{\mathrm{R}}_{\frac{96\epsilon}{k}}(f)\big{)} obtained from the standard success amplification trick. ∎
3.4 Proof of Corollaries 3 and 4
Corollary 3 is obtained as a direct consequence of Theorems 1 and 2.
Proof of Corollary 3.
The upper bound is via the universal bound (3). For the matching lower bound, let be a function that satisfies the condition of Theorem 1. By Theorem 2, the randomized communication complexity of satisfies
[TABLE]
By Theorem 1,
[TABLE]
as long as . Combining those inequalities yields \mathrm{R}(f^{k})=\Omega\big{(}k\log k\cdot\mathrm{R}(f)\big{)}, as we wanted to show. ∎
The proof of Corollary 4 uses the following randomized query-to-communication lifting theorem of Göös, Pitassi, and Watson [17].
Theorem 16** (Göös, Pitassi, Watson).**
Define to be the index function mapping to and fix . For every ,
[TABLE]
and
[TABLE]
Remark**.**
The statement of Theorem 16 in [17] only mentions the first identity explicitly. However, as discussed in their Section II, the theorem statement holds for functions with any finite range.222In fact, their theorem also holds in even more general settings such as when is a partial function or a relation, for example. Therefore, the theorem holds for the function as well as .
Proof of Corollary 4.
By Corollary 3, there exists a function which satisfies . Combining this result with Theorem 16, we obtain
[TABLE]
4 Conclusions and Open Problems
In this work, we prove a strong direct sum theorem for average-case randomized query complexity—to compute with probability in the average-case query complexity model, one must spend times the resources of computing with error . We then give the first total function whose query complexity scales with , matching the blowup one gets from standard error reduction. We believe a strong direct sum-type theorem should also hold for many composed functions. A natural first step would be to prove an XOR Lemma in the strong direct sum setting.
Conjecture 1**.**
(Strong Direct Sum for xor.) For all functions , all positive integers , and all ,
[TABLE]
We believe a similar result should hold for the majority function .
Conjecture 2**.**
(Strong Direct Sum for maj.) For all functions , all positive integers , and all ,
[TABLE]
More generally, let us say that a total function admits a strong direct sum theorem if for all functions , for all positive integers , and for all , we have . Using this terminology, Conjectures 1 and 2 posits that and admit strong direct sum theorems. These two conjectures are special cases of the following general problem.
Open Problem 1**.**
Which functions admit strong direct sum theorems?
In a different direction, we note that while we identified one total function whose randomized query complexity scales logarithmically with the inverse of the error parameter , we don’t have a good understanding of which functions scale this way.
Open Problem 2**.**
Characterize the functions whose average-case randomized query complexity satisfies for all .
Acknowledgments
The first author thanks Alexander Belov and Shalev Ben-David for enlightening discussions and helpful suggestions. The second author thanks Peter Winkler for insightful discussions. Both authors wish to thank the anonymous referees for valuable feedback and for the reference to [26].
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Scott Aaronson, Shalev Ben-David, and Robin Kothari. Separations in query complexity using cheat sheets. In Proceedings 48th Annual ACM Symposium on Theory of Computing , pages 863–876, 2016.
- 2[2] Andris Ambainis, Kaspars Balodis, Aleksandrs Belovs, Troy Lee, Miklos Santha, and Juris Smotrovs. Separations in query complexity based on pointer functions. Journal of the ACM , 64(5):32, 2017.
- 3[3] Andris Ambainis, Martins Kokainis, and Robin Kothari. Nearly optimal separations between communication (or query) complexity and partitions. In Proceedings 31st Annual Conference on Computational Complexity , page 4, 2016.
- 4[4] Anurag Anshu, Aleksandrs Belovs, Shalev Ben-David, Mika Göös, Rahul Jain, Robin Kothari, Troy Lee, and Miklos Santha. Separations in communication complexity using cheat sheets and information complexity. In Proceedings 57th Annual IEEE Symposium on Foundations of Computer Science , pages 555–564, 2016.
- 5[5] Ziv Bar-Yossef, Thathachar S Jayram, Ravi Kumar, and D Sivakumar. An information statistics approach to data stream and communication complexity. Journal of Computer and System Sciences , 68(4):702–732, 2004.
- 6[6] Boaz Barak, Mark Braverman, Xi Chen, and Anup Rao. How to compress interactive communication. SIAM Journal on Computing , 42(3):1327–1363, 2013.
- 7[7] Yosi Ben-Asher and Ilan Newman. Decision trees with AND, OR queries. In Proceedings 10th Annual Structure in Complexity Theory Conference , pages 74–81, 1995.
- 8[8] Shalev Ben-David and Robin Kothari. Randomized query complexity of sabotaged and composed functions. Theory of Computing , 14(1):1–27, 2018.
