On polynomial approximations over $\mathbb{Z}/2^k\mathbb{Z}$
Abhishek Bhrushundi, Prahladh Harsha, and Srikanth Srinivasan

TL;DR
This paper investigates how low-degree polynomials over rings of the form /2^k approximate Boolean functions, revealing that increasing k can sometimes improve approximation but does not always do so, especially for the majority function.
Contribution
It provides new insights into polynomial approximations over /2^k rings, showing both potential benefits and limitations of increasing k, and connects to non-classical polynomial bounds.
Findings
Increasing k can improve approximation for some functions.
For majority, increasing k does not improve approximation beyond classical bounds.
Results answer open questions about non-classical polynomial approximation.
Abstract
We study approximation of Boolean functions by low-degree polynomials over the ring . More precisely, given a Boolean function , define its -lift to be by . We consider the fractional agreement (which we refer to as ) of with degree polynomials from . Our results are the following: - Increasing can help: We observe that as increases, cannot decrease. We give two kinds of examples where actually increases. The first is an infinite family of functions such that . The second is an infinite family of functions such that -- as small as possible…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCoding theory and cryptography · Computability, Logic, AI Algorithms · semigroups and automata theory
On polynomial approximations over ††thanks: A
preliminary version of this paper appeared in Proc. th Annual Symp. on Theoretical Aspects of Comp. Science (STACS), 2017 [BHS17].
Abhishek Bhrushundi Department of Computer Science, Rutgers University, USA. [email protected]. Work done while the author was visiting the Tata Institute of Fundamental Research. Research supported in part by UGC-ISF grant 6-2/2014(IC).
Prahladh Harsha Tata Institute of Fundamental Research, India. [email protected]. Research supported in part by UGC-ISF grant 6-2/2014(IC).
Srikanth Srinivasan Department of Mathematics, Indian Institute of Technology, Bombay, India. [email protected].
We study approximation of Boolean functions by low-degree polynomials over the ring . More precisely, given a Boolean function , define its -lift to be by . We consider the fractional agreement (which we refer to as ) of with degree polynomials from .
Our results are the following:
- •
Increasing can help: We observe that as increases, cannot decrease. We give two kinds of examples where actually increases. The first is an infinite family of functions such that . The second is an infinite family of functions such that — as small as possible — but .
- •
Increasing doesn’t always help: Adapting a proof of Green [Comput. Complexity, 9(1):16–38, 2000], we show that irrespective of the value of , the Majority function satisfies
[TABLE]
In other words, polynomials over for large do not approximate the majority function any better than polynomials over .
We observe that the model we study subsumes the model of non-classical polynomials in the sense that proving bounds in our model implies bounds on the agreement of non-classical polynomials with Boolean functions. In particular, our results answer questions raised by Bhowmick and Lovett [In Proc. 30th Computational Complexity Conf., pages 72–-87, 2015] that ask whether non-classical polynomials approximate Boolean functions better than classical polynomials of the same degree.
1 Introduction
Many lower bound results in circuit complexity are proved by showing that any small sized circuit in a given circuit class can be approximated by a function from a simple computational model (e.g., small depth circuits by low-degree polynomials) and subsequently showing that this is not possible for some suitable “hard function”.
A classic case in point is the work of Razborov [Raz87] which shows lower bounds for , the class of constant depth circuits made up of AND, OR and gates. Razborov shows that any small circuit C can be well approximated by a low-degree multivariate polynomial in the sense that
[TABLE]
The next step in the proof is to show that the hard function, on the other hand, does not have any such approximation. Razborov does this for a suitable symmetric function, Smolensky [Smo87] for the function (for constant odd ), and Szegedy [Sze89] and Smolensky [Smo93] for the Majority function on bits.
Given the importance of the above lower bound, polynomial approximations in other domains and metrics have been intensely investigated and have resulted in interesting combinatorial constructions and error-correcting codes [Gro00, Efr12], learning algorithms [LMN93, KS04] and more recently in the design of algorithms for combinatorial problems [Wil14, AWY15] as well.
To describe the model of polynomial approximation considered in this paper, we first recall the Razborov [Raz87] model of polynomial approximation. Given a Boolean function and degree , Razborov considers the largest such that there is a degree polynomial that has agreement at least with (i.e., ). Call this . In this notation, Szegedy [Sze89] and Smolensky’s [Smo93] results for the Majority function can be succinctly stated as
[TABLE]
We consider a generalization of the above model to rings in the following simple manner. To begin with, we consider the ring . Given a Boolean function , let be the -lift of defined as (i.e., if and otherwise). Once again, we can define to be the largest such that there exists a degree polynomial that has agreement with . Note that since if, for instance, has agreement with , then also has the same agreement with . Hence, proving upper bounds for is at least as hard as proving upper bounds for .
More generally, we can extend these definitions to , the agreement of , the -lift of , defined as , with degree polynomials from . It is not hard to show that and hence as increases, the problem of proving upper bounds on can only get harder.
Our motivation for this model comes from a recent work of Bhowmick and Lovett [BL15], who study the maximum agreement between non-classical polynomials of degree and a Boolean function , which is similar to (see Section 5 for an exact translation between the above model and non-classical polynomials). In particular, non-classical polynomials of degree can be considered as a subset of the degree polynomials in . With respect to correlation111The correlation between is defined to be where is the primitive th root of unity in . If are -valued, then this quantity is exactly where is the agreement between and . Otherwise, however, it does not measure agreement., Bhowmick and Lovett showed that there exist non-classical polynomials (and hence polynomials in ) of logarithmic degree that have very good correlation with the function. With respect to agreement, they show that low-degree non-classical polynomials can only have small agreement with the Majority function. Their results stated in our language, imply that
[TABLE]
In particular, if , this result unfortunately does not give any non-trivial bound on the maximum agreement between non-classical polynomials of degree and the function. Bhowmick and Lovett, however, conjectured that this result could be improved and left open the question of whether non-classical polynomials of degree can do any better than classical polynomials of the same degree in approximating the Majority function. More generally, they informally conjectured that although non-classical polynomials achieve better correlation with Boolean functions than their classical counterparts, they possibly do not approximate Boolean functions any better than classical polynomials. Our work stems from trying to answer these questions.
1.1 Our results
We prove the following results about agreement of Boolean functions with polynomials over the ring :
We explore whether there exist Boolean functions for which agreement can increase by increasing . In particular, do there exist Boolean such that ?
It is not hard to show that this is impossible for . Further, it can be shown that if , then . Keeping this in mind, the first place where we can expect larger to show better agreement is vs. . Our first result shows that there are indeed separating examples in the regime.
- (a)
Fix to be any power of . For infinitely many , there exists a Boolean function such that but .
Note that since is Boolean, for any . We then ask if there exist Boolean functions such that is more or less the trivial bound of , while is significantly larger for and some . In this context, we show the following result.
- (b)
Fix any . For large enough , there is a Boolean function such that but , for . 2. 2.
We show that for , the majority function on bits, and any ,
[TABLE]
222The constant in the is an absolute constant.
by adapting a proof due to Green [Gre00] of a result on the approximability of the parity function by low-degree polynomials over the ring for prime .
Coupled with the observation that the class of polynomials over rings subsumes the class of non-classical polynomials, part of the first result provides a counterexample to an informal conjecture of Bhowmick and Lovett [BL15] that, for any Boolean function , non-classical polynomials of degree do not approximate any better than classical polynomials of the same degree, and the second result confirms their conjecture that non-classical polynomials do not approximate the Majority function any better than classical polynomials.
1.2 Organisation
We start with some preliminaries in Section 2. In Section 3, we show some separation results. Next, in Section 4, we prove upper bounds for . Finally, in Section 5, we discuss how our model relates to non-classical polynomials, answering questions raised by Bhowmick and Lovett.
2 Preliminaries
For , denotes the Hamming weight of , and for , is the least significant bit of in base . For , we use (resp. ) to denote the set of elements in of Hamming weight at most (resp. exactly ). We use to denote the collection of all Boolean functions defined on .
2.1 Elementary symmetric polynomials
Recall that for , the elementary symmetric polynomial of degree over , , is defined as . Here denotes addition modulo two. This may be interpreted as
[TABLE]
A direct consequence of Lucas’s theorem (see, e.g., [Knu97, Section 1.2.6, Ex. 10]) and Eq. 2.1 is the following:
Lemma 2.1**.**
For every , . More generally, where the product runs over all such that the least significant bit of the binary expansion of is .
The following result follows from the work of Green and Tao [GT09, Theorem 11.3], who build upon the ideas of Alon and Beigel [AB01].
Theorem 2.2** (Green-Tao [GT09], Alon-Beigel [AB01]).**
Fix . Then, for every multilinear polynomial of degree at most , we have .
Theorem 2.2 has a nice corollary:
Corollary 2.3**.**
For every fixed , the functions are almost balanced and almost uncorrelated, i.e.
- •
,
- •
, .
Combining Corollary 2.3 with Lemma 2.1, we get another useful fact:
Lemma 2.4**.**
Let be uniformly distributed over . Then, for every fixed , the random variables are almost uniform and almost -wise independent i.e.
- •
, .
- •
, .
2.2 Boolean functions and polynomials over
Given an and , we define the -lift of to be the function defined as follows. For any ,
[TABLE]
For and , will denote the set of multilinear polynomials of degree at most over the ring .
For functions for some finite domain and range , the agreement between and , denoted by , is defined to be the fraction of inputs where they agree, i.e.,
[TABLE]
We will consider how well multilinear polynomials of degree can approximate Boolean functions in the above sense. More precisely, for any Boolean function , we define
[TABLE]
Following [Gop08], we call a set an interpolating set333This is also called a hitting set in the literature. for if the only polynomial that vanishes at all points in is zero everywhere. Formally, for any ,
[TABLE]
We now state a number of standard facts regarding Boolean functions and multilinear polynomials over . The omitted proofs are either easy or well-known.
Unless mentioned otherwise, let be any integers satisfying .
Lemma 2.5**.**
Any polynomial satisfies the following:
(Schwartz-Zippel) If is non-zero, then . 2. 2.
* is the zero polynomial iff for all .* 3. 3.
(Möbius Inversion) Say , where and denotes . Then, where is the characteristic vector of . 4. 4.
(* is an interpolating set) vanishes at all points in iff vanishes at all points of . By shifting the origin to any point of , the same is true of any Hamming ball of radius in .*
Proof.
Point 1: Write as , where is the largest power of that divides the GCD of the coefficients of . Projecting to a non-zero polynomial over by dropping all its coefficients modulo and applying the standard Schwartz-Zippel lemma over completes the proof.
Point 2 follows from point 1, and point 4 from point 3. ∎
Lemma 2.6**.**
Fix any .
. 2. 2.
. 3. 3.
. 4. 4.
.
Proof.
Point 1 is trivial since there is a constant polynomial that has agreement at least with .
Point : Say has agreement with . Then, (interpreted naturally as a polynomial in ) has agreement with .
For point , consider a polynomial that achieves the maximum agreement with . Let be the polynomial obtained from by dropping all its co-efficients modulo . Note that for any , implies that (in the ring ). Hence, the probability that is zero is at least . Lemma 2.5 point 1 implies that must be the zero polynomial. Equivalently, all of the coefficients of are divisible by and hence can be naturally identified with for some . It is easy to check that and hence we have . On the other hand, from point , we already know that . Hence we are done.
Point follows from points and .
∎
3 Some separation results
3.1 Symmetric functions as separating examples
We know from Theorem 2.2 that, for every fixed , . In contrast, the main result of this section shows that
Theorem 3.1**.**
For every fixed , , where .
Notice that for . This implies that, for , is an example of a function for which there exist such that but for some .
Proof of Theorem 3.1.
Lemma 2.1 from Section 2 tells us that . Thus, , the -lift of , is given by
[TABLE]
Fix to be . Consider the polynomial in . To prove the theorem, it suffices to show that
[TABLE]
Clearly, , and
[TABLE]
The following theorem due to Kummer (see, e.g., [Knu97, Section 1.2.6, Ex. 11]) determines the largest power of a prime that divides a binomial coefficient.
Theorem 3.2** (Kummer).**
Let be a prime and such that . Suppose is the largest integer such that . Then is equal to the number of borrows required when subtracting from in base .
Let be the number of borrows required when subtracting from . Rewriting Eq. 3.2 in terms of using Kummer’s theorem, we get
[TABLE]
We will need the following lemma.
Lemma 3.3**.**
* if either*
, or 2. 2.
.
Proof.
Since , all the bits of except and are zero. Thus, when subtracting from , no borrows are required by the bits , .
Using the above observation, the reader can verify that when the number of borrows required is at least i.e. , which in turn implies that . Since , . This proves the second part of the lemma.
To prove the first part, suppose . Since , it follows that both and will need to borrow when subtracting from . As argued before, no borrows are required by the bits before (i.e. less significant than) , and thus the total number of borrows required by the bits , , is 2.
Note that the bit borrows from . Consider the following case analysis:
- •
Case : will not need to borrow since . In fact, none of the bits after (i.e. more significant than) will need to borrow, and thus . This implies that . We also have and hence .
- •
Case : will require a borrow and this means . This implies that . Since , it follows that .
This completes the proof. ∎
By Lemma 3.3, we have
[TABLE]
Using Lemma 2.4 from Section 2, we have
[TABLE]
which, together with Eq. 3.4, implies
[TABLE]
∎
3.2 A separation at
Let be any power of . In this section, we show that there are functions for which .
Theorem 3.4**.**
For large enough , there exists a function such that but .
In particular, we see that . This result is notable, since it shows that there is a separation at the first place where it is possible to have one (Recall that for any by Lemma 2.6).
Let us begin the proof of Theorem 3.4. We first define a family of Boolean functions on . We denote the variables by and . We use to denote the th elementary symmetric polynomial from the ring , i.e., .444We distinguish between and since the former is from and latter a polynomial in .
We will need the following easy corollary of Theorem 3.2.
Corollary 3.5**.**
Let be a power of . Then, for , the highest power of dividing is equal to the highest power of dividing .
Let . Given any function , we define the Boolean function as follows:
[TABLE]
Define . Note that is defined so that its -lift agrees with on points where . Also Corollary 3.5 implies that the following is an alternate equivalent definition of in terms of elementary symmetric polynomials modulo .
[TABLE]
We now begin the proof of Theorem 3.4. First of all, let us note that for any choice of , we have:
Lemma 3.6**.**
.
Proof.
Consider the polynomial defined above. From Eq. 3.5, it follows that the probability that 555 denotes the -lift of . is less than or equal to the probability that , which is by Corollary 2.3. This gives the claim. ∎
The main lemma is the following.
Lemma 3.7**.**
Say is chosen uniformly at random. Then,
[TABLE]
This will prove Theorem 3.4. We will prove the above lemma in the following subsection.
3.3 Proof of Lemma 3.7
The outline of the proof is as follows. Fix any polynomial of degree at most . We need to show that for a random . The fact that is random ensures that any cannot agree with on significantly more than half the inputs in . For inputs outside , we need a more involved argument, following Alon and Beigel [AB01]. We show that for any we can find somewhat large sets and of and variables respectively such that when we set the variables outside , we obtain a polynomial that is symmetric in the variables of . This is a Ramsey theoretic argument ála Alon-Beigel [AB01].
Following this argument, we only need to prove the agreement upper bound for that is symmetric in and variables. This can be done by reduction to a constant-sized problem, as we show below. A careful computation to solve the constant-sized problem will finish the proof.
We begin with some notation that will be useful in the proof. Throughout, we work with disjoint sets of -variables and -variables of equal size and consider polynomials over these variables. Let the -variables be and the -variables be . For , the set of -polynomials over the variables and is denoted . Similarly, we use to denote the fact that is a polynomial only over the variables .
We use to denote Boolean assignments . Given and , we use to denote its natural restriction to the variables indexed by .
We say that is -symmetric if it is a linear combination of the polynomials in the set . We note that being -symmetric depends on the sets under consideration. This will be implicit when used.
Given a multilinear monomial over the and -variables, its multidegree is defined to be if multiplies -variables and -variables. Let be the set of multidegrees of monomials of degree at most . We order in ascending order according to , i.e., fix a total ordering of such that if , then 666If , but , then the relation between and is fixed in an arbitrary manner.. Let be the largest element in the ordering . We will define to be the largest (w.r.t. ) multidegree of a monomial that has a non-zero coefficient in . For such that , we say that a polynomial is -symmetric if we can write as
[TABLE]
where and is -symmetric. Note that if , then any polynomial of degree at most is -symmetric, since we can take and .
We also need the following variant of the function defined above. Call a function (resp. ) -simple w.r.t. (resp. w.r.t. ) if it is a linear combination of symmetric polynomials in (resp. in ) of degree strictly less than . Equivalently, we can say that only depends upon , and similarly for w.r.t. .
Given pairs of polynomials , and , such that and are -simple w.r.t. and respectively, define
[TABLE]
Also, define .
With the notation above, we are ready to state a claim that generalizes Lemma 3.7.
Lemma 3.8**.**
Fix any . For and , there is an such that given any , for uniformly random , we have for any and such that are -simple w.r.t. , and are -simple w.r.t. ,
[TABLE]
The statement of the above lemma for and implies Lemma 3.7 since in this case and as noted above, any polynomial of degree at most is -symmetric.
The proof of Lemma 3.8 is by induction on the order . The base case is the case when , the minimal element of the ordering .
Throughout, the parameter is a fixed integer power of .
3.3.1 Base case of the induction:
In this case, by Eq. 3.6, it is clear that is an -symmetric polynomial. We show in this case that bounding reduces (for most ) to bounding the correlations between functions on inputs. A simple computation solves this problem.
Fix any bits . Define the Boolean function on variables as follows. Notice the similarity to Eqs. 3.5 and 3.7.
[TABLE]
Call a polynomial relevant if is a linear combination of monomials from the set . Let denote the set of relevant polynomials. Note that relevant polynomials do not involve the variable .
Given and as in the statement of Lemma 3.8, we define to be
We say that is -hard if for any of degree at most , we have
We need the following property of a random .
Lemma 3.9**.**
For any , there is an such that if , then for chosen uniformly at random \mathop{\mathrm{Pr}}_{H}[\text{H\varepsilon-hard}]\leq\varepsilon.
Proof.
The proof is a trivial union bound. The number of polynomials of degree at most is at most (there are many possible monomials each has possible coefficients). For each such , the expected number of locations where is . By a Chernoff bound, the probability that this number is not in the range is By Corollary 2.3, it follows that , and hence the above probability can be upper bounded by . A union bound over all the possible tells us that with probability over the choice of , every of degree at most satisfies . In particular, for any , a large enough will ensure that the probability that is not -hard is at most . ∎
We will prove the following lemmas.
Lemma 3.10**.**
Fix any as in the statement of Lemma 3.8. For any , there is an such that for any
[TABLE]
Lemma 3.11**.**
.
The above lemmas clearly prove Eq. 3.8 in the case , which completes the base case.
Proof of Lemma 3.10.
We choose during the course of the proof. First of all, we will assume that , so that we have
[TABLE]
We now show that when is -hard, then for any that is -symmetric of degree at most , we have
[TABLE]
This will prove the lemma. Fix any -hard for the remainder of the lemma.
Since is -symmetric, it follows that we can write
[TABLE]
for some choice of the s from .
Let be a new variable taking values in . We now define as follows. We set for all and for all . Note that we have since does not depend on the random variable . Further, since is -hard, we know that .
In particular, we see that . Since and agree outside , this implies that
[TABLE]
So to upper bound , we upper bound . Assume .
Consider . By Eq. 3.12, we have
[TABLE]
where the s are in , being the least significant bits of (and similarly for ) and we have used Lemma 2.1 for the final equality above.
Now consider . By the definition of above, Eq. 3.7, and once again using Lemma 2.1, we have
[TABLE]
where , being -simple, are functions of , and similarly, are functions of .
Let be new variables. Define by replacing by and by in Eq. 3.14 above. That is,
[TABLE]
and similarly define by replacing by and by for each in the definition of above. We have
[TABLE]
By Lemma 2.4, we know that if is large enough, then for uniformly random , the tuples and are -close to the uniform distribution (in statistical distance) over . Note also that are mutually independent. From this, it easily follows that the final expression in the above display is -close to . Thus, we get
[TABLE]
Therefore, we analyze . Conditioning on any setting of , we see that the functions (being -simple) are fixed to some constants in and hence simplifies to a polynomial . Similarly, simplifies to some . Further, note that by the constraints on sets and in Eq. 3.16, must be a linear combination of monomials from the set . Renaming variables to respectively, we see that . Since this is true of any , the same upper bound holds for as well.
Combined with Eq. 3.17 and Eq. 3.13, this yields . Since this is true for every -hard function , and the probability that a random is -hard is at least , we are done. ∎
Proof of Lemma 3.11.
We prove the statement by a simple case analysis.
The first case is that the relevant polynomial depends on at least one among . Without loss of generality, we assume that depends on . Then, by the definition of , we can write where . Consider any setting of such that . Under this restriction, is a constant function whereas is a non-constant linear function depending on . Hence, when , and can agree on at most half the inputs. Thus we get that .
The second case is that depends on neither nor . In this case, consider any setting of . Under each of these restrictions, computes the constant function (recall that does not depend on ) whereas is a non-constant linear function. Thus, as before, we get that . This proves the lemma. ∎
3.3.2 The induction case
We now induct. Let be non-minimal and let be its predecessor w.r.t. . Assume Lemma 3.8 for -symmetric polynomials. We now prove it for -symmetric polynomials.
We will need the following basic Ramsey-theoretic statement. It is a straightforward generalization (to hypergraphs) of the fact that any large enough bipartite graph contains large bipartite independent sets or complete bipartite subgraphs. Unfortunately we could not find exactly this statement in the literature, so we provide a proof of the statement in Appendix A.
Let and be disjoint sets of size each. A function is said to be an -colouring of . (Recall that denotes the collection of all -sized subsets of .)
Lemma 3.12**.**
For any and any , there is an such that for any , any disjoint -sets and any -colouring of , there are sets and with such that the restriction of to is a constant function.
We now prove the inductive case of Lemma 3.8. Let be -symmetric. By Eq. 3.6, we have
[TABLE]
where is -symmetric, has multidegree at most , is the part of of multidegree exactly , and is the part of multidegree strictly less than (i.e. at most ).
Use to define an -colouring of as follows. For and , we define to be the coefficient of the monomial in . Applying Lemma 3.12 with , we see that if , then there are and such that for all and , we have .
Assume that is as in Eq. 3.18 and . We find as above. For any setting , we can write the polynomial as where is the part of degree , and has degree strictly less than .
Observe that
[TABLE]
and is an -symmetric polynomial (on the remaining variables ). Hence, by Eq. 3.18, we get
[TABLE]
As observed above, is -symmetric. Further, it is easily checked that any restriction of an -symmetric polynomial continues to be -symmetric on the remaining variables. Hence, is also -symmetric. Further, note that has multidegree at most . Also, by definition, the degree of is strictly less than and hence the multidegree of is at most . Altogether, this implies that is a sum of an -symmetric polynomial (i.e. ) and a polynomial of multidegree at most (i.e. . Thus, is an -symmetric polynomial on -variables and -variables, where .
Now, we analyze . Note that choosing a random function is the same as choosing each of its restrictions independently and uniformly at random.
We claim that for each , by the induction hypothesis, we have
[TABLE]
Assuming the above, we show how to finish the proof. Let denote the number of such that . The random variable777Note that is a random variable since it depends on the random function . is a sum of independent [math]- random variables with . Thus, by the Chernoff bound, we have
[TABLE]
In the event that , we have . Hence, we see that in this case
[TABLE]
In particular, the probability that there is any of degree at most such that can be upper bound bounded, using Eq. 3.20 and a union bound over all such , by
[TABLE]
Here, we have used the fact that the number of polynomials of degree at most is equal to the number of ways of choosing the coefficients (in ) of many monomials. The second inequality follows from the fact that . The final inequality is true as long for some .
Overall, we see that if we define , then for any , we have the statement of the lemma for -symmetric polynomials. This completes the induction.
It remains to prove Eq. 3.19. Fix any . Let . We use and to denote assignments to the variables indexed by and respectively and and to denote their natural completions to an assignment to all the variables (i.e. the other variables are assigned by ). Assume that .
By the definition of in Eq. 3.7 and using Lemma 2.1, we have
[TABLE]
where , being -simple, are functions of , and similarly, are functions of .
We would like to write the above in terms of the bits of and . This is done as follows. Consider the case of . Let denote the number of s assigned by to the variables. Note that , and hence it follows that the function is a function of and hence -simple w.r.t. ; similarly, is -simple w.r.t. . Similarly, we can also write for some -simple depending on ; also, for some -simple depending on .
Further elementary reasoning (left to the reader) allows us to deduce that there are -simple (depending on ) such that
[TABLE]
The above along with Eq. 3.22 gives us
[TABLE]
where for each , satisfies , and is hence -simple w.r.t. , and similarly is -simple w.r.t. . Hence, we see that (i.e. same as our hard function, but on inputs). Using the fact that , the induction hypothesis gives us
[TABLE]
In particular, since is -symmetric, we have , which establishes Eq. 3.19 and completes the proof.
4 Upper bounds for
In this section, we show an upper bound on where denotes the Majority function on bits888We define the majority function as iff .
Theorem 4.1**.**
For any , .
The proof of Theorem 4.1 presented below is an adaptation of techniques appearing in a work of Green [Gre00], who proved a similar result on the approximability of the parity function by polynomials over the ring , for prime .
We will need some definitions and facts about .
We use to denote the unique ring homomorphism from to . Its kernel is the set of non-invertible elements in .
We call a set forcing for if any polynomial that vanishes over is forced to take a value in at all points . Formally,
[TABLE]
Define the polynomial to be the polynomial obtained by applying the map to each of the coefficients of . Since a multilinear polynomial in is the zero polynomial iff it vanishes at all points of (by Lemma 2.5), we see that is forcing iff .
Note that any interpolating set for (see Section 2 for the definition) is forcing for , but the converse need not be true.
We now adapt the proof of Lemma 11 in [Gre00] to bound the size of forcing sets for .
Lemma 4.2**.**
If is forcing for , then .
Proof.
Assume for the sake of contradiction that is forcing for and . The latter implies the existence of a non-zero multilinear polynomial of degree at most satisfying for all .
Let be the polynomial in obtained by first clearing out the denominators of the coefficients of , followed by dividing the resulting polynomial by the GCD of all the coefficients. Finally, let be any polynomial in satisfying . It follows that is a non-zero polynomial of degree at most such that , since would imply that every coefficent of (and thus every coefficent of ) is divisible by two, which is impossible since the coefficients of have no common divisor.
To complete the proof, observe that for all , and since is forcing for , this implies that , which is a contradiction.
∎
We now use Lemma 4.2 to prove Theorem 4.1.
Proof of Theorem 4.1.
We assume throughout that ; otherwise, there is nothing to prove. Let be the -lift of the function. Let be arbitrary and let . We want to show that . We will argue by contradiction. So assume that .
Let be the complement of , i.e. the set of points where makes an error in computing . We have . We will try to find a degree (for suitable ) polynomial such that vanishes at all points in but has the property that is a unit (i.e. ) for some . To be able to do this, we need the fact that is not forcing for . By Lemma 4.2, if is indeed forcing for , then
[TABLE]
where the last equality follows if we choose . This contradicts our upper bound on the size of . Hence, cannot be forcing for . In particular, we can find that vanishes on and furthermore, for some .
We now claim that for some of Hamming weight . To see this, consider the polynomial . By construction of , we know that is a non-zero polynomial of degree . Hence, by Lemma 2.5, is non-zero when restricted to the Hamming ball of radius around the all s vector. In particular, this implies that there is an input of Hamming weight where is non-zero and hence , or equivalently . Fix this for the remainder of the proof. Note that since vanishes on .
Now, consider the polynomial . We first show that for all of Hamming weight . Consider any of Hamming weight . If , then since . On the other hand, if , then since has Hamming weight . Thus, vanishes at all inputs of Hamming weight .
Since the degree of is at most and vanishes at all inputs of , this implies (by Lemma 2.5) that must be [math] everywhere. However, at , . This yields the desired contradiction. ∎
5 Connection to non-classical polynomials
Let denote the one dimensional torus. Observing that the additive structure of is isomorphic to the additive subgroup , we can think of a Boolean function as a function , and conversely, a map as a Boolean function.
Tao and Ziegler [TZ12] give a characterization of non-classical polynomials as follows:
Definition 5.1** (Tao and Ziegler [TZ12]).**
A function is a non-classical polynomial of degree if and only if it has the following form:
[TABLE]
Here , and are uniquely determined. is called the shift of , and the largest such that for some is called the depth of .
Since we are interested in the agreement of a non-classical polynomial with Boolean (-valued) functions, we will only consider polynomials with shift , where is the depth of the polynomial and .
Remark 5.2**.**
Classical polynomials are non-classical polynomials with and depth . It is easy to see that every classical polynomial corresponds to a Boolean function. It is also not hard to show that every Boolean function can be represented as a classical polynomial.
The following lemma relates our model to non-classical polynomials:
Lemma 5.3**.**
Let be a Boolean function, and , .
If there is a non-classical polynomial of degree and depth satisfying , then there is a satisfying , where is the -lift of . 2. 2.
If there is a satisfying , then there is a non-classical polynomial of degree and depth satisfying .
Proof.
Fix , , and for the rest of the proof.
Proof of : Let be a non-classical polynomial of degree and depth with . It is not hard to verify that can be written in the following form (See, e.g., proof of Lemma in [BL15]):
[TABLE]
where is of degree .
Suppose . Choose , , satisfying
[TABLE]
By our choice of , we have that, for every and ,
[TABLE]
It follows that .
Proof of : Let such that . Using arguments similar to above, we can find a of degree such that , for all .
Define as
[TABLE]
By comparing to the form in Definition 5.1, it is easy to see that is a non-classical polynomial of degree at most and depth . Furthermore, we have that, for all and ,
[TABLE]
This completes the proof.
∎
The first part of Lemma 5.3 implies the following corollary of Theorem 4.1:
Corollary 5.4**.**
Let be a non-classical polynomial of degree . Then,
[TABLE]
This proves a conjecture of Bhowmick and Lovett [BL15] that non-classical polynomials of degree do not approximate the Majority function any better than classical polynomials of the same degree.
The following is a consequence of Theorem 2.2 and the first part of Lemma 5.3:
Corollary 5.5**.**
Let . Then, for every classical polynomial of degree ,
[TABLE]
On the other hand, the second part of Lemma 5.3 and Theorem 3.1 imply
Corollary 5.6**.**
For every , there is a non-classical polynomial of degree and depth such that
[TABLE]
Noting that for , Corollary 5.5 and Corollary 5.6 imply the following:
Theorem 5.7**.**
There is a Boolean function and , such that for every classical polynomial of degree at most , we have
[TABLE]
but there is a non-classical polynomial of degree satisfying
[TABLE]
This provides a counterexample to an informal conjecture of Bhowmick and Lovett [BL15] that, for any Boolean function , non-classical polynomials of degree do not approximate any better than classical polynomials of the same degree.
6 Acknowledgements
We would like to thank David Barrington for taking the time to explain Szegedy’s [Sze89] result to us, Arkadev Chattopadhyay for referring us to Green’s result [Gre00], and Swagato Sanyal for helpful discussions. We are also grateful to the anonymous reviewers for their detailed and helpful comments. In particular, we thank an anonymous reviewer for STACS 2017 who pointed out an error in the induction case of a previous proof of Lemma 3.8.
Appendix A Proof of Lemma 3.12
Proof.
Note that the constraint is easy to satisfy since if the latter part of the lemma holds for some , then it continues to be the case for . So we ignore the constraint for the rest of the proof.
We prove by induction the following stronger statement. For any and any , there is an such that for any , any disjoint -sets and any -colouring of , one of the following holds.
- •
There are sets and with and such that the restriction of to is the constant [math] function.
- •
There are sets and with and such that the restriction of to is the constant function.
Setting above clearly yields the lemma.
The proof is by induction on . Note that the statement is trivial when , since a -colouring is by definition a constant function. So we can take for any .
Now consider the case when and ; w.l.o.g. assume . In this case, the function is essentially a colouring of and hence the statement of the lemma reduces to the case of the standard Ramsey theorem for -uniform hypergraphs. Thus, we know that exists in this case. This completes the base case.
For the induction, assume the statement for any and any with for some . Consider the case of such that . Assume w.l.o.g. that . We now proceed by induction on .
The base case of the induction is when , which is trivial as . For the induction case, assume that and we have the statement for smaller values of . W.l.o.g. assume that .
By the induction hypotheses, we know the existence of
[TABLE]
and . We claim that has the required properties.
To see this, consider any -colouring of with . Fix an arbitrary and . Note that for and , we obtain a colouring of by setting . Since , we know that there exist and of size each such that the restriction of to is a constant. Equivalently, there is an such that for each and , we have
Assume . Now, consider the restriction of to . Since , we see that there exist and satisfying one of the following.
- •
and , and the restriction of to is the constant [math] function.
- •
and , and the restriction of to is the constant function.
In the former case, we can take and to prove the inductive case. Note that since is the constant [math] function on , the restriction of to is also the constant [math] function.
In the latter case, we just take and , since we are guaranteed that the restriction of to is the constant function.
In the case that , we repeat the same proof except that we use the fact that to prove that there exist and satisfying one of the following.
- •
and , and the restriction of to is the constant [math] function.
- •
and , and the restriction of to is the constant function.
This proves the inductive case, and hence completes the proof. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[AB 01] Noga Alon and Richard Beigel . Lower bounds for approximations by low degree polynomials over ℤ m subscript ℤ 𝑚 \mathbb{Z}_{m} . In Proc. 16 16 16 th IEEE Conf. on Computational Complexity , pages 184–187. 2001. doi:10.1109/CCC.2001.933885 . · doi ↗
- 2[AWY 15] Amir Abboud , Richard Ryan Williams , and Huacheng Yu . More applications of the polynomial method to algorithm design . In Proc. 26 26 26 th Annual ACM-SIAM Symp. on Discrete Algorithms (SODA) , pages 218–230. 2015. doi:10.1137/1.9781611973730.17 . · doi ↗
- 3[BHS 17] Abhishek Bhrushundi , Prahladh Harsha , and Srikanth Srinivasan . On polynomial approximations over Z / 2 k Z 𝑍 superscript 2 𝑘 𝑍 {Z/2^{k}Z} . In Brigitte Vallée and Heribert Vollmer , eds., Proc. 34 34 34 th Annual Symp. on Theoretical Aspects of Comp. Science (STACS) , volume 66 of LIP Ics . Schloss Dagstuhl, 2017.
- 4[BL 15] Abhishek Bhowmick and Shachar Lovett . Nonclassical polynomials as a barrier to polynomial lower bounds . In Proc. 30 30 30 th Computational Complexity Conf. , pages 72–87. 2015. ar Xiv:1412.4719 , doi:10.4230/LIP Ics.CCC.2015.72 . · doi ↗
- 5[Efr 12] Klim Efremenko . 3-query locally decodable codes of subexponential length . SIAM J. Comput., 41(6):1694–1703, 2012. (Preliminary version in 41st STOC , 2009). eccc:TR 08-069 , doi:10.1137/090772721 . · doi ↗
- 6[Gop 08] Parikshit Gopalan . Query-efficient algorithms for polynomial interpolation over composites . SIAM J. Comput., 38(3):1033–1057, 2008. (Preliminary version in 17th SODA , 2006). doi:10.1137/060661259 . · doi ↗
- 7[Gre 00] Frederic Green . A complex-number Fourier technique for lower bounds on the mod-m degree . Comput. Complexity, 9(1):16–38, 2000. (Preliminary version in 12th STACS , 1995). doi:10.1007/PL 00001599 . · doi ↗
- 8[Gro 00] Vince Grolmusz . Superpolynomial size set-systems with restricted intersections mod 6 and explicit Ramsey graphs . Combinatorica, 20(1):71–86, 2000. doi:10.1007/s 004930070032 . · doi ↗
