Ramsey numbers and monotone colorings
Martin Balko
Department of Applied Mathematics and Institute for Theoretical Computer Science, Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic
[email protected]
Department of Computer Science, Faculty of Natural Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
Abstract.
For positive integers N and r≥2, an r-monotone coloring of (r{1,…,N}) is a 2-coloring by −1 and +1 that is monotone on the lexicographically ordered sequence of r-tuples of every (r+1)-tuple from (r+1{1,…,N}).
Let Rmon(n;r) be the minimum N such that every r-monotone coloring of (r{1,…,N}) contains a monochromatic copy of (r{1,…,n}).
For every r≥3, it is known that Rmon(n;r)≤towr−1(O(n)), where towh(x) is the tower function of height h−1 defined as tow1(x)=x and towh(x)=2towh−1(x) for h≥2.
The Erdős–Szekeres Lemma and the Erdős–Szekeres Theorem imply Rmon(n;2)=(n−1)2+1 and Rmon(n;3)=(n−22n−4)+1, respectively.
It follows from a result of Eliáš and Matoušek that Rmon(n;4)≥tow3(Ω(n)).
We show that Rmon(n;r)≥towr−1(Ω(n)) for every r≥3.
This, in particular, solves an open problem posed by Eliáš and Matoušek and by Moshkovitz and Shapira.
Using two geometric interpretations of monotone colorings, we show connections between estimating Rmon(n;r) and two Ramsey-type problems that have been recently considered by several researchers.
Namely, we show connections with higher-order Erdős–Szekeres theorems and with Ramsey-type problems for order-type homogeneous sequences of points.
We also prove that the number of r-monotone colorings of (r{1,…,N}) is 2Nr−1/rΘ(r) for N≥r≥3, which generalizes the well-known fact that the number of simple arrangements of N pseudolines is 2Θ(N2).
The project leading to this application has received funding from European Research Council (ERC)
under the European Unions Horizon 2020 research and innovation programme under grant agreement No.
678765.
The author was also supported by the grant 1452/15 from Israel Science Foundation and by the grant no. 18-13685Y of the Czech Science Foundation (GAČR).
The author also acknowledges the support of the Center for Foundations of Modern Computer Science (Charles University project UNCE/SCI/004).
1. Introduction
Let r≥2 be an integer.
An ordered r-uniform hypergraph is a pair H=(H,≺) consisting of an r-uniform hypergraph H and a total ordering ≺ of the vertices of H.
Let H1=(H1,≺1) and H2=(H2,≺2) be two ordered r-uniform hypergraphs.
We say that H1 and H2 are isomorphic if there is an isomorphism between H1 and H2 that preserves the orders ≺1 and ≺2.
The ordered hypergraph H1 is an ordered sub-hypergraph of H2 if H1 is a sub-hypergraph of H2 and ≺1 is a suborder of ≺2.
For a positive integer n, we let Knr be the ordered complete r-uniform hypergraph on n vertices.
That is, the edge set of Knr consists of all r-element subsets of the vertex set.
We also use Pnr to denote the monotone r-uniform path on n vertices.
That is, Pnr=(Pnr,≺) is an ordered r-uniform n-vertex hypergraph with edges formed by r-tuples of consecutive vertices in ≺.
A coloring c of an ordered r-uniform hypergraph H is a function that assigns some element from a finite set C of colors to each edge of H.
We say that H is monochromatic in c if all edges of H receive the same color via c.
If ∣C∣=k, then we call c a k-coloring of H.
The ordered Ramsey number R(H) of an ordered r-uniform hypergraph H is the minimum positive integer N such that for every 2-coloring c of KNr there is a sub-hypergraph of KNr that is monochromatic in c and isomorphic to H.
It follows from Ramsey’s theorem that ordered Ramsey numbers always exist and are finite.
There are examples of ordered graphs G=(G,≺), for which ordered Ramsey numbers R(G) differ significantly from the standard Ramsey numbers R(G).
For example, there are ordered matchings M=(M,≺) on n vertices for which R(M) is only linear in n, while R(M) grows superpolynomially in n [1, 4].
The motivation for studying the growth rate of the ordered Ramsey numbers R(Pnr) of monotone r-uniform paths comes from the classical paper by Erdős and Szekeres [9].
In this paper, which was one of the starting points of both Ramsey theory and discrete geometry, Erdős and Szekeres independently reproved Ramsey’s Theorem and also proved two other important results in Ramsey theory, the Erdős–Szekeres Theorem about point sets in convex position and the Erdős–Szekeres Lemma on monotone subsequences.
The latter results states that for every n∈N there is a positive integer N(n)=(n−1)2+1 such that every sequence of N(n) numbers contains a nondecreasing or a nonincreasing subsequence of length n.
Moreover, the number N(n) is minimum possible, as there are sequences of (n−1)2 numbers without a monotone subsequence of length n.
It is easy to show that N(n)≤R(Pn2).
In fact, N(n)=R(Pn2)=(n−1)2+1 [17].
The Erdős–Szekeres Theorem states that for every n∈N there is a positive integer ES(n) such that every set of ES(n) points in the plane with no three collinear points contains n points that are vertices of a convex n-gon.
This result is closely connected to the problem of estimating R(Pn3).
Erdős and Szekeres [9] showed ES(n)≤(n−22n−4)+1.
We can again rather easily show that ES(n)≤R(Pn3).
The bound of Erdős and Szekeres then follows from the fact R(Pn3)=(n−22n−4)+1 for every n≥2 [13, 19].
Moreover, several other interesting geometric applications of estimates on R(Pnr) for r≥3 appeared, for example, variants of the Erdős–Szekeres Theorem for convex bodies [13] or the higher-order Erdős–Szekeres theorems [6].
Given this motivation, the ordered Ramsey numbers R(Pnr) have been recently quite intensively studied [6, 13, 17, 19] and their growth rate is nowadays well understood.
For positive integers n and h, let towh(n) be the tower function of height h−1.
That is, tow1(n)=n and towh(n)=2towh−1(n) for every h≥2.
Moshkovitz and Shapira [19] showed that, for all positive integers n and r with r≥3,
[TABLE]
In fact, Moshkovitz and Shapira [19] proved R(Pn+r−1r)=ρr(n)+1, where ρr(n) is the number of line partitions of n of order r (see [19] for definitions).
For r=3, this gives the exact formula R(Pn3)=(n−22n−4)+1 and yields a new proof of the Erdős–Szekeres Theorem [9].
Their coloring c of KN3=(KN3,≺) that gives R(Pn3)>(n−22n−4) satisfies the following transitivity property: if v1≺v2≺v3≺v4 are vertices of KN3 such that c({v1,v2,v3})=c({v2,v3,v4}), then all triples from (3{v1,v2,v3,v4}) have the same color in c.
More generally, for an integer r≥2, a 2-coloring c of KNr=(KNr,≺) is called transitive if for every (r+1)-tuple of vertices {v1,…,vr+1} that satisfies v1≺⋯≺vr+1 and c({v1,…,vr})=c({v2,…,vr+1}) it holds that all r-tuples from (r{v1,…,vr+1}) have the same color in c.
For an ordered hypergraph H, let Rtrans(H) be the number R(H) restricted to transitive 2-colorings.
That is, Rtrans(H) is the minimum positive integer N such that for every transitive 2-coloring c of KNr there is an ordered sub-hypergraph of KNr that is monochromatic in c and isomorphic to H.
Note that Rtrans(Pnr)=Rtrans(Knr) for all positive integers n and r≥2.
We also remark that Rtrans(Pnr)<R(Knr) for every r≥2 and every sufficiently large n.
For example, Rtrans(Pn2)=(n−1)2+1 [17], while R(Knr) equals the standard Ramsey number R(Knr) of the complete r-uniform hypergraph on n vertices and thus R(Kn2) grows exponentially in n [8].
Perhaps surprisingly, the colorings of KNr, which were found by Moshkovitz and Shapira [19] and which give R(Pn+r−1r)>ρr(n), are not transitive for r>3.
Thus it is natural to ask the following question.
Problem 1**.**
[6, 19*]**
What is the growth rate of Rtrans(Pnr)?*
Problem 1 was considered by Eliáš and Matoušek [6], who asked for better lower bounds on Rtrans(Pnr).
Moshkovitz and Shapira [19] note that it might be very well possible that bounds comparable with the bounds for R(Pnr) hold also for Rtrans(Pnr).
They also mention a problem of deciding whether R(Pnr)=Rtrans(Pnr) for all n and r.
Clearly, Rtrans(Pnr)≤R(Pnr) and, by (1), Rtrans(Pnr) grows at most as a tower of height r−2.
This was also shown by Eliáš and Matoušek [6], who also proved Rtrans(Pn4)=tow3(Θ(n)).
Thus Problem 1 is settled for r≤4.
We are not aware of any other lower bound on Rtrans(Pnr).
In this paper, we settle Problem 1 by constructing, for all n and r with r≥3, transitive colorings cr of KNr with no monochromatic copy of P2n+r−1r, where N≥towr−1((1−o(1))n).
In fact, we show that the colorings cr satisfy so-called monotonicity property, which is much more restrictive than the transitivity property and which admits several geometric interpretations.
1.1. Monotone colorings
For a positive integer n, we write [n] to denote the set {1,…,n}.
Let S be a sequence of n elements from some set.
For a subset {i1,…,ik} of [n], we use S(i1,…,ik) to denote the subsequence of S obtained by deleting all elements from S that are at position ij for some j∈[k].
Let r≥2 be an integer.
A 2-coloring c of KNr=(KNr,≺) is called an r-monotone coloring of KNr if it assigns −1 or +1 to every edge of KNr such that the following monotonicity property is satisfied: for every sequence S of r+1 vertices of KNr ordered by ≺ and all integers i,j,k with 1≤i<j<k≤r+1, we have c(S(k))≤c(S(j))≤c(S(i)) or c(S(k))≥c(S(j))≥c(S(i)).
In other words, the monotonicity condition says that there is at most one change of a sign in the sequence (c(S(r+1)),…,c(S(1))).
When referring to a 2-coloring that is r-monotone for some r≥2, we sometimes use the term monotone.
We also abbreviate −1 and +1 by − and +, respectively.
Note that every r-monotone coloring of KNr is a transitive 2-coloring of KNr.
For r=2, transitive and 2-monotone colorings coincide.
However, for r≥3, the monotonicity property is much more restrictive than the transitivity property, as Kr+1r admits 2r+2 transitive and only 2r+2 r-monotone colorings.
An example of a transitive 2-coloring of K43, which is not 3-monotone, is a function c with (c({1,2,3}),c({1,2,4}),c({1,3,4}),c({2,3,4}))=(−,+,−,+).
The notion of monotone colorings has been considered by several researchers [12, 18, 21] under different names.
In some sense, monotone colorings can be viewed as more natural than transitive colorings, as they admit various geometric interpretations; see Subsections 2.1 and 2.2 for examples.
2. Our results
A monotone Ramsey number Rmon(H) of an ordered r-uniform hypergraph H is the minimum positive integer N such that for every r-monotone coloring c of KNr there is an ordered sub-hypergraph of KNr that is monochromatic in c and isomorphic to H.
Since every monotone coloring is transitive, we get Rmon(Pnr)≤Rtrans(Pnr) and also Rmon(Pnr)=Rmon(Knr) for all n and r≥2.
It follows from (1) that Rmon(Pnr)≤towr−1(O(n)).
All known lower bounds on Rtrans(Pnr) are also true for Rmon(Pnr).
That is, we have Rmon(Pn2)=Rtrans(Pn2)=R(Pn2)=(n−1)2+1 [9], Rmon(Pn3)=Rtrans(Pn3)=R(Pn3)=(n−22n−4)+1 [9], and Rmon(Pn4)=tow3(Θ(n)) [6] for every n∈N, as all the constructed transitive colorings in these results are actually monotone.
As our first main result, we prove an asymptotically tight lower bound on Rmon(Pnr) for r≥3.
Since Rmon(Pnr)≤Rtrans(Pnr), this settles Problem 1.
Theorem 2**.**
For positive integers r and n with r≥3, we have
[TABLE]
For r∈{3,4}, the lower bounds from Theorem 2 asymptotically match the lower bounds obtained from results of Erdős and Szekeres [9] and Eliáš and Matoušek [6], respectively.
Our construction is closer to the construction of Moshkovitz and Shapira [19], which they used to show the tight bound R(Pn+r−1r)≥ρr(n)+1.
Our bounds on Rmon(Pnr) do not match the upper bounds on R(Pnr) exactly and thus deciding whether Rtrans(Pnr)=R(Pnr) for all r and n remains an interesting open problem.
It is even possible that Rmon(Pnr)=R(Pnr) for all r and n.
Despite having several natural geometric interpretations, the monotone colorings seem to be quite unexplored.
For example, we are not aware of any non-trivial estimate on the number of r-monotone colorings of Knr for r>3.
Here, we derive both upper and lower bounds for this number.
Note that the bounds are reasonably close together, even with respect to r.
Theorem 3**.**
For integers r≥3 and n≥r, the number Sr(n) of r-monotone colorings of Knr satisfies
[TABLE]
As we will see in Subsection 2.2, Theorem 3 is a generalization of the well-known fact that the number of simple arrangements of n pseudolines is 2Θ(n2).
This fact follows from Theorem 3 by setting r=3.
However, the constants in the exponents in the bounds from Theorem 3 are not the best known.
Felsner and Valtr [11] showed that the number of simple arrangements of n pseudolines is at most 20.657n2, improving the previous bounds 20.792n2 by Knuth [15] and 20.697n2 by Felsner [10].
Felsner and Valtr [11] also proved the lower bound 20.188n2.
All these bounds apply also to S3(n).
In the rest of this section we use two geometric interpretations of monotone colorings to show connections between the problem of estimating Rmon(Pnr) and some geometric Ramsey-type problems that have been recently studied.
We note that besides the following two geometric interpretations of monotone colorings, there is also a third one, which was discovered by Ziegler [21].
He showed that monotone colorings can be interpreted as extensions of the cyclic arrangement of hyperplanes with a pseudohyperplane.
2.1. Higher-order Erdős–Szekeres theorems
Very recently, Miyata [18] introduced a new geometric interpretation of (k+2)-monotone colorings for k∈N, which are called degree-k oriented matroids in [18].
This interpretation concerns k-intersecting pseudoconfigurations of points (or k-pseudoconfigurations, for short), which are formed by a pair (P,L) satisfying the following conditions.
The set P={p1,…,pn} contains n points in the Euclidean plane ordered by their increasing x-coordinates and the set L is a collection of x-monotone Jordan arcs such that:
- (i)
for every l∈L, there are at least k+1 points of P lying on l,
2. (ii)
for every (k+1)-tuple of distinct points of P, there is a unique curve l from L passing through each point of this (k+1)-tuple,
3. (iii)
any two distinct curves from L cross at most k times.111We count all crossings, not only those contained in P.
This notion naturally generalizes the concept of generalized point sets [14] (sometimes called abstract order types), which correspond to 1-pseudoconfigurations.
It also captures the essential combinatorial properties of configurations of points and graphs of polynomial functions, which is a setting considered by Eliáš and Matoušek [6] in their study of higher-order Erdős–Szekeres theorems.
A k-pseudoconfiguration (P,L) of points is simple if each curve from L passes through exactly k+1 points of P; see Figure 1.
If (P,L) is simple, we let li1,…,ik+1 be the curve from L passing through points pi1,…,pik+1.
Each curve l from L is a graph of a continuous function fl:R→R and we let l−:={(x,y)∈R2:y<fl(x)}.
A sign function of a simple k-pseudoconfiguration (P,L) is a function f:(k+2P)→{−,+} such that, given {i1,…,ik+2}∈(k+2P) with i1<⋯<ik+2, f(pi1,…,pik+2)=− if and only if pik+2∈li1,…,ik+1−.
Miyata [18] proved the following correspondence between (k+2)-monotone colorings of Knk+2 and simple k-pseudoconfigurations of n points.
Theorem 4**.**
[18*]**
For k,n∈N, there is a one-to-one correspondence between sign functions of simple k-pseudoconfigurations of n points and (k+2)-monotone colorings of Knk+2.
The monotone coloring corresponding to a k-pseudoconfiguration P is the sign function of P.*
A subset S of P is (k+1)st order monotone if the sign function of (P,L) attains only − or only + value on all of (k+2)-tuples of S.
Theorem 4 immediately gives the following corollary.
Corollary 5**.**
For all positive integers k and n, the number Rmon(Pnk+2) is the minimum positive integer N such that every simple k-pseudoconfiguration of N points contains a (k+1)st order monotone subset of size n.
Generalizing the Erdős–Szekeres Theorem [9] to higher orders, Eliáš and Matoušek [6] introduced the following more restrictive setting in which, for every l∈L, fl is a function whose (k+1)st derivative is everywhere non-positive or everywhere non-negative.
A planar point set P is in (k+1)-general position if no k+2 points of P lie on the graph of a polynomial of degree at most k.
By Newton’s interpolation, every (k+1)-tuple of points from P determines a unique polynomial of degree at most k whose graph contains this (k+1)-tuple and thus P determines a simple k-pseudoconfiguration.
Thus, in this setting, we can consider (k+1)st order monotonicity with respect to the graphs of the polynomials of degree at most k.
Let ESk+1(n) be the smallest positive integer N such that every set of N points in (k+1)-general position contains a (k+1)st order monotone subset of size n.
By Corollary 5, we have ESk+1(n)≤Rmon(Pnk+2) for all positive integers k and n.
It is known that this inequality is tight for k=1 [9].
Eliáš and Matoušek [6] showed that ES3(n)=tow3(Θ(n)) and thus ES3(n) and Rmon(Pn4) have asymptotically the same growth rate.
They also asked about the growth rate of ESk+1(n) for k>2.
A related interesting open question is whether ESk+1(n) and Rmon(Pnk+2) are the same, at least asymptotically.
By Corollary 5, it suffices to show that the extremal configurations for Rmon(Pnk+2) can be ‘realized’ by graphs of polynomial functions of degree at most k.
It is possible that the configurations obtained in the proof of Theorem 2 admit such realizations, which would solve the open problem of Eliáš and Matoušek about the growth rate of ESk+1(n).
We hope to discuss this direction in future work.
2.2. Arrangements of pseudohyperplanes and order-type homogeneous point sets
Felsner and Weil [12] showed that, for every r≥3, there is a one-to-one correspondence between r-monotone colorings of Knr, which they call r-signotopes, and arrangements of n pseudohyperplanes in Rr−1 that admit ‘sweeping’.
For an integer d≥2, a pseudohyperplane H in Rd is a homeomorph of a hyperplane in Rd such that the two connected components of Rd∖H are homeomorphic to an open d-dimensional ball.
Two pseudohyperplanes H1 and H2 cross, if Hi intersects both components of Rd∖H3−i for every i∈{1,2}.
An arrangement of pseudohyperplanes in Rd (or d-arrangement, for short) consists of pseudohyperplanes H1,…,Hn in Rd such that any two pseudohyperplanes Hi and Hj intersect in a pseudohyperplane in Hi≅Hj≅Rd−1 and they cross at their intersection.
Moreover, for every j∈[n], the intersections Hi∩Hj, where i∈[n]∖{j}, form an arrangement of pseudohyperplanes in Hj≅Rd−1.
A d-arrangement A is simple if any d+1 pseudohyperplanes from A have an empty intersection.
We assume that every d-arrangement A of pseudohyperplanes H1,…,Hn is normal, that is, A is simple and is embedded in Rd in the following normalized way.
Assume that A is embedded in the hypercube [0,1]d and, for i∈[d−1], let Ii be the (d−i)-dimensional subspace of Rd that contains the side of [0,1]d, which is obtained by setting the last i coordinates to [math].
We demand that A∩Ii is a (d−i)-arrangement of n pseudohyperplanes.
Moreover, the pseudohyperplanes in A are labeled by increasing first coordinate at their intersection with Id−1.
The assumption that A is embedded in [0,1]d is only for convenience so that all intersections of d pseudohyperplanes from A are contained in [0,1]d.
The reader may consider “spaces at infinity” instead by defining Ii to be the (d−i)-dimensional affine subspace obtained by setting the last i coordinates to some sufficiently small number.
A sign function of a normal d-arrangement A of n pseudohyperplanes H1,…,Hn is a function f:(d+1[n])→{−,+} such that, for given i1<⋯<id+1, f(i1,…,id+1)=− if and only if the pseudoline Hi3∩⋯∩Hid+1, which is oriented away from I1, intersects Hi1 before Hi2.
A normal d-arrangement A is called a Cd-arrangement if the normal (d−1)-arrangement formed by H∩I1 for H∈A has no + sign in its sign function.
We note that every normal arrangement of pseudolines (that is, pseudohyperplanes in R2) is a C2-arrangement, but this is no longer true for Cd-arrangements with d≥3.
This is because, for d≥3, the arrangement induced by A is not uniquely determined, while for Cd-arrangements this arrangement must be the “minimal one with respect to the sign function”.
An example of a C2-arrangement can be found in Figure 2.
Theorem 6**.**
[12*]**
For d≥2 and n∈N, there is a one-to-one correspondence between sign functions of Cd-arrangements of n pseudohyperplanes in Rd and (d+1)-monotone colorings of Knd+1.
The monotone coloring corresponding to an arrangement A is the sign function of A.*
A subset S of A is order-type homogeneous if the sign function of A attains only − or only + values on all of (d+1)-tuples of pseudohyperplanes from S.
Theorem 6 gives the following corollary.
Corollary 7**.**
For all positive integers d≥2 and n, the number Rmon(Pnd+1) is the minimum positive integer N such that every Cd-arrangement of N pseudohyperplanes contains an order-type homogeneous subset of size n.
An orientation of a (d+1)-tuple of points (p1,…,pd+1) with pi=(ai,1,…,ai,d)∈Rd is defined as
[TABLE]
A sequence of points from Rd, d≥2, is order-type homogeneous if all (d+1)-tuples of points from this sequence have the same orientation.
For positive integers n and d≥2, let OTd(n) be the minimum positive integer N such that every sequence of N points from Rd contains an order-type homogeneous subsequence with n points.
Using geometric duality, the notion of order-type homogeneous sequence of points from Rd transcribes to sequences of hyperplanes in Rd.
Thus OTd(n) is also the minimum positive integer N such that every sequence of N hyperplanes in Rd contains an order-type homogeneous subsequence of size n.
The function OTd(n) was considered by many researchers [2, 5, 7, 20].
Suk [20] showed that OTd(n)≤towd(O(n)).
The results of Bárány, Matoušek, and Pór [2] and Eliáš, Matoušek, Roldán-Pensado, and Safernová [7] give an asymptotically matching lower bound OTd(n)≥towd(Ω(n)).
For d≥3, the arrangements obtained from their lower bound on OTd(n) are not Cd-arrangements.
A natural problem is to decide whether one can obtain similar lower bounds on OTd(n) when restricted to Cd-arrangements of hyperplanes.
Corollary 7 combined with Theorem 2 suggests that this might be true, as we obtain such bounds for Cd-arrangements of pseudohyperplanes for every d≥2.
3. Proof of Theorem 2
Here, for positive integers n and r with r≥3, we construct an r-monotone coloring cr of KNr with no monochromatic copy of P2n+r−1r and with N≥towr−1((1−o(1))n).
First, we describe the construction of cr and show that cr contains no long monochromatic monotone r-uniform paths.
Then we prove that the coloring cr satisfies the monotonicity property.
Let us start with a brief overview of the construction of the coloring cr.
It is carried out iteratively with respect to r.
For every positive integer n, we will construct sets Fr(n) with r≥1 such that ∣F1(n)∣=2, ∣F2(n)∣=2n, and ∣Fr(n)∣=2∣Fr−1(n)∣/2 for r≥3.
The 2-coloring cr will have Fr(n) as its vertex set.
We will have a partition of Fr(n) into sets Fr−(n), Fr+(n), and a bijection σr:Fr−(n)→Fr+(n).
Elements A,B∈Fr(n) will be called equivalent, written A≡rB, if A=B, A=σr(B), or B=σr(A).
We say that elements from Fr−(n) and Fr+(n) have type − and +, respectively.
We will also define two orders <r and ≺r; <r will be a linear order on Fr(n) and ≺r will be a linear order on equivalence classes under the equivalence relation ≡r.
In <r, all elements of Fr−(n) will precede all elements of Fr+(n), and the bijection σr will be order-reversing.
Moreover, if we regard ≺r as an ordering on Fr−(n) and on Fr+(n), we will have (Fr−(n),≺r)=(Fr−(n),<r), and hence (Fr+(n),≺r)=(Fr+(n),>r).
The color of an edge e={A1,…,Ar} in cr, where Ai∈Fr(n) and A1<r⋯<rAr, is then defined using an iterative application of a function γ on consecutive vertices Ai and Ai+1, where γ(A,B) is the first element of B in ⪯r−1 on which A and B differ.
We apply γ on e until we reach a unique element of F1(n), which is set to be the color of e.
Now, we proceed by describing the construction of cr in full detail.
Let F2−(n):={(2n−i+1,i):i∈[n]}⊆[2n]2 and F2+(n):={(i,2n−i+1):i∈[n]}⊆[2n]2.
We define a linear ordering <2 on the disjoint union F2(n):=F2−(n)∪F2+(n) by letting (2n,1)<2(2n−1,2)<2⋯<2(1,2n)222Alternatively, one might define F2(n)=[2n]. However, we use this definition as it is more similar to the approach of Moshkovitz and Shapira..
Note that N2:=∣F2(n)∣=2n.
For convenience, we define F1−(n):={−}, F1+(n):={+}, F1(n):={−,+}, and −<1+.
Let σ2:F2−(n)→F2+(n) be the one-to-one correspondence that maps (2n−i+1,i) to (i,2n−i+1).
Two elements A and B from F2(n) are equivalent, written A≡2B, if A=B, A=σ2(B), or B=σ2(A).
We order the equivalence classes of F2(n) under ≡2 by a linear order ⪯2 by identifying each A from F2−(n) with σ2(A) and by letting ≺2 be the ordering <2 on F2−(n).
Slightly abusing the notation, we sometimes consider ⪯2 as a linear order on F2(n).
Then two equivalent elements of F2(n) are considered equal in ⪯2.
For r=1, we let σ1(−)=+ and −≡1+.
Let r≥3 be a positive integer and assume we have constructed Fr−1(n).
Let Fr(n) be the collection of sets X such that X contains exactly one set from each equivalence class of ≡r−1 on Fr−1(n).
Observe that Nr:=∣Fr(n)∣=2Nr−1/2 and that no two sets from Fr(n) are comparable in ⊆.
Also note that the minimum and the maximum element of Fr−1(n) in <r−1 are equivalent and thus X contains exactly one of them.
We let Fr−(n) and Fr+(n) be the subsets of Fr(n) consisting of sets that contain the minimum and the maximum element of Fr−1(n) in <r−1, respectively.
Since every element of Fr(n) contains either the minimal or the maximal element of Fr−1(n) in <r−1, the sets Fr−(n) and Fr+(n) partition Fr(n).
We say that sets from Fr−(n) and Fr+(n) have type − and +, respectively.
An example for r=3=n can be found in Figure 3.
Let A and B be distinct sets from Fr(n) for r≥3.
We let γ(A,B) be the element from B∩E, where E is the first equivalence class of (Fr−1(n))≡r−1 in ≺r−1 on which A and B differ.
We define the total order <r on Fr(n) by letting A<rB if γ(A,B)∈Fr−1+(n).
Observe that γ(A,B)∈Fr−1+(n) if and only if γ(B,A)∈Fr−1−(n) and thus <r is indeed a total order.
For r=2, if A=(a1,a2) and B=(b1,b2) are distinct elements from F2(n), then we let γ(A,B)=− if a1<b1 and, similarly, γ(A,B)=+ if a1>b1, where < is the standard ordering of R.
Note that, for A,B∈F2(n), γ(A,B)=− if and only if A>2B.
We define the mapping σr:Fr−(n)→Fr+(n) by letting
[TABLE]
Note that σr is a one-to-one correspondence.
Two elements A and B from Fr(n) are equivalent, written A≡rB, if A=B, A=σr(B), or B=σr(A).
We again order the equivalence classes of Fr(n) under ≡r by a linear order ⪯r that is obtained by identifying each A from Fr−(n) with σr(A) and by letting ≺r be the ordering <r on Fr−(n).
Again, slightly abusing the notation, we sometimes consider ⪯r as a linear order on the set Fr(n).
Thus two equivalent elements from Fr(n) are the same in ⪯r, (Fr−(n),<r)=(Fr−(n),≺r), and (Fr+(n),>r)=(Fr+(n),≺r).
For integers k,r≥2 and a sequence (B1,…,Bk) of sets from Fr(n) in which any two consecutive terms are distinct, we use Γ(B1,…,Bk) to denote the sequence (γ(B1,B2),…,γ(Bk−1,Bk)) of k−1 sets from Fr−1(n).
Observe that, if r≥3, the definition of γ guarantees that any two consecutive terms of Γ(B1,…,Bk) are distinct and thus we can apply the function Γ on Fr−1(n).
Applying Γ to (B1,…,Bk) iteratively i times, for some i with 1≤i≤min{k−1,r−1}, results in a sequence Γi(B1,…,Bk):=Γ(Γ(⋯Γ(B1,…,Bk)⋯)) of k−i elements from Fr−i(n).
For convenience, we set Γ0(B1,…,Bk):=(B1,…,Bk).
Letting KNrr be the ordered complete r-uniform hypergraph with the vertex set Fr(n) ordered by <r, we color KNrr with a 2-coloring cr by letting cr({A1,…,Ar}):=Γr−1(A1,…,Ar) for every edge {A1,…,Ar} of KNrr with A1<r⋯<rAr.
Lemma 8**.**
For all positive integers n and r with r≥3, there is no monochromatic copy of P2n+r−1r in KNrr colored with cr.
Proof.
Let P be a monochromatic copy of Pkr in cr for some integer k≥r.
Let A1<r⋯<rAk be vertices of P.
Let a1,…,ak−r+2 be the elements of F2(n) obtained by applying the function Γr−2 to sequences (A1,…,Ar−1),(A2,…,Ar),…,(Ak−r+2,…,Ak), respectively.
The color cr({Ai,…,Ai+r−1}) of each edge {Ai,…,Ai+r−1} of P then equals γ(ai,ai+1).
Thus if all edges of P have color − in cr, we obtain a1>2⋯>2ak−r+2.
That is, the first coordinates of a1,…,ak−r+2 increase and we get k≤2n+r−2, as a1,…,ak−r+2∈F2(n)⊆[2n]2.
Similarly, if all edges of P have color +, then a1<2⋯<2ak−r+2 and the second coordinates of a1,…,ak−r+2 increase, which again implies k≤2n+r−2.
∎
Note that if r=3, then a1,…,ak−1 all have type +, as A1<r⋯<rAk.
Using this fact, we could eventually obtain a better estimate Rmon(Pn+23)≥2n.
However, this is not optimal anyway, as we know that Rmon(Pn3)=(n−22n−4)+1.
It remains to show that the coloring cr satisfies the monotonicity property.
In other words, we want to show that there is at most one change of a sign in (cr(S(r+1)),…,cr(S(1))) for every sequence S=(A1,…,Ar+1) of sets from Fr(n) with A1<r⋯<rAr+1.
We first prove two auxiliary results that hold for every r≥2.
Lemma 9**.**
For positive integers n and r with r≥2, let (A,B,C) be a sequence of distinct sets from Fr(n).
For r≥3, γ(A,C)=min≺r−1{γ(A,B),γ(B,C)} if γ(A,B)≡rγ(B,C) and γ(A,B),γ(B,C)≺r−1γ(A,C) otherwise.
For r=2, γ(A,C)∈{γ(A,B),γ(B,C)} if γ(A,B)=γ(B,C) and γ(A,C)=γ(A,B)=γ(B,C) otherwise.
Proof.
First, we assume r≥3.
One of the following three cases occurs: γ(A,B)≺r−1γ(B,C), γ(B,C)≺r−1γ(A,B), or γ(A,B)≡r−1γ(B,C).
In the first case, the sets in B are the same as the sets in C up to γ(B,C) in ≺r−1 while the sets in A and B differ already on γ(A,B)≺r−1γ(B,C) in ≺r−1.
Thus γ(A,C)=γ(A,B).
Similarly, we obtain γ(A,C)=γ(B,C) in the second case.
If γ(A,B)≡r−1γ(B,C), then it follows from γ(A,B)=γ(B,C) that either γ(A,B)=σr(γ(B,C)) or σr(γ(A,B))=γ(B,C).
In particular, γ(B,A)=γ(B,C).
The sets in A and C thus differ for the first time on a set that is larger then both γ(A,B) and γ(B,C) in ≺r−1.
For r=2, let A=(a1,a2), B=(b1,b2), and C=(c1,c2).
If γ(A,B)=+=γ(B,C), then a1>b1 and b1>c1.
In particular, a1>c1 and γ(A,C)=+.
Analogously, if γ(A,B)=−=γ(B,C), then γ(A,C)=−.
If γ(A,B)=γ(B,C), then γ(A,C)∈{γ(A,B),γ(B,C)}, as F1(n) contains only the values − and +.
∎
Note that if A<rB<rC or A>rB>rC, then γ(A,B) and γ(B,C) have the same type and thus γ(A,B)≡r−1γ(B,C) if r≥3.
For r≥3, it follows from Lemma 9 that if γ(A,B)≡r−1γ(B,C), then γ(A,B)<r−1γ(A,C)<r−1γ(B,C) or γ(A,B)>r−1γ(A,C)>r−1γ(B,C).
This is because the lemma gives us γ(A,B),γ(B,C)≺r−1γ(A,C), which together with the facts γ(A,B)=γ(B,C) and γ(A,B)≡r−1γ(B,C) implies that γ(A,B) and γ(B,C) have different types and thus γ(A,C) lies between them in <r−1.
For r=2, it follows that (γ(A,B),γ(A,C),γ(B,C)) has at most one change of a sign.
Thus, for any distinct A,B,C from Fr(n) with r≥2, the sequence (γ(A,B),γ(A,C),γ(B,C)) is monotone in ≤r−1.
Lemma 10**.**
For positive integers n and r with r≥2, let A,B,A′,B′ be sets from Fr(n) such that A=B.
- (i)
Assume A′=B. If A≤rA′, then γ(A,B)≥r−1γ(A′,B).
2. (ii)
Assume A=B′. If B≤rB′, then γ(A,B)≤r−1γ(A,B′).
Proof.
We prove only part (i), as the proof of part (ii) is analogous.
It is easy to verify the statement for r=2 and thus we consider r≥3.
We can assume A=A′, as otherwise the statement is trivial.
There are three possibilities where to place B with respect to A and A′ in <r.
If A<rA′<rB, then Lemma 9 implies γ(A,B)⪯r−1γ(A′,B) and, since γ(A,B),γ(A′,B)∈Fr−1+(n), we have γ(A,B)≥r−1γ(A′,B).
If A<rB<rA′, then γ(A,B)∈Fr−1+(n) and γ(A′,B)∈Fr−1−(n) and we obtain γ(A,B)≥r−1γ(A′,B) immediately.
Finally, if B<rA<rA′, then Lemma 9 implies γ(B,A′)⪯r−1γ(B,A).
Since γ(B,A) and γ(A,B) are equivalent and have distinct type, and the same is true for γ(B,A′) and γ(A′,B), we have γ(A′,B)⪯r−1γ(A,B).
Using the fact that γ(A,B),γ(A′,B)∈Fr−1−(n), we again obtain γ(A,B)≥r−1γ(A′,B).
∎
Before stating the last auxiliary result, we first introduce some definitions.
For two sequences S1 and S2, we use S1⋅S2 to denote the concatenation of S1 and S2.
A profile is a sequence of symbols ≤, ≥, and =, containing at least one of the symbols ≤ and ≥.
Let Ol:=(≤,=,≤,=,…) and El:=(=,≥,=≥,…) be two profiles of length l∈N.
We say that a profile P of length l is odd or even if it can be obtained from Ol or El, respectively, by changing some occurrences of ≤ and ≥ to =.
For two profiles P1 and P2 such that each is odd or even, if P1 is odd and P2 is even, then P1 and P2 have distinct parity.
Otherwise we say that P1 and P2 have the same parity.
The opposite profile P of a profile P is the profile that is obtained from P by replacing each term ≤ with ≥ and each term ≥ with ≤.
For positive integers n, r, and s≥2, let R=(B1,…,Bs) be a sequence of s sets from Fr(n) and let P be a profile of length s−1.
We say that P is a profile of R if whenever Bj<rBj+1 or Bj>rBj+1, then the jth term of P is ≤ or ≥, respectively, for every j∈[s−1].
Lemma 11**.**
For positive integers n, r, and s with r≥3 and 3≤s≤r+1, let S:=(A1,…,As) be a sequence of s sets from Fr(n) with A1<r⋯<rAs.
Then the sequence H:=(Γs−2(S(s)),…,Γs−2(S(1))) has either odd or even profile.
Proof.
We recall that, for a sequence S and a subset {i1,…,ik} of {1,…,∣S∣}, we use S(i1,…,ik) to denote the subsequence of S obtained by deleting all elements from S that are at position ij for some j∈[k].
Also note that every sequence (A1,…,Ak) of elements from Fr(n) satisfies Γk−1(A1,…,Ak)=γ(Γk−2(A1,…,Ak−1),Γk−2(A2,…,Ak)).
In particular, we have Γs−2(S(s))=γ(Γs−3(S(s−1,s)), Γs−3(S(1,s))),
Γs−2(S(1))=γ(Γs−3(S(1,s)),Γs−3(S(1,2))),
and Γs−2(S(i))=γ(Γs−3(S(i,s)),Γs−3(S(i,1))) for every i with 2≤i≤s−1.
We use H1 to denote the sequence (Γs−3(S(s−1,s)),…,Γs−3(S(1,s))) and H2 to denote (Γs−3(S(1,s)),…,Γs−3(S(1,2))).
Let G1:=(Γs−3(S(s−1,s)))⋅H1 and G2:=H2⋅(Γs−3(S(1,2))).
That is, G1 is the sequence obtained from H1 by doubling the first term and G2 is the sequence obtained from H2 by doubling the last term.
By the definition of the function γ, for every i∈[s], the ith term of H equals γ(X,Y), where X is the ith term of G1 and Y is the ith term of G2.
We proceed by induction on s≥3 and, in each step of the induction, we construct a profile p(H) such that p(H) is a profile of H and p(H) is odd or even.
We start with the base case s=3.
We have H=(γ(A1,A2),γ(A1,A3),γ(A2,A3)), H1=(A1,A2), G1=(A1,A1,A2), H2=(A2,A3), and G2=(A2,A3,A3).
Since A1<rA2<rA3, it follows from Lemma 9 that
γ(A1,A2)=γ(A1,A3)>r−1γ(A2,A3) if Γ(S(3))≺Γ(S(1)) or
γ(A1,A2)<r−1γ(A1,A3)=γ(A2,A3) if Γ(S(1))≺r−1Γ(S(3)).
We thus choose p(H) to be the even profile (=,≥) or the odd profile (≤,=), respectively.
We also set p(H1):=(≤), p(H2):=(≤), p(G1):=(=,≤), and p(G2):=(≤,=).
Observe that if Γ(S(3))≺r−1Γ(S(1)), then p(H) is the profile p(G1) and if Γ(S(1))≺r−1Γ(S(3)), then p(H) is the profile p(G2).
Let R be a sequence of length k with the profile p(R) assigned.
We recall that the length of p(R) is k−1.
We let i1(R) be the largest i∈[k] such that the first i−1 terms of p(R) are all =.
Similarly, we let i2(R) be the smallest j∈[k] such that the last k−j terms of p(R) are all =.
In other words, i1(R) is the smallest i with 1≤i≤k such that the ith term of p(R) is not =, and i2(R) is the smallest i with 1≤i≤k such that for every j with i≤j≤k−1, the jth term of p(R) is =.
Note that i2(R)≥i1(R)+1.
In the case s=3, it is easy to check that p(H1) and p(H2) have the same parity and i1(G1)=i2(G2).
For the induction step, we assume that s≥4.
We first express each of the sequences H1 and H2 as a result of applying γ to two sequences, similarly as we have expressed H using G1 and G2.
Let H1,1:=(Γs−4(S(s−2,s−1,s)),…,Γs−4(S(1,s−1,s))) and H1,2:=(Γs−4(S(1,s−1,s)),…,Γs−4(S(1,2,s))).
By setting G1,1:=(Γs−4(S(s−2,s−1,s)))⋅H1,1 and G1,2:=H1,2⋅(Γs−4(S(1,2,s))), we obtain that the ith term of H1 is γ(X,Y), where X and Y are the ith terms of G1,1 and G1,2, respectively.
We similarly proceed with H2 and we let H2,1:=H1,2 and H2,2:=(Γs−4(S(1,2,s)),…,Γs−4(S(1,2,3))).
Setting G2,1:=(Γs−4(S(1,s−1,s))⋅H2,1 and G2,2:=H2,2⋅(Γs−4(S(1,2,3))), we get that the ith term of H2 is γ(X,Y), where X and Y are the ith terms of G2,1 and G2,2, respectively; see Figure 4 for an example.
We now define a profile p(H) and, as our induction step, we later prove that it is a profile of H.
In fact, we prove a stronger statement by additionally showing that if p(H1) and p(H2) have the same parity then either p(H)=p(G1) or p(H)=p(G2) and also i1(G1)≥i2(G2), while if p(H1) and p(H2) have distinct parity then i1(G1)≥i1(G2) and i2(G1)≥i2(G2); see Figure 5.
For every j∈[s−1], we let the jth term of a profile p(G1)∘p(G2) be = if the jth terms of both p(G1) and p(G2) are equalities and we let the jth term of p(G1)∘p(G2) be ≤ if the jth term of p(G1) or of p(G2) is ≤.
Similarly, we let the jth term of p(G1)∘p(G2) be ≥ if the jth term of p(G1) or of p(G2) is ≥.
Observe that if each of the profiles p(H1) and p(H2) is odd or even, then there is no i with 1≤i≤s−1 such that the ith term of p(G1) is ≤ while the ith term of p(G2) is ≥, or vice versa.
Thus p(G1)∘p(G2) is correctly defined under this assumption.
If p(H1) and p(H2) have distinct parity, we let p(H) be the profile p(G1)∘p(G2).
If p(H1) and p(H2) have the same parity, we let p(H) be the profile p(G1) if Γs−2(S(s))≺r−s+2Γs−2(S(1)) and the profile p(G2) if if Γs−2(S(1))≺r−s+2Γs−2(S(s)).
Recall that, as our induction step, we prove that p(H) is a profile of H and that i1(G1)≥i1(G2) and i2(G1)≥i2(G2) if p(H1) and p(H2) have distinct parity and i1(G1)≥i2(G2) if p(H1) and p(H2) have the same parity.
We already observed that this statement is true for s=3.
Note that it follows from the induction hypothesis that the parity of p(H) is the same as the parity of p(G1) or of p(G2).
In particular, the profile p(H) of H is odd or even, which gives the statement of the lemma.
By the induction hypothesis, for every i∈{1,2}, the profile p(Hi) is a profile of Hi and i1(Gi,1)≥i1(Gi,2) and i2(Gi,1)≥i2(Gi,2) if p(Hi,1) and p(Hi,2) have distinct parity and i1(Gi,1)≥i2(Gi,2) if p(Hi,1) and p(Hi,2) have the same parity.
In the latter case, we also know that p(Hi)∈{p(Gi,1),p(Gi,2)}.
Assume first that p(H1) and p(H2) have the same parity. We show that i1(G1)≥i2(G2) by distinguishing some cases.
First, we consider the case when both p(H1) and p(H2) are odd, the other one will be symmetric.
Using the definition of G1,2 and G2,1, the fact that H1,2=H2,1, and the fact that p(H1,2)=p(H2,1) contain at least one term which is not =, we obtain ij(G1,2)=ij(G2,1)−1 for every j∈{1,2}.
- (1)
We start with the cases when at least one of the following situations occurs, either p(H1)∈/{p(G1,1),p(G1,2)} or p(H2)∈/{p(G2,1),p(G2,2)} .
Note that, by the definition of p(Hi) for i∈{1,2}, if p(Hi)∈/{p(Gi,1),p(Gi,2)}, then p(Hi)=p(Gi,1)∘p(Gi,2) and the profiles p(Hi,1) and p(Hi,2) have distinct parity.
- (a)
If p(H1)∈/{p(G1,1),p(G1,2)} and p(H2)∈{p(G2,1),p(G2,2)}, then p(H1,1) is even and p(H1,2)=p(H2,1) and p(H2,2) are odd.
Since p(H1,1) and p(H1,2) have distinct parity, we get i1(G1,1)≥i1(G1,2) and i2(G1,1)≥i2(G1,2).
Since p(H2,1) and p(H2,2) have the same parity, we get i1(G2,1)≥i2(G2,2).
From p(H1)=p(G1,1)∘p(G1,2) and i1(G1,1)≥i1(G1,2), we get i1(G1)=i1(G1,2)+1 by the definition of G1.
Since p(H2) is odd, we have p(H2)=p(G2,2).
Thus i2(G2)=i2(G2,2).
Altogether, it follows from i1(G2,1)≥i2(G2,2) and i1(G1,2)=i1(G2,1)−1 that
[TABLE]
2. (b)
If p(H1)∈{p(G1,1),p(G1,2)} and p(H2)∈/{p(G2,1),p(G2,2)}, then p(H1,1) and p(H1,2)=p(H2,1) are even and p(H2,2) is odd.
Since p(H1,1) and p(H1,2) have the same parity, we get i1(G1,1)≥i2(G1,2).
Since p(H2,1) and p(H2,2) have distinct parity, we get i1(G2,1)≥i1(G2,2) and i2(G2,1)≥i2(G2,2).
From p(H2)=p(G2,1)∘p(G2,2) and i2(G2,1)≥i2(G2,2), we get i2(G2)=i2(G2,1).
Since p(H1) is odd, we have p(H1)=p(G1,1).
Thus i1(G1)=i1(G1,1)+1 by the definition of G1.
Altogether, it follows from i1(G1,1)≥i2(G1,2) and i1(G1,2)=i1(G2,1)−1 that
[TABLE]
3. (c)
If p(H1)∈/{p(G1,1),p(G1,2)} and p(H2)∈/{p(G2,1),p(G2,2)}, then p(H1,1) and p(H1,2) have distinct parity and also p(H2,1) and p(H2,2) have distinct parity.
This, however, implies that either p(H1) or p(H2) is even, which is impossible.
2. (2)
Thus now we are left with the cases p(H1)∈{p(G1,1),p(G1,2)} and p(H2)∈{p(G2,1),p(G2,2)}.
We deal with all four cases.
- (a)
If p(H1)=p(G1,1) and p(H2)=p(G2,2), then p(H1,1) is even and p(H2,2) is odd and we have i1(G1)=i1(G1,1)+1 and i2(G2)=i2(G2,2). If the parity of p(H1,2)=p(H2,1) is odd, then p(H1,1) and p(H1,2) have distinct parity and p(H2,1) and p(H2,2) have the same parity.
It follows that i1(G1,1)≥i1(G1,2) and i1(G2,1)≥i2(G2,2).
Using i1(G1,2)=i1(G2,1)−1, we derive
[TABLE]
If the parity of p(H1,2)=p(H2,1) is even, then p(H1,1) and p(H1,2) have the same parity, while p(H2,1) and p(H2,2) have distinct parity.
This implies i1(G1,1)≥i2(G1,2) and i2(G2,1)≥i2(G2,2) and we derive
[TABLE]
2. (b)
Assume that p(H1)=p(G1,1) and p(H2)=p(G2,1).
Then p(H1,1) and p(H1,2)=p(H2,1) are both even. It also follows that i1(G1)=i1(G1,1)+1 and i2(G2)=i2(G2,1).
Since the profiles p(H1,1) and p(H1,2) have the same parity, we have i1(G1,1)≥i2(G1,2), which gives
[TABLE]
3. (c)
If p(H1)=p(G1,2) and p(H2)=p(G2,2), then both p(H2,1)=p(H1,2) and p(H2,2) are odd. We also have i1(G1)=i1(G1,2)+1 and i2(G2)=i2(G2,2).
It follows that p(H2,1) and p(H2,2) have the same parity, which gives i1(G2,1)≥i2(G2,2).
This implies
[TABLE]
4. (d)
We cannot have p(H1)=p(G1,2) and p(H2)=p(G2,1), as otherwise p(H1,2) and p(H2,1) have distinct parity, which is impossible, as p(H1,2)=p(H2,1).
Altogether, if p(H1) and p(H2) are odd, we have i1(G1)≥i2(G2).
Note that in the above case analysis, we only rely on the facts that the parity of two profiles is the same or different, we do not use the actual parity.
Thus, by symmetry, the inequality i1(G1)≥i2(G2) holds if both p(H1) and p(H2) are even.
Now, we use the fact i1(G1)≥i2(G2) to show that p(H) is a profile of H and p(H)=p(G1) if Γs−2(S(s))≺r−s+2Γs−2(S(1)) or p(H)=p(G2) if Γs−2(S(1))≺r−s+2Γs−2(S(s)).
Recall that we assume that p(H1) and p(H2) have the same parity.
Thus Γs−3(S(s−1,s))<r−s+3Γs−3(S(1,s))<r−s+3Γs−3(S(1,2))) or Γs−3(S(s−1,s))>r−s+3Γs−3(S(1,s))>r−s+3Γs−3(S(1,2))).
This implies that the first term Γs−2(S(s))=γ(Γs−3(S(s−1,s)),Γs−3(S(1,s))) of H and the last term Γs−2(S(1))=γ(Γs−3(S(1,s)),Γs−3(S(1,2))) of H have the same type and, assuming s≤r, they are not equivalent.
Thus we either have Γs−2(S(s))≺r−s+2Γs−2(S(1)) or Γs−2(S(1))≺r−s+2Γs−2(S(s)).
In the first case, Lemma 9 with the parameters A:=Γs−3(S(s−1,s)),B:=Γs−3(S(1,s)),C:=Γs−3(S(1,2)) implies γ(Γs−3(S(s−1,s)),Γs−3(S(1,2)))=Γs−2(S(s)) and in the second case, the lemma with the same parameters gives γ(Γs−3(S(s−1,s)),Γs−3(S(1,2)))=Γs−2(S(1)).
For s=r+1, the terms of H lie in F1(n) and Lemma 9 gives Γs−2(S(1))=γ(Γs−3(S(s−1,s)),Γs−3(S(1,2)))=Γs−2(S(s)) immediately.
We know that the term γ(Γs−3(S(s−1,s)),Γs−3(S(1,2))) equals the first term Γs−2(S(s)) of H if Γs−2(S(s))≺r−s+2Γs−2(S(1)) and to the last term Γs−2(S(1)) of H otherwise.
We assume without loss of generality that Γs−2(S(s))≺r−s+2Γs−2(S(1)), as the other case is symmetric.
For j=i1(G1), the inequality i1(G1)≥i2(G2) implies that the jth term of H equals γ(Γs−3(S(s−1,s)),Γs−3(S(1,2)))=Γs−2(S(s)).
For every i with i≤j, the ith term of H is obtained by applying γ to the first term of G1 and the ith term of G2.
Since G2 is either non-decreasing or non-increasing in ≤r−s+3, Lemma 10 implies that all the first j terms of H are monotone in ≤r−s+2.
Thus, since the first term of H and the jth term of H are both equal to Γs−2(S(s)), we get that all the first j terms of H are equal.
Since j=i1(G1)≥i2(G2), for every i with i>j, the ith term of H is obtained by applying γ to the ith term of G1 and the last term of G2.
Together with the previous fact, Lemma 10 implies that p(H)=p(G1) and it is a profile of H.
For the rest of the proof we assume that the profiles p(H1) and p(H2) have distinct parity. For i∈[s−1], let (pi,qi) be the pair consisting of the ith term pi of p(G1) and the ith term qi of p(G2).
It follows from the definition of G1 and G2 that (pi,qi)∈{(=,=),(≤,=),(=,≥),(≤,≥)} if p(H1) is odd and p(H2) is even and that (pi,qi)∈{(=,=),(≥,=),(=,≤),(≥,≤)} if p(H1) is even and p(H2) is odd.
Thus, by Lemma 10 and by the fact that the ith term of H is obtained by applying γ to the ith terms of G1 and G2 for each i∈[s], the profile p(H) is odd or even and it is a profile of H.
For example, in the case (pi,qi)=(≤,=), we apply part (i) of Lemma 10 with A:=ith term of G1, A′:=(i+1)st term of G1, and B:=ith term of G2, which is also the (i+1)st term of G2.
Note that in the cases (pi,qi)∈{(≤,≥),(≥,≤)} we apply Lemma 10 twice.
It remains to show that i1(G1)≥i1(G2) and i2(G1)≥i2(G2).
Let j∈{1,2}.
Since p(H1)∈{p(G1,1),p(G1,2),p(G1,1)∘p(G1,2)}, we have ij(H1)∈{ij(G1,1),ij(G1,2)}.
Similarly, p(H2)∈{p(G2,1),p(G2,2),p(G2,1)∘p(G2,2)} and thus ij(H2)∈{ij(G2,1),ij(G2,2)}.
Since p(H1,2)=p(H2,1), it follows from the definition of G1,2 and G2,1 that ij(G1,2)+1=ij(G2,1).
We recall that i2(Gk,l)≥i1(Gk,l) for all k,l∈{1,2}.
Thus the induction hypothesis gives ij(G1,1)≥ij(G1,2) and ij(G2,1)≥ij(G2,2)333Here, we are considering both parity cases, namely distinct or same parity of p(Hi,1) and p(Hi,2), at the same time..
Altogether, we obtain ij(H1)≥ij(G2,1)−1 and ij(H2)≤ij(G2,1).
It follows from the definition of G1 and G2 that ij(G1)=ij(H1)+1 and ij(G2)=ij(H2).
This implies ij(G1)≥ij(G2).
∎
Lemma 11 is sufficient to guarantee the monotonicity property for cr.
Corollary 12**.**
For every integer r with r≥3, the coloring cr is r-monotone.
Proof.
For a sequence S:=(A1,…,Ar+1) of sets from Fr(n) with A1<r⋯<rAr+1, we show that there is at most one change of a sign in the sequence (cr(S(r+1)),…,cr(S(1))).
By Lemma 11 applied for s:=r+1, the sequence (Γr−1(S(r+1)),…,Γr−1(S(1))) has odd or even profile and, in particular, this sequence is monotone in ≤1.
The rest follows from the fact that cr(S(i))=Γr−1(S(i)) for every i∈[r+1].
∎
Lemma 8 and Corollary 12 together give the statement of Theorem 2.
Comparison with the construction by Moshkovitz and Shapira
For positive integers n and r≥3, Moshkovitz and Shapira [19] constructed colorings cr′ of KNr with N≥towr−1(Ω(n)) such that there is no monochromatic copy of Pnr in cr′.
However, for r≥4, their coloring cr′ is not transitive.
The construction of our coloring cr is inspired by their approach and uses similar ideas.
However, there are some differences.
First of all, the coloring cr′ is defined on a larger vertex set formed by line partitions of order r, ordered by the lexicographic order ⋖r, while the vertex set on which cr is defined can be regarded as a proper subset of the vertex set for cr′.
Second, the function γ in the definition of cr differs from a function δ(A,B) that is used in the definition of cr′ and that returns the smallest element of B∖A in ⋖r−1.
Our function γ is defined very similarly, but it uses the ordering ≺r−1 instead.
4. Proof of Theorem 3
In this section, we prove Theorem 3 by showing that the number of r-monotone colorings of Knr is of order 2nr−1/rΘ(r) for r≥3 and n≥r.
We first derive the lower bound in Subsection 4.1 and then, in Subsection 4.2, we prove the upper bound.
4.1. A lower bound on the number of monotone colorings
Here we provide a lower bound 2nr−1/rO(r) on the number of r-monotone colorings of Knr with r≥3 and n≥r.
The construction is inspired by the method used by Matoušek [16] to show that there are 2Ω(n2) simple arrangements of n pseudolines.
First, we introduce some definitions.
A composition of a positive integer m into k parts, k∈N, is an ordered k-tuple (p1,…,pk) of positive integers with p1+⋯+pk=m.
It is well-known and easy to show that the number of compositions of m into k parts is exactly (k−1m−1).
In particular, the total number of compositions of m is ∑i=1m(i−1m−1)=2m−1.
Let r and k be integers with r≥3 and 1≤k≤r.
Let σ=(p1,…,pk) be a composition of r into k parts.
The reduction step on σ maps σ to the composition (p1,…,pk−1) if pk>1 or to the composition (p1,…,pk−1) if pk=1.
We say that a composition σ′ is the reduction of σ if σ′ is a composition of one of the forms (1,…,1,2) or (p,1), for some p>1, and is obtained from σ by a sequence of reduction steps.
Note that σ has a reduction if and only if σ=(1,…,1) and σ=(r).
Moreover, the reduction, if it exists, is unique.
We now recursively define the sign of a composition σ of r using the sign of its reduction.
This is carried out by induction on r.
If r=3, then σ is negative if σ=(1,2) and σ is positive if σ=(2,1).
For r>3, we say that σ is negative if it satisfies one of the following three conditions: the reduction of σ is negative, σ=(1,…,1,2) and r is odd, or σ=(r−1,1) and r is even.
Similarly, we say that σ is positive if it satisfies one of the following three conditions: the reduction of σ is positive, σ=(1,…,1,2) and r is even, or σ=(r−1,1) and r is odd.
The notion of negative and positive integer compositions is illustrated in Figure 6.
Note that, for every r≥3, the only two compositions of r that are not negative nor positive are (1,…,1) and (r).
Let r and h be positive integers with r≥3.
We set n:=rh and m:=n/r=rh−1.
We now present a construction of a 3-coloring cr,h of Knr with colors {−,0,+} such that every 2-coloring that is obtained by replacing each occurrence of the color [math] with either − or + is r-monotone.
The construction is carried out recursively starting with the case h=1, in which n=r and cr,1 is the coloring that assigns the color [math] to the only edge [r] of Knr.
For h≥2, we let Vi:={(i−1)m+1,…,im} for every i∈[r] and we let [n] be the vertex set of Knr.
Note that the sets V1,…,Vr partition [n] and form consecutive intervals of size m in the ordering < on [n].
We define the 3-coloring cr,h of Knr on [n] as follows.
Let e={v1,…,vr}∈(r[n]) be an edge of Knr.
The sets V1,…,Vr partition e into nonempty sets e1,…,ek, for some k∈[r], that are consecutive in <.
We let pi be the size of ei for every i∈[k] and we use σ to denote the composition (p1,…,pk) of r.
We choose cr,h(e):=− if σ is negative and cr,h(e):=+ if σ is positive.
It remains to assign the color cr,h(e) to edges e for which σ is not negative nor positive, that is, to edges e for which either σ=(r) or σ=(1,…,1).
If σ=(r), then e⊆Vi for some i∈[r] and, in particular, {v1−(i−1)m,…,vr−(i−1)m}⊆[m].
We then use the coloring cr,h−1 from the previous step of the construction and we let cr,h(e):=cr,h−1({v1−(i−1)m,…,vr−(i−1)m}).
If σ=(1,…,1), then each vi lies in the set Vi.
In this case, we use vi′ to denote the integer vi−(i−1)m from [m] and we let
[TABLE]
This finishes the construction of cr,h.
We show that no matter how we replace zeros with − or + signs in cr,h, the resulting coloring is r-monotone.
Lemma 13**.**
For h≥1 and r≥3, let c be an arbitrary 2-coloring of Knr that is obtained from cr,h by replacing each occurrence of [math] with − or +.
Then c is an r-monotone coloring of Knr.
Proof.
We prove the statement by induction on h.
For h=1, the statement is trivial as n=r and there is only a single edge in Krr.
Now, assume that h≥2.
We further assume that the statement is true for h−1.
Let F={v1,…,vr+1}⊆[n] be an (r+1)-tuple of vertices of Knr with v1<⋯<vr+1 and let j1<⋯<jk be indices with F∩Vji=∅.
We let σ=(p1,…,pk), k∈[r], be the composition of r+1, where pi=∣F∩Vji∣ for every i∈[k].
For every i∈[r+1], we let ei be the edge F∖{vi}.
Similarly as before, for every i∈[r+1], the partitioning of each edge ei by V1,…,Vr determines a composition σi of r.
Note that each σi can be obtained from σ by decreasing pj by 1 if pj>1 or by removing pj if pj=1, where j is a number from [k] such that ∑l=1j−1pl<i and ∑l=1jpl≥i.
We show that c is r-monotone by proving that there is at most one change of a sign in the sequence SF:=(c(e1),…,c(er+1)).
Since there are only r sets V1,…,Vr in the partition of [n], we cannot have σ=(1,…,1).
If σ=(r+1), then F⊆Vi for some i∈[r] and the statement follows from the induction hypothesis for h−1.
Thus we can assume that σ is positive or negative.
We first deal with the case σ=(1,…,1,2,1,…,1), that is, pj=2 for some j∈[r] and pi=1 for every i∈[r]∖{j}.
For such a σ, we have σj=(1,…,1)=σj+1, every σi with i>j+1 has the jth coordinate 2 and all other 1, and σi with i<j has the (j−1)st coordinate 2 and all other 1.
We show that if σi has the value 2 on an odd coordinate, then cr,h(ei)=+.
This is because we can perform reduction steps until we reach the reduction (1,…,1,2) of σi.
This reduction has an odd number of parts, which implies that it is a composition of an even number and thus the reduction of σi is positive.
By the definition of cr,h, we obtain cr,h(ei)=+.
Similarly, if the value 2 is on an even coordinate of σi, then cr,h(ei)=−.
Altogether, we see that there are ξ,ξ′,ξ′′∈{−,+} such that SF=(ξ,…,ξ,ξ′,ξ′′,−ξ,…,−ξ), where ξ′ and ξ′′ are on the jth and the (j+1)st coordinate, respectively.
Moreover, ξ=+ if j is even and ξ=− if j is odd.
Since vj,vj+1∈Vij=Vj and vj<vj+1, we have vj′<vj+1′.
Moreover, since ej=F∖{vj}, ej+1=F∖{vj+1}, the definition of cr,h implies that ξ′≤ξ′′ if j is odd and ξ′≥ξ′′ if j is even and either cr,h(ej) or cr,h(ej+1) is not [math].
Thus there is at most one change of a sign in SF.
In the rest of the proof, we assume that σ is a negative or a positive composition of r+1 that is not of the form (1,…,1,2,1,…,1).
Let σ′ be the reduction of σ.
We know that σ′ is a composition of some integer r′ with 3≤r′≤r+1 and σ′=(r′−1,1) or σ′=(1,…,1,2).
First, we consider the case where σ′ is of the form (r′−1,1).
For every i∈[k] with i>r′, the composition σi has the same reduction as σ and thus all the edges ei with i>r′ have the same color ξ∈{−,+} in cr,h.
Assume that r′>3.
Then every composition σi with i<r′ has the reduction (r′−2,1) and thus every edge ei with i<r′ has the color −ξ in cr,h.
It follows that c is r-monotone, as SF=(−ξ,…,−ξ,ξ′,ξ,…,ξ) for some ξ′∈{−,+}.
Now, assume r′=3.
Since σ=(2,1,…,1), there is an entry in σ of size larger than 1 not lying on the first position and thus σ1 and σ2 have the same reduction of the form (1,…,1,2).
Since r+1≥4 and r′=3, there is at least one entry in σ3 besides the first entry r′−1=2 and thus σ3 has the same reduction (r′−1,1)=(2,1) as any σi with i>r′=3.
It follows that SF=(ξ′,ξ′,ξ,…,ξ) for some ξ′∈{−,+}.
Now, we consider the case σ′=(1,…,1,2).
The composition σ′ is the reduction of σi for every i∈[k] with i>r′ and thus all the edges ei with i>r′ have the same color ξ∈{−,+} in cr,h.
Since σ=(1,…,1,2,1,…,1), the compositions σr′−1 and σr′ have the same reduction.
Assume r′>3.
Then every σi with i≤r′−2 has the reduction (1,…,1,2), which is a composition of r′−1.
Consequently, for every i≤r′−2, the edge ei has color −ξ in cr,h.
Thus SF=(−ξ,…,−ξ,ξ′,ξ′,ξ,…,ξ) for some ξ′∈{−,+}.
If r′=3, then σ′=(1,2) and the reduction of σ1 is (p2,1).
If p2≥3, then the compositions σ2,…,σr+1 have the same reduction and SF=(ξ′,ξ,…,ξ) for some ξ′∈{−,+}.
If p2=2, then the reduction of σ1 is (2,1) and, since (2,1) is positive and (1,2) is negative, we obtain SF=(+,ξ′,ξ′,−,…,−) for some ξ′∈{−,+}.
In any case, there is at most one change of a sign in SF and c is r-monotone.
∎
By Lemma 13, every coloring obtained from cr,h is r-monotone.
Thus, to finish the proof of the lower bound in Theorem 3, it suffices to estimate the number of such colorings from below.
Lemma 14**.**
For positive integers h and r with r≥3, there are at least
[TABLE]
colorings that can be obtained from cr,h by replacing each occurrence of [math] with − or +.
Proof.
Let fr(h) be the number of 2-colorings that can be obtained from cr,h by replacing each occurrence of color [math] with either − or +.
Clearly, we have fr(1)=2.
For h≥2, we have fr(h)≥2x, where x is the number of edges of color [math] in cr,h that are not contained in any Vi.
We recall that m=rh−1≥r.
We estimate the number x as follows.
Consider an arbitrary (r−1)-tuple T=(t1,…,tr−1) of numbers from [⌈m/2⌉] such that not all terms of T are equal.
Clearly, there are ⌈m/2⌉r−1−⌈m/2⌉ such (r−1)-tuples.
Let I and J be two sets of sizes ⌈(r−1)/2⌉ and ⌊(r−1)/2⌋, respectively, whose union is a partition of [r−1] such that d:=∑i∈Iti−∑j∈Jtj is minimum and positive.
Such a partition exists, as not all terms of T are equal and ∣I∣≥∣J∣.
We claim that d≤m.
Suppose for contradiction that d>m.
Let ta be the largest element from (ti:i∈I) and let tb be the smallest element from (tj:j∈J).
Note that ta>tb, as d>m and every element from (ti:i∈I) is at most ⌈m/2⌉≤m.
Let I′:=(I∖{a})∪{b} and J′:=(J∖{b})∪{a}.
The value ∑i∈I′ti−∑j∈J′tj decreases by 2(ta−tb) when compared to ∑i∈Iti−∑j∈Jtj.
Since 1≤2(ta−tb)≤2⌈m/2⌉−2≤m, we have 0<∑i∈I′ti−∑j∈J′tj<d, which contradicts the choice of I and J.
We let E:=(ti:i∈I) and O be the sequence that is obtained from (tj:j∈J) by adding the element d∈[m].
Then ∣O∣=⌊(r−1)/2⌋+1=⌈r/2⌉, ∣E∣=⌈(r−1)/2⌉=⌊r/2⌋, and ∑o∈Oto=∑e∈Ete.
We choose v2i−1′ to be the ith element of O for every i∈[⌈r/2⌉] and v2j′ to be the jth element of E for every j∈[⌊r/2⌋].
Then we set vi:=vi′+(i−1)m∈Vi for each i∈[r] and obtain cr,h({v1,…,vr})=0.
One sequence (v1′,…,vr′) is obtained from at most r! (r−1)-tuples T with d added.
Thus x≥(⌈m/2⌉r−1−⌈m/2⌉)/r!.
Altogether, we have an estimate fr(h)≥2((m/2)r−1−⌈m/2⌉)/r!, which is at least 2r(r−1)(h−1)−2r, as m=rh−1.
∎
If n=rh, then the bound from Lemma 14 gives the lower bound 2nr−1/r3r on the number of r-monotone colorings of Knr.
For n that is not a power of r, we have rh−1<n<rh for some h∈N and we can use the estimate 2nr−1/r4r.
4.2. An upper bound on the number of monotone colorings
Here, using a result of Felsner and Valtr [11], we show that, for integers r≥3 and n≥r, the number of r-monotone colorings of Knr is at most 22r−2nr−1/(r−1)!.
We proceed by induction on r.
For r=3, Felsner and Valtr [11] showed that the number of sign functions of simple arrangements of n pseudolines is at most 20.657n2≤2n2.
By Theorem 6, sign functions of simple arrangements of n pseudolines correspond to 3-monotone colorings of Kn3 and thus the number of such monotone colorings is also at most 2n2.
This constitutes the base case.
For the induction step, we assume r≥4.
Let c be an r-monotone coloring of Knr with vertex set [n].
For i∈{r,…,n}, the ith projection of Knr is the function pi that maps an edge {v1,…,vr−1,i} of Knr with v1<⋯<vr−1<i to {v1,…,vr−1}.
The image of Knr via pi is the ordered complete (r−1)-uniform hypergraph Ki−1r−1.
Note that for every edge e of Ki−1r−1 there is a unique edge e′=pi−1(e) of Knr with pi(e′)=e.
If c is an r-monotone coloring of Knr, then we use pi(c) to denote the 2-coloring of Ki−1r−1 obtained by coloring an edge e of Ki−1r−1 with the color c(pi−1(e)).
We show that every pi(c) is an (r−1)-monotone coloring of Ki−1r−1.
Suppose for contradiction that there is an i∈{r,…,n} such that pi(c) is not an (r−1)-monotone coloring of Ki−1r−1.
Then there is an r-tuple R of vertices from [i−1] such that the sequence SR=(pi(c)(R(r)),…,pi(c)(R(1))) has at least two changes of a sign.
It follows from the definition of pi that, for the (r+1)-tuple T=R∪{i}, we have c(T(j))=pi(c)(R(j)) for every j∈[r].
Thus the sequence ST=(c(T(r+1)),…,c(T(1))) equals to the sequence that is obtained from SR by adding the first coordinate c(T(r+1))=c(R).
Then, however, there are at least two changes of a sign in ST, which contradicts the assumption that c is r-monotone.
Every r-monotone coloring c of Knr thus yields a sequence Sc=(pr(c),…,pn(c)) of (r−1)-monotone colorings.
Moreover, the mapping c↦Sc is injective.
For every i∈{r,…,n}, the number of choices for pi(c) is at most 22r−3(i−1)r−2/(r−2)! by the induction hypothesis.
Altogether, the number of sequences Sc, and thus also the number of r-monotone colorings of Knr, is at most
[TABLE]
To derive the last inequality, we used the estimate ∑i=1nir−2≤r−1nr−1+nr−2≤2nr−1/(r−1) for the power sum [3].
This finishes the proof of the upper bound in Theorem 3.
Acknowledgment
First of all, I would like to thank the anonymous referees for carefully going through the manuscript and for very helpful suggestions that really improved the overall presentation of the paper.
I would also like to thank Attila Pór and Pavel Valtr for interesting discussions during the early stages of the research.