This paper establishes decision criteria and an algorithm for finite difference Groebner bases of normal binomial difference ideals, simplifying complex properties to elementary polynomial conditions.
Contribution
It introduces novel criteria and an algorithm for finite difference Groebner bases of normal binomial difference ideals, reducing complex properties to elementary polynomial checks.
Findings
01
Criteria for finite difference Groebner bases are established.
02
An algorithm for computing these bases is provided.
03
Complex properties are reduced to elementary polynomial conditions.
Abstract
In this paper, we give decision criteria for normal binomial difference polynomial ideals in the univariate difference polynomial ring F{y} to have finite difference Groebner bases and an algorithm to compute the finite difference Groebner bases if these criteria are satisfied. The novelty of these criteria lies in the fact that complicated properties about difference polynomial ideals are reduced to elementary properties of univariate polynomials in Z[x].
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPolynomial and algebraic computation · Coding theory and cryptography · Cryptography and Residue Arithmetic
Full text
Criteria for Finite Difference Gröbner Bases of
Normal Binomial Difference Ideals††thanks: Partially supported by a grant from NSFC 11101411.
Yu-Ao Chen and Xiao-Shan Gao
KLMM, UCAS, Academy of Mathematics and Systems Science
The Chinese Academy of Sciences, Beijing 100190, China
Abstract
In this paper, we give decision criteria for normal binomial
difference polynomial ideals in the univariate difference polynomial ring F{y} to have finite difference Gröbner bases
and an algorithm to compute the finite difference Gröbner bases
if these criteria are satisfied.
The novelty of these criteria lies in the fact that
complicated properties about difference polynomial ideals
are reduced to elementary properties of univariate polynomials in Z[x].
Difference algebra founded by Ritt and Cohn aims to study algebraic
difference equations in a similar way that polynomial equations are
studied in commutative algebra and algebraic geometry [18, 5, 14, 21].
The Gröbner basis invented by Buchberger is a powerful
tool for solving many mathematical problems [4].
The concepts of difference Gröbner bases was extended to
linear difference polynomial ideals in [14, 15, 11]
and nonlinear difference polynomial ideals in [11].
Many applications of difference Gröbner bases were given [9, 16, 14, 15].
Since difference polynomial ideals can be infinitely
generated, their difference Gröbner bases are generally infinite.
Even for finitely generated difference polynomial ideals,
their difference Gröbner bases could be infinite as shown by Example 2.2
in this paper.
This makes it impossible to compute difference Gröbner bases
for general difference polynomial ideals
and thus it is a crucial issue to give criteria for difference polynomial ideals to
have finite difference Gröbner bases.
Let F be a difference field and y a difference indeterminate.
In this paper, we will give decision criteria for normal binomial
difference polynomial ideals in F{y} to have finite difference Gröbner bases
and an algorithm to compute these finite difference Gröbner bases under these criteria.
A difference ideal I in F{y} is called normal if
MP∈I implies P∈I for any difference monomial M in F{y}
and P∈F{y}.
I is called binomial if it is generated by difference polynomials with
at most two terms [7, 6].
For f∈Z[x],
let f+,f−∈N[x] be the
positive part and the negative part of f such that f=f+−f−.
For h=∑i=0maixi∈N[x], denote
yh=∏i=0m(σiy)ai, where σ is the difference operator of F. Then any difference monomial in F{y} can be written
as yg for some g∈N[x].
For a given f∈Z[x] with a positive leading coefficient,
we consider the following binomial difference polynomial ideal in F{y}:
[TABLE]
where
sat is the difference saturation ideal to be defined in Section 2 of this paper.
Let
[TABLE]
We prove that If has a finite difference Gröbner basis
if and only if f∈Φ1.
This criterion is then extended to general normal binomial difference ideals in F{y}.
The decision of f∈Φ1 is quite nontrivial and
we give the following criteria for f∈Φ1 based on the roots of f:
if f has no positive roots, then f∈Φ1;
2. 2.
if f has more than one positive roots (with multiplicity counted), then f∈Φ1;
3. 3.
if f has one positive root x+ and a root z
such that ∣z∣>x+, then f∈Φ1;
4. 4.
if f has one positive root x+ and a root z
such that ∣z∣=x+, then we can compute another f∗∈Z[x]
and x∗∈R>0
such that f∗(x∗)=0, f∗(w)=0 and ∣w∣=x∗ imply w=x∗,
and f∗(w)=0 and ∣w∣=x∗ imply ∣w∣<x∗.
Furthermore, f∈Φ1 if and only if f∗∈Φ1;
5. 5.
if f∈/Φ0 has a unique positive real root x+ and x+<1,
then f∈Φ1;
6. 6.
if f(1)=0 and any other root z of f satisfies |z|<\,$$1, then f∈Φ1 if and only if f(x)/(x-\,$$1)\in\,$$\mathbb{Z}[x^{\delta}] for some δ∈N>0 and f(x)(x^{\delta}-1)/(x-\,$$1)\in\Phi_{0}.
With these criteria, only one case is open:
f has a unique positive real root x+, x+>1, and x+>∣z∣ for any other root z of f.
We conjecture that f∈Φ1 in the above case based on numerical computations.
If If has a finite difference Gröbner basis according to one of the six criteria listed above,
we also give an algorithm to compute it.
As far as we know the above criteria are the first non-trivial ones
for a difference polynomial ideal to have a finite difference Gröbner basis.
The novelty of these criteria lies in the fact that
complicated properties about difference polynomial ideals
are reduced to elementary properties of univariate polynomials in Z[x].
The rest of this paper is organized as follows.
In Section 2, preliminaries on Gröbner basis for difference polynomial ideals
are given.
In Section 3, criteria for normal binomial difference ideals in F{y} to have finite difference Gröbner bases are given.
In Section 4, criteria for f∈Φ1 and an algorithm to compute
the finite difference Gröbner basis of If under these criteria are given.
In Section 5, we propose an approach based on integer programming
to find g such that fg∈Φ0 and give a lower bound
for deg(g) in certain cases.
2 Preliminaries on Gröbner basis of difference polynomial ideals
2.1 Gröbner basis of a difference polynomial ideal
An ordinary difference field, or simply a σ-field, is a field
F with a third unitary operation σ satisfying: for any
a,b∈F, σ(a+b)=σ(a)+σ(b),
σ(ab)=σ(a)σ(b), and σ(a)=0 if and only if
a=0.
We call σ the difference or transforming operator of F.
A typical example of σ-field is
Q(λ) with σ(f(λ))=f(λ+1).
In this paper, we use σ- as the abbreviation for
difference or transformally.
For a in any σ-extension ring of F and n∈N>0,
σn(a) is called the n-th transform of a and denoted by axn, with the usual assumption
a0=1 and x0=1.
More generally, for p=∑i=0scixi∈N[x], denote
ap=∏i=0s(σia)ci.
For instance, a3x2+x+4=(σ2(a))3σ(a)a4.
It is easy to check that ap satisfies the properties of powers [7].
Let S be a subset of a σ-field G which
contains F. We will denote
Θ(S)={σka∣k∈N,a∈S}, F{S}=F[Θ(S)].
Now suppose Y={y1,…,yn} is a set of
σ-indeterminates over F. The elements of F{Y} are called σ-polynomials over F in Y.
A σ-polynomial idealI, or simply a
σ-ideal, in F{Y} is a
possibly infinitely generated ordinary algebraic ideal
satisfying σ(I)⊂I.
If S is a subset of F{Y}, we use (S) and [S] to denote the algebraic ideal and the
σ-ideal generated by S.
A monomial order in F{Y} is called compatible with the
σ-structure, if yixk1<yjxk2 for k1<k2.
Only compatible monomial orders are considered in this paper.
When a monomial order is given, we use LM(P) and LC(P) to denote the largest monomial
and its coefficient in P respectively, and LT(P)=LC(P)LM(P) the leading term of P.
Definition 2.1**.**
G⊂F{Y}* is called a σ-Gröbner basis of
a σ-ideal I if for any P∈I, there exist m∈N
and G∈G such that (LM(G))xm∣LM(P).*
From the definition, G is a σ-Gröbner basis of I if and only if
Θ(G) is a Gröbner basis of I treated as an algebraic polynomial ideal
in F[Θ(Y)].
Note that I is generally an infinitely generated ideal and the concept of
infinite Gröbner basis [12] is adopted here.
From this observation, we may see that a σ-Gröbner basis satisfies
most of the properties of the usual algebraic Gröbner basis.
For instance, G is a σ-Gröbner basis of a σ-ideal I if
and only if for any P∈I, we have grem(P,Θ(G))=0, where grem(P,Θ(G)) is
the normal form of P modulo Θ(G) in the theory of Gröbner basis.
The concepts of reduced σ-Gröbner bases could be similarly introduced.
A σ-polynomial Q is called σ-reduced w.r.t. another σ-polynomial P if there does not exist a k∈N such that LM(P)xk divides any monomial in Q.
Then, a σ-Gröbner G basis is called reduced, if any P∈G
is σ-reduced w.r.t G∖{P}.
It is easy to see that a σ-ideal has a unique reduced σ-Grb̈ner basis.
The following example shows that even a finitely generated σ-ideal may have an infinite σ-Gröbner basis.
As a consequence, there exist no general algorithms to compute the σ-Gröbner basis.
Example 2.2**.**
Let I=[y1y2x−y1xy2,y1y3−1]. Assume y1<y2<y3. Then under a compatible monomial order,
the reduced σ-Gröbner basis of I∩F{y1,y2} is
{y1y2xi−y1xiy2∣i∈N>0}.
2.2 Characteristic set for a difference polynomial ideal
The elimination rankingR on Θ(Y)={σkyi∣1≤i≤n,k∈N} is used in this paper:
σkyi>σlyj if and only if i>j or i=j
and k>l, which is a total order over Θ(Y). By
convention, 1<σkyj for all k∈N.
Let f be a σ-polynomial in F{Y}. The
greatest yjxk w.r.t. R which appears
effectively in f is called the leader of f, denoted by
ld(f) and correspondingly yj is called the *leading
variable *of f, denoted by lvar(f)=yj.
The leading coefficient of f as a univariate polynomial in
ld(f) is called the initial of f and is denoted by
initf.
Let p and q be two σ-polynomials in F{Y}.
q is said to be of higher rank than p if
ld(q)>ld(p) or
ld(q)=ld(p)=yjxk and deg(q,yjxk)>deg(p,yjxk).
Suppose ld(p)=yjxk. q is said to be Ritt-reduced
w.r.t. p if deg(q,yjxk+l)<deg(p,yjxk) for all
l∈N.
A finite sequence of nonzero σ-polynomials A:A1,…,Am is said to be a
difference ascending chain, or simply a σ-chain, if
m=1 and A1=0 or
m>1, Aj>Ai and Aj is Ritt-reduced w.r.t. Ai for 1≤i<j≤m.
A σ-chain A can be written as the following form [8]
[TABLE]
where lvar(Aij)=yci for j=1,…,ki, ord(Aij,yci)<ord(Ail,yci) and
deg(Aij,ld(Aij))>deg(Ail,ld(Ail))
for j<l.
The following are two σ-chains
[TABLE]
Let A:A1,A2,…,At be a σ-chain with
Ii as the initial of Ai, and P any σ-polynomial.
Then there exists an algorithm, which reduces
P w.r.t. A to a σ-polynomial R that is
Ritt-reduced w.r.t. A and satisfies the relation
[TABLE]
where the ei∈N[x] and
R=prem(P,A) is called the σ-Ritt-remainder of P w.r.t. A [8].
A σ-chain C contained in a σ-polynomial
set S is said to be a characteristic set of
S, if S does not contain any nonzero
element Ritt-reduced w.r.t. C. Any σ-polynomial
set has a characteristic set.
A characteristic set
C of a σ-ideal J reduces to zero all
elements of J.
Let A:A1,…,At be a σ-chain, Ii=init(Ai),
ylixoi=ld(Ai).
A is called regular if for any j∈N, Iixj is
invertible w.r.t A [8] in the sense that
[A1,…,Ai−1,Iixj] contains a nonzero
σ-polynomial involving no ylixoi+k,k=0,1,….
To introduce the concept of coherent σ-chain, we need to
define the Δ-polynomial first. If Ai and Aj have
distinct leading variables, we define Δ(Ai,Aj)=0. If Ai
and Aj (i<j) have the same leading variable yl,
ld(Ai)=ylxoi, and ld(Aj)=ylxoj, then
oi<oj [8].
Define
Δ(Ai,Aj)=prem((Ai)xoj−oi,Aj).
Then A is called coherent if prem(Δ(Ai,Aj),A)=0 for all i<j
[8].
Both A1 and A2 in (2) are regular and coherent σ-chains.
Let A be a σ-chain. Denote IA to be the minimal multiplicative set containing the initials
of elements of A and their transforms. The saturation ideal of A is defined to be
[TABLE]
The following result is needed in this paper.
Theorem 2.3**.**
[8, Theorem 3.3]**
A σ-chain A is a characteristic set of sat(A) if and
only if A is regular and coherent.
We also need the concept of algebraic saturation ideal.
Let C be an algebraic triangular set in F[x1,…,xn]
and I the product of the initials of the polynomials in C.
Then define
[TABLE]
2.3 σ-Gröbner basis for a binomial σ-ideal
A σ-monomial in Y can be written as Yf=∏i=1nyifi, where f=(f1,…,fn)τ∈N[x]n.
A nonzero vector f=(f1,…,fn)τ∈Z[x]n is said to be normal if the leading coefficient of fs is positive, where s
is the largest subscript such that fs=0.
For f∈Z[x]n, let f+,f−∈Nn[x] denote respectively the
positive part and the negative part of f such that f=f+−f−.
Then gcd(Yf+,Yf−)=1 for any f∈Z[x]n.
If f∈Z[x]n is normal, then Yf+>Yf− and LT(Yf+−cYf−)=Yf+ under a monomial
order compatible with the σ-structure.
A σ-binomial in Y is a σ-polynomial with
at most two terms, that is, aYa+bYb where a,b∈F
and a,b∈N[x]n.
A σ-ideal in F{Y} is called binomial if it is
generated by, possibly infinitely many, σ-binomials [7].
We have
A σ-ideal I is binomial if and only if the reduced
σ-Gröbner basis for I consists of σ-binomials.
Let \mathbbmm be the multiplicative set generated by yixj for
i=1,…,n,j∈N.
A σ-ideal I is called normal if for M∈\mathbbmm
and P∈F{Y}, MP∈I implies P∈I.
Normal σ-ideals in F{Y} are closely related with the
Z[x]-modules in Z[x]n [13, 7], which will be explained below.
We first introduce a new concept.
Definition 2.5**.**
A partial character ρ on Z[x]n is a homomorphism
from a Z[x]-module Lρ in Z[x]n to the multiplicative group
F∗ satisfying ρ(xf)=(ρ(f))x=σ(ρ(f)) for f∈Lρ.
A Z[x]-module generated by h1,…,hm∈Z[x]n is denoted as
(h1,…,hm)Z[x].
Let ρ be a partial character over Z[x]n and \mathbbmf={f1,…,fs} a reduced
Gröbner basis of the Z[x]-module Lρ=(\mathbbmf)Z[x].
For h∈Z[x]n and H⊂Lρ, denote Ph=Yh+−ρ(h)Yh−
and PH={Ph∣h∈H}.
Introduce the following notations associated with ρ:
[TABLE]
It is shown that [7] A+(ρ) is a regular and coherent σ-chain and hence is a characteristic set of sat(A+(ρ)) by Theorem 2.3. Furthermore, we have
Theorem 2.6**.**
The following conditions are equivalent.
I* is a normal binomial σ-ideal in F{Y}.*
2. 2.
I=I+(ρ)* for a partial character ρ over Z[x]n.
*
3. 3.
I=sat(A+(ρ))* for a partial character ρ over Z[x]n.*
Furthermore, for f∈Z[x]n, Yf+−cYf−∈I⇔f∈Lρ and c=ρ(f).
As a direct consequence of Proposition 2.4 and Theorem 2.6, we have
Corollary 2.7**.**
Let ρ be a partial character over Z[x]n.
Then PLρ is a σ-Gröbner basis of I+(ρ).
Note that for f∈Z[x]n, either f or −f is normal and
we need only consider the normal vectors in the σ-Gröbner basis.
So, for simplicity, we may assume that all given vectors are normal.
We have the following criterion for the σ-Gröbner basis of normal binomial σ-ideals.
Corollary 2.8**.**
Let ρ be a partial character over Z[x]n and H⊂Lρ.
Then PH is a σ-Gröbner basis of I+(ρ) if and only if
for any normal g∈Lρ, there exist h∈H and j∈N, such that
g+−xjh+∈N[x]n.
Proof:
By Corollary 2.7, PLρ is a σ-Gröbner basis of I+(ρ).
Then PH is a σ-Gröbner basis of I+(ρ) if and only if
for any normal g∈Lρ, there exist h∈H and j∈N such that
LM(xjPh)∣LM(Pg), which is equivalent to
g+−xjh+∈N[x]n.
Example 2.9**.**
Let f=[1−x,x−1], L=(f)Z[x], and ρ the trivial
partial character on L, that is, ρ(h)=1 for h∈L.
Then Pf=y1y2x−y1xy2.
By Theorem 2.6, I+(ρ)=sat(Pf).
By Corollary 2.7, a σ-Gröbner basis of I+(ρ)
is {Yg+−Yg−∣g=hf,h∈Z[x],lc(h)>0}.
By Example 2.2, sat(Pf)=[Pf,y1y3−1]∩Q{y1,y2}=[y1y2xi−y1xiy2∣i∈N>0], and
a reduced σ-Gröbner basis of I+(ρ) is
{y1y2xi−y1xiy2∣i∈N>0}.
3 Criteria for finite σ-Gröbner basis
In this section, we will give a criterion for the σ-Gröbner basis of
a normal binomial σ-ideal in F{y} to be finite,
where y is a σ-indeterminate.
Without loss of generality, we assume ρ(h)=1 for all partial characters ρ over Z[x] and h∈Lρ.
3.1 Case 1: characteristic set contains a single σ-polynomial
In this section, we consider the simplest case: n=1 and Lρ=(f)Z[x] is generated by one polynomial f∈Z[x].
We will see that even this case is highly nontrivial.
For g∈Z[x], we use lc(g), lm(g), and lt(g) to represent the
leading coefficient, leading monomial, and leading term of g, respectively.
In the rest of this section, we assume f∈Z[x] and lc(f)>0.
Then Pf=yf+−yf− and LT(Pf)=yf+ under a monomial order compatible with the σ-structure.
By Theorem 2.6, all normal binomial σ-ideals in F{y} whose
characteristic set consists of a single σ-polynomial can be written as the following form:
[TABLE]
In this section, we will give a criterion for If
to have a finite σ-Gröbner basis. Define
[TABLE]
We now give the main result of this section, which can be deduced from Lemma 3.3 and Lemma 3.7.
Theorem 3.1**.**
If* in (6) has a finite σ-Gröbner basis under a monomial order compatible
w.r.t the σ-structure if and only if f\in\,$$\Phi_{1}.*
For two polynomials h1 and h_{2}\in\,$$\mathbb{Z}[x], denote h_{1}\succeq\,$$h_{2} if h_{1}-\,$$h_{2}\in\mathbb{N}[x].
For h1 and h_{2}\in\,$$\mathbb{N}[x], we have h_{1}\succeq\,$$h_{2} if and only if yh2∣yh1.
Lemma 3.2**.**
If f∈Φ0, then {Pf} is a σ-Gröbner basis of If.
Proof:
For g∈(f)Z[x] with lc(g)>[math], \exists\,h\in\,$$\mathbb{Z}[x] with lc(h)>[math] such that g=\,$$fh. Since f∈Φ0, we have lt(f)=f+. Then,
[TABLE]
By Corollary 2.8, {Pf} is a σ-Gröbner basis of If.
Lemma 3.3**.**
If f\in\,$$\Phi_{1}, then If has a finite σ-Gröbner basis.
Proof:
Let h=fg∈Φ0, where g is monic.
Then lc(h)=lc(f) and lt(h)=lt(f)lm(g)=h+.
Ideg(h)=If⋂F[y,yx,⋯,yxdeg(h)]
is a polynomial ideal in a polynomial ring with finitely many variables, which has a finite
Gröbner basis denoted by G⩽deg(h).
Let Pu∈If and lc(u)>[math]. If \textup{deg}(u)\leqslant\,$$\textup{deg}(h),
then there exists a Pt∈G⩽deg(h) such that t⪯u.
Otherwise, we have deg(u)>deg(h) and lc(u)≥lc(f). Then
[TABLE]
Since that Ph∈Ideg(h),
by Corollary 2.8, G⩽deg(s) is a finite σ-Gröbner basis of If.
Corollary 3.4**.**
Let f\in\,$$\Phi_{1}, h=gf∈Φ0, g a monic polynomial in Z[x] , and D=deg(h).
Then the Gröbner basis of the polynomial ideal
ID=If⋂F[y,yx,⋯,yxD]
is a finite σ-Gröbner basis for If.
f=x2+x+1∈Φ1, because (x−1)f=x3−1∈Φ0.
The finite σ-Gröbner basis is G={yx2+x+1−1,yx3−y}.
Let D be R or Z. We will use the following new notation
[TABLE]
Lemma 3.6**.**
N[x]⊆Φ1.
Proof:
Let g=a_{n}x^{n}+a_{n-1}x^{n-1}+\cdots+a_{0}\in\,$$\mathbb{N}[x] with d$$=\textup{max}\{d\in\,$$\mathbb{N}\,|\,x^{d}\,|\,g\} the multiplicity of f at [math]. Then ad>[math]. Let s=\,$$(x^{n-d}+x^{n-d-1}+\cdots+1)g=a_{n}x^{2n-d}+(a_{n}+a_{n-1})x^{2n-d-1}+\cdots+(a_{n}+\cdots+a_{d})x^{n}+(a_{n-1}+\cdots+a_{d})x^{n-1}+\cdots+a_{d}x^{d}. Rewrite s=b2n−dx2n−d+⋯+bdxd.
Then s/xd∈Z>0[x].
Let M=⌈max{bi−1/bi∣d+1⩽i⩽2n−d}⌉+1.
Then (x−M)s=b2n−dx2n−d+1+(b2n−d−1−Mb2n−d)x2n−d+⋯+(bd−Mbd+1)xd+1−Mbdxd∈Φ0.
So both s and g are in Φ1.
Lemma 3.7**.**
If f∈Φ1, then If does not have a finite σ-Gröbner basis.
Proof:
Suppose otherwise, If has a finite σ-Gröbner basis G=PH,
where H={f1,⋯,fl}⊂Z[x] with each lc(fi)>0.
Since f has the lowest degree in (f)Z[x], we have f∈H.
Let Hc≜{h∈H∣lc(h)=lc(f)}.
Since f∈/Φ1, we have Hc⋂Φ1=∅.
By Lemmas 3.2 and 3.6, for all h∈Hc,
h+ has at least two terms and h− has at least one term.
For u∈Z[x] with lc(u)>0,
define a function
[TABLE]
which is the degree gap between the first two highest monomials of u+.
Suppose h1 is an element in Hc such that
deg(h1)=max{deg(h)∣h∈Hc}.
h1 exists because f∈Hc=∅ and Hc is a finite set.
Denote lt(h1)≜axn, h~1≜h1−lt(h1), lt(h~1+)≜bxm, and h~~1+≜h~1+−lt(h~1+). Then h1=axn+bxm+h~~1+−h1−.
Since h1∈Φ1, we have ab>0.
Let c≜⌈b/a⌉≥1 and
[TABLE]
We have s+⪯s0≜ax2n+xnh~~1++cxmh1−,
and deg(s)=deg(s)−deg(s+−lt(s))≥deg(s0)=deg(s0)−deg(s0+−lt(s0))>n−m=deg(h1)=deg(h1)−deg(h1+−lt(h1)).
Since PH is a σ-Gröbner basis of If,
there exist h∈H and j∈N
such that t=s+−xjh+∈N[x].
We claim lt(t)=lt(s+).
If h∈Hc, then deg(s)>deg(h).
Note that deg(s+)=deg(xjh) implies that
the coefficient of the second largest monomial of s+−xjh is negative
contradicting to the fact s+−xjh∈N[x].
As a consequence, we must have deg(s+)>deg(xjh) and the claim is proved in this case.
Now let h∈H\Hc.
Since lc(h)>lc(s)=lc(f),
we have deg(xjh)<deg(s) which
implies lt(t)=lt(s+). The claim is proved.
The fact lt(t)=lt(s+) implies that when computing the
normal form Pu=grem(Ps,Θ(PH)), we always have
lt(u)=lt(s). As a consequence, Pu=0 which contradicts to
the fact that PH is a σ-Gröbner basis of
If and s∈(f)Z[x].
Note that the proof of Lemma 3.7 gives a method to
construct infinitely many elements in a σ-Grb̈ner basis as
shown in the following example.
Example 3.8**.**
Let f=x2−2x+1∈/Φ1.
In the proof of Lemma 3.7, c=⌈b/a⌉=1 and
s1=(x2−1)f=x4+2x−2x3−1.
Repeat the above procedure to s1, we obtain
s2=(x4−2x)s1=x8+3x4+2x−2x7−4x2.
Then deg(f)<deg(s1)<deg(s2)
and Psi is in a σ-Gröbner basis for all i.
Thus any σ-Gröbner basis of If is infinite.
We can show that a minimal σ-Gröbner basis is G={yx2i+1−y2xi∣i∈Z>0}⋃{yx2i+1+1−yxi+1+xi∣i∈Z>0}.
3.2 Finite σ-Gröbner bases for normal binomial σ-ideals
In this section, we consider the general normal binomial σ-ideals in F{y}.
By Theorem 2.6, all normal binomial σ-ideals in F{y}
can be written as the following form:
[TABLE]
where
[TABLE]
is a reduced Gröbner basis of the Z[x]-module L=(G)Z[x].
Gröbner bases in Z[x] have the following special structure [7].
Lemma 3.9**.**
Let G={g1,…,gk} be a reduced Gröbner basis of a
Z[x]-module in Z[x], g1<⋯<gk, and
lt(gi)=cixdi∈N[x]. Then
1) 0≤d1<d2<⋯<dk.
2) ck∣⋯∣c2∣c1 and ci=ci+1 for 1≤i≤k−1.
3) ckci∣gi for 1≤i<k. If
b1 is the primitive part of g1, then
b1∣gi for 1<i≤k.
Here are two Gröbner bases in Z[x]: {4,2x}, {15,5x,x2+3}.
In the rest of this section, let L=(G)Z[x] for
G defined in (11) and define
[TABLE]
Theorem 3.10**.**
IG* has a finite σ-Gröbner basis if and only if Li⋂Φ0=∅.*
Proof:
Suppose Li⋂Φ0=∅ and let g∈Li⋂Φ0. Then IG⋂k[y,yx,⋯,yxdeg(g)] has a finite
Gröbner basis denoted by G≤deg(g).
Let Pu∈IG and lc(u)>0. If deg(u)≤deg(g), then there exists a Ph∈G≤deg(g) such that h⪯u.
Otherwise, we have deg(u)>deg(g) and lc(u)≥lc(g). Then
[TABLE]
By Corollary 2.8, G≤deg(g) is a finite σ-Gröbner basis of IG, since Pg is in G≤deg(g).
We will prove the other direction by contradiction.
Suppose that Li∩Φ0=∅ and
IG has a finite σ-Gröbner basis PH={Pu1,⋯,Puk}.
Let H={u1,⋯,uk}, and Hc=H⋂Li.
Since grem(Pgt,Θ(PH))=0, we have
Hc=∅ and let u1 be an element of Hc with maximal deg
which is defined in (9).
Since Li∩Φ0=∅, by Lemma 3.6u1+ contains at least two terms
and u1−=0.
Similar to the proof of Lemma 3.7, we can construct
an s∈Z[x]∩L such that deg(s)>deg(u1)
and lc(s)=lc(u1). Then, grem(Ps,Θ(PH))=0 contradicting
to the fact that PH is a σ-Gröbner basis.
Corollary 3.11**.**
If IG has a finite σ-Gröbner basis,
then g1∈Φ1.
Proof:
Let b1 be the primitive part of g1.
Then by Lemma 3.9, b1∣h for any h∈L.
By Theorem 3.10, b1 and hence g1 is in Φ1.
Corollary 3.12**.**
If Lt⋂Φ1=∅ and in particular gt∈Φ1, then IG has finite σ-Gröbner Basis.
The following example shows that gt∈Φ1 is not a necessary condition for the σ-Gröbner basis to be finite.
Example 3.13**.**
Let G={2(x2−2),(x2−2)(x+1)}.
Then (x2−2)(x+1)(x−1)+2(x2−2)=x4−x2−2∈Φ0⊂Φ1,
and hence IG has a finite σ-Gröbner basis. On the other hand, we
will show (x2−2)(x+1)∈/Φ1 in Example 4.10.
In order to give another criterion, we need the following effective Polya Theorem.
Suppose that f(x)=j=0∑nanxn∈R[x] is positive on [0,∞)
and F(x,y) the homogenization of f.
Then for Nf>2λn(n−1)L−n, (1+x)Nff(x)∈R>0[x],
where λ=min{F(x,1−x)∣x∈[0,1]}
and L=max{n!k!(n−k)!∣ak∣}.
Corollary 3.15**.**
If there exists an h∈L with no positive real roots, then IG has a finite σ-Gröbner basis.
Proof:
Write h=xm1h1 such that h1(0)=0.
By Lemma 3.14, there exists an N∈N such that h2=(x+1)Nh∈Z>0[x].
Take a sufficiently large N such that deg(h2)>dt=deg(gt).
Then there exists a sufficiently large M∈N, such that
g=xm1(xdeg(h2)−deg(gt)+1gt−Mh2)∈Φ0.
Since g∈Li, by Lemma 3.10, I has a finite σ-Gröbner Basis.
4 Membership decision for Φ1 and σ-Gröbner basis computation
In Section 3, we prove that sat(Pf) has a finite σ-Gröbner basis
if and only if f∈Φ1. In this section, we will give criteria and an algorithm for f∈Φ1.
If f∈Φ1, we also give an algorithm to compute the finite σ-Gröbner basis.
From the definition of Φ1, a necessarily condition for f∈Φ1 is lc(f)>0.
Also, it is easy to show that f∈Φ1 if and only if cxmf∈Φ1 for positive integers
c and m.
So in the rest of this paper, we assume
[TABLE]
such that n>0, lc(f)=an>0, f(0)=a0=0, and gcd(a0,a1,…,an)=1.
4.1 Decision criteria
In this subsection, we will study whether f∈Φ1 by examining properties of the roots of f(x)=0.
Lemma 4.1**.**
If f∈Z[x] has no positive real roots, then f∈Φ1.
Proof:
By Lemma 3.14, there exists an N\in\,$${\mathbb{N}}, such that (x+1)^{N}f\in\,$${\mathbb{Z}}^{>0}[x]\subseteq{\mathbb{N}}[x]. By Lemma 3.6, (x+1)^{N}f\in\,$${\mathbb{N}}[x]\subseteq\Phi_{1}, and thus f∈Φ1.
By Lemma 4.1, we need only consider those polynomials which have positive roots.
Lemma 4.2**.**
Let f=anxn+⋯+a0∈Φ0. Then
f has a simple and unique positive real root x+,
and for any root z of f, we have ∣z∣≤x+.
Proof:
Since f∈Φ0∖Z, the number of sign differences of f is one.
Then by Descartes’ rule of signs[1], the number of positive real roots of f
(with multiplicities counted) is one or less than one by an even number.
Then f has a simple and unique positive real root x+.
For any root z of f, since −ai≥0 for i=0,…,n−1, we have
[TABLE]
Thus f(∣z∣)⩽[math] and hence f has at least one real root in [∣z∣,∞).
Since f has a unique positive real root x+, we have |z|\leqslant\,$$x_{+}.
We now consider those f which has a root z=x+ and ∣z∣=x+.
Such a z must be either −x+ or a complex root.
Lemma 4.3**.**
Let f=anxn+⋯+a0∈Φ0 and x+ the unique positive root of f.
If f has a root z=x+ but ∣z∣=x+, then we have
zδf∈R>0* and z is a simple root of f,
where δf=gcd{i∣ai=0}>1.*
2. 2.
f* is a polynomial in xδf: f=f∘xδf, where ∘ is the function composition.
Furthermore, f(w)=0 and ∣w∣=x+δf imply w=x+δf.*
3. 3.
f* has exactly δf roots with absolute value x+: {z∣f(z)=0,∣z∣=x+}={ζkx+∣ζ=eδf2πi,k=1,…,δf}, where i=−1.*
Proof:
Let z=x+ be a root of f such that ∣z∣=x+.
Then f(∣z∣)=f(x+)=an∣z∣n+an−1∣z∣n−1+⋯+a0=0,
which, combining with (14), implies
∣−an−1zn−1−⋯−a0∣=−an−1∣z∣n−1−⋯−a0.
The above equation is possible if and only if
−aizi∈R>0 for each i≤n−1 and ai=0.
Also note, zn=(−an−1∣z∣n−1−⋯−a0)/an∈R>0.
Then, zi∈R>0 for each i≤n and ai=0.
Note that zm∈R>0 and zk∈R>0 imply zm−k∈R>0.
As a consequence, zδf∈R>0 for δf=gcd{i∣ai=0}.
Since z=x+, we have δf>1.
Part 1 of the lemma is proved.
From the definition of δf, f is a polynomial of xδf:
f(x)=f(x)∘(xδf).
It is easy to see that f(x)∈Φ0.
Let f(x)=bkxk+⋯+b1x+b0.
Then gcd{j∣bj=0}=1.
By the first part of this lemma, we know x+δf is the only root of f whose absolute value
is x+δf.
Since zδf and x+δf are both the unique positive real roots of f(x),
we have z^{\delta_{f}}=\,$$x_{+}^{\delta_{f}} and hence z is a simple root of f.
Part 2 of the lemma is proved.
Part 3 of the lemma comes from the fact z^{\delta_{f}}=\,$$x_{+}^{\delta_{f}} is
the unique positive real root of f
and f(z)=f(zδf)=0.
Corollary 4.4**.**
If f\in\,$$\Phi_{1} has at least one positive real root x+, then x+ is the unique positive real root of f, x+ is simple and for any root z of f, x_{+}\geqslant\,$$|z|.
If f has a root z=x+ satisfying |z|=\,$$x_{+}, then
z is simple, and
zδ∈R>0 for some δ∈N>1, or equivalently,
the argument of z satisfies \textup{Arg}(z)/\pi\in\,$$\mathbb{Q}.
Example 4.5**.**
f=(x2−5)(x2−2x+5)∈/Φ1, because the root
z=1+2i satisfies ∣z∣=5 but zδ∈/R>0
for any δ∈N.
The following example shows that the multiplicity for
a root z satisfying ∣z∣<x+ could be any number.
Example 4.6**.**
For any n,k∈N>1, (x+1)n(x−k)∈Φ1.
Let n=1, (x+1)(x−k)∈Φ0.
Let f1(x)=(x+1)2 and fn+1(x)=fn(x)(x2⌊deg(fn)/2⌋+1+1)
for n>1. Then we have (x+1)n+1∣fn(x), fn(x)∈Z>0[x], and all coefficients of fn are either 1 or 2. Thus, fn(x)(x−k)∈Φ0 and (x+1)n(x−k)∈Φ1 by definition.
Lemma 4.7**.**
Let q(x)∈Z[x] be a primitive irreducible polynomial and δ∈N>1. Then (q)Z[x]⋂Z[xδ]=(q(xδ))Z[xδ],
where q∈Z[x] is primitive and irreducible and
q(xδ)m=Ru(uδ−xδ,q(u)) for some m∈N.
We use Ru to denote the Sylvester resultant w.r.t. the variable u.
Furthermore, the roots of q(x) are {zδ∣q(z)=0}.
Proof:
Let q(x)=a∏j=1n(x−zj), ζδ=e2πi/δ, and
[TABLE]
We claim that R(xδ) is primitive.
We have lc(Ru(uδ−xδ,q(u)))=lc(∏l=1δq(ζδlx))=aδ.
Let c∈Z be a prime factor of aδ or a.
Since q is primitive, q=0(modc). Let q(x)=bxm+⋯(modc).
Then lt(R(xδ))=lt(∏l=1δq(ζδlx))=∏l=1δb(ζδlx)m=bδxδm=0(modc). So c∤R(xδ) and thus R(xδ) is primitive.
Since Q[xδ] is a PID and R(xδ)∈(q)Q[x]⋂Q[xδ],
there exists a primitive polynomial q∈Z[x]
such that (q(xδ))Q[xδ]=(q)Q[x]⋂Q[xδ].
Since q(x)∣q(xδ) and q is irreducible,
q(x) must be irreducible.
Since both q(x) and q(x) are primitive,
we can deduce
(q(xδ))Z[xδ]=(q)Z[x]⋂Z[xδ]
from (q(xδ))Q[xδ]=(q)Q[x]⋂Q[xδ].
Since q(x)∣q(xδ), Zδ={ζδkzj∣k=1,…,δ,j=1,…,n} is a subset of the roots of q(xδ).
Let S(x) be the square-free part of R(x)∈Z[x],
which is also primitive.
Since Zδ contains exactly the roots of R(xδ) and S(xδ),
we have S(x)∣q(x).
Since q(x) is irreducible and S(x) is the square-free part of R(x), we have S(x)=q(x)
and hence R(xδ)=q(xδ)m for some m∈N[x].
Finally, since the roots of q(xδ) are Zδ,
the roots of q(x) are {zδ∣q(z)=0}.
Corollary 4.8**.**
Let δ∈N and f=∏j=1mqjαj, where
∈N and qj are primitive irreducible polynomials in Z[x] with positive leading coefficients.
Let qi∗(xδ) be the square-free part of Ru(uδ−xδ,qi(u)) and f∗≜lcm({qj∗αj∣j}).
Then
[TABLE]
Furthermore, the roots of f∗(x) are {zδ∣f(z)=0}.
Proof:
By Lemma 4.7, we have (qi)Z[x]⋂Z[xδ]=(qi∗(xδ))Z[xδ].
Then
(f)Z[x]⋂Z[xδ]=i=0⋂s((qiαi)Z[x]⋂Z[xδ]=i=0⋂s(qi∗αi)Z[xδ]=(lcm({qi∗αi∣i}))Z[xδ]=(f∗(xδ))Z[xδ].
From f∗≜lcm({qj∗αj∣j}) and Lemma 4.7,
the roots of f∗(x) are {zδ∣f(z)=0}.
Theorem 4.9**.**
Let f\in\,$$\mathbb{Z}[x] have a unique positive root x+
and any root w of f satisfies ∣w∣≤x+.
If there exists a minimal δ∈N>1 such that
for all root z=x+ of f, ∣z∣=x+ implies zδ∈R>0.
Let f∗(xδ)∈Z[xδ] be the polynomial in (15).
Then f∈Φ1 if and only if lc(f)=lc(f∗) and
f∗∈Φ1.
Proof: “⇐"
Since lc(f)=lc(f∗) and (f)∩Z[xδ]=(f∗(xδ)),
there exists a monic polynomial h∈Z[x] such that f∗(xδ)=fh.
Since f∗∈Φ1, there exists a monic polynomial g∈Z[x] such that f∗(x)g(x)∈Φ0.
Then f∗(xδ)g(xδ)=fhg(xδ)∈Φ0.
Since hg(xδ) is monic, we have f∈Φ1.
“⇒"
Since f\in\,$$\Phi_{1}, there exists a primitive polynomial h\in\,$$(f)\bigcap\Phi_{0} with h(0)=[math] and \textup{lc}(h)=\,$$\textup{lc}(f).
Each such h has some roots whose absolute value is x+.
Since f∣h, by part 3 of Lemma 4.3 we have δ∣δh,
where δh=gcd{k∣xk is in h}.
By Lemma 4.3, h\in\,$$\mathbb{Z}[x^{\delta_{h}}]\subset{\mathbb{Z}}[x^{\delta}]. Thus h∈(f)⋂Z[xδ]=(f∗)Z[xδ].
Since lc(f)∣lc(f∗)∣lc(h) and lc(f)=lc(h),
we have lc(f)=lc(f∗)=lc(h), so f∗∈Φ1.
Example 4.10**.**
Let f=(x2−2)(x+1). Then δ=2 and f∗=(x−2)(x−1) has two positive roots and
hence f∈Φ1 by Corollary 4.4 and Theorem 4.9.
*Let f1=x2−2, f2=x2−2x+2, and f=f1f2.
Then δ=8, f1∗=x−16, f2∗=x−16, and f∗=x−16.
Hence f∈Φ1.
*
Corollary 4.11**.**
Let f∗(x) be the polynomial defined in Theorem 4.9.
Then f∗(x) has only one root (may be a multiple root) whose absolute value is x+δ
and any root z=x+δ of f∗ satisfies ∣z∣<x+δ.
Proof:
By Corollary 4.8, the roots of f∗(x) are {zδ∣f(z)=0}.
Then the corollary comes from the fact that x+ is the unique positive real root of f
and f(z)=0,∣z∣=x+ imply zδ∈R>0.
By Corollary 4.11, when f has a unique positive real root x+,
we reduce the decision of f∈Φ1 into the decision of f∗∈Φ1,
where f∗ has only one root with absolute value x+δ.
Lemma 4.12**.**
If f\in\,$$\Phi_{1}\setminus\Phi_{0} has a unique positive real root x+, then x_{+}\geqslant\,$$1.
Proof:
There exists a monic polynomial g\in\,$$\mathbb{Z}[x] such that fg\in\,$$\Phi_{0}. Since f\notin\,$$\Phi_{0}, g is not a monomial. Without loss of generality we assume g(0)=0, and then ∏g(z)=0∣z∣=∣g(0)/lc(g)∣=∣g(0)∣≥1
which implies \max_{g(z)=0}(|z|)\geqslant\,$$1.
Since x+ is the unique positive root of fg, by Lemma 4.2, we have x_{+}\geqslant\,$$\max_{g(z)=0}(|z|)\geqslant\,$$1.
The following two lemmas give simple criteria to check whether f∈Φ1
in the case of f(1)=0.
Lemma 4.13**.**
Let f\in\,$$\mathbb{Z}[x] be a primitive polynomial, f(1)=[math].
If δ∈N is the smallest number such that all root z of f satisfies zδ=1,
then f\in\,$$\Phi_{1} if and only if f^{*}(x)=\,$$x-1, where f∗ is defined in (15).
Proof:
By Theorem 4.9, if f^{*}(x)=\,$$x-1 then f∈Φ1.
Suppose f∈Φ1. By Lemma 4.3, any root of f is simple and hence f is square-free.
Let δ=lcm{m∈N∣zm=1}. Since f is primitive,
δ∈N is the smallest number such that f(x)∣xδ−1 in Z[x].
Therefore, so f∗(x)=x−1.
Example 4.14**.**
Let f=(x−1)(x2+1)(x3+1). Then δ=12 and f∗=x−1. So, f∈Φ1.
Let f=(x−1)(x2+1)2(x3+1). Then δ=12 and f∗=(x−1)2. So, f∈/Φ1.
Lemma 4.15**.**
If f(1)=0 and any other root z of f satisfies |z|<\,$$1, then f\in\,$$\Phi_{1} if and only if f(x)/(x-\,$$1)\in\,$$\mathbb{Z}[x^{\delta}] for some δ∈N>0 and f(x)(x^{\delta}-\,$$1)/(x-\,$$1)\in\Phi_{0}.
Proof:
The necessity is obvious.
For the other direction, there exists a monic polynomial g\in\,$$\mathbb{Z}[x] such that fg\in\,$$\Phi_{0}. We claim that each root z of g has absolute value 1.
Since g is monic, ∏g(z)=0∣z∣≥1.
Since fg∈Φ0 and f(1)=0, maxg(z)=0∣z∣≤1,
and the claim is proved.
By Lemma 4.2, fg∈Z[xδ], where δ=δfg.
Since f(1)=0 and all other roots of f have absolute value <1, we have (xδ−1)∣fg and ((xδ−1)/(x−1))∣g.
By part 3 of Lemma 4.3, the roots of fg with absolute value
1 are exactly the roots of xδ−1.
Since the absolute values of all roots of g is 1 and g has no multiple roots by Lemma 4.3, g=(xδ−1)/(x−1).
Since fg∈Z[xδ] and (xδ−1)∣fg, set fg=(xδ−1)h(xδ) for h∈Z[x].
From g=(xδ−1)/(x−1), we have f/(x−1)=h(xδ)∈Z[xδ].
Now, only when f∈/Φ0, f has a unique positive real root x+>1, and any other root of f has absolute value <x+, we
do not know how to decide f∈Φ1.
By computing many examples, we propose the following conjecture.
Conjecture 4.16**.**
If f\in\,$$\mathbb{Z}[x]\backslash\Phi_{0} has a simple and unique positive real root x+, x+>1, and x+>∣z∣ for any other root z of f, then f∈Φ1.
4.2 Algorithm for f∈Φ1
Based on the results proved in the preceding section, we give the following algorithm to decide
whether f∈Φ1. Note that the last step of the algorithm depends on whether
Conjecture 4.16 is true.
In what below, we will give the details for Algorithm 1
and prove its correctness.
We will use algorithms for real root isolation
and complex root isolation for univariate polynomials.
Please refer to the latest work on these topics and
references in these papers [19, 2].
Step 1 is trivial to check.
Step 2 can be done with any real root isolation algorithm.
Step 3 can be done by first factoring f as the product of irreducible polynomials
and then isolating the real roots of each factor of f.
Step 4.1 is trivial to check.
For Step 4.2, there exists a δ∈N such that (z)δ=1
if and only if each irreducible factor of f(x) is a cyclotomic polynomial,
which can be checked with the Graeffe method in [3]
and the δ can also be founded.
The polynomial f∗ in Step 4.2 can be computed with Corollary 4.8.
In Step 4.3, the δ can be found from the fact f(x)/(x−1)∈Z[xδ].
If f(x)(xδ−1)/(x−1)∈Φ0 for some δ satisfying f(x)/(x−1)∈Z[xδ], then return true; otherwise return false.
In Steps 4.4, 4.5, and 4.6, we need to check whether
f has a root z=x+ such that ∣z∣>x+,
∣z∣=x+, and zm∈R>0 for some m∈N.
To do that, we first give a lemma.
Lemma 4.17**.**
Let p(x)=a∏i=1n(x−xi)∈Z[x], q(x)=b∏j=1m(x−yj)∈Z[x],
and xiyj=0 for all i,j.
Then the roots of Ru(p(u),q(ux)) are {yj/xi∣i=1,⋯,n,j=1,⋯,m}
and the roots of Ru(unp(x/u),q(u)) are {xiyj∣i=1,⋯,n,j=1,⋯,m}.
Proof:
The lemma comes from
Ru(p(u),q(ux))=ambn∏i,j(x−xj/yi)
and
Ru(unp(x/u),q(u))=a0mbn∏i,j(x−xixj),
where a0=p(0).
In the rest of this section, we assume
[TABLE]
where fi are primitive and irreducible polynomials with positive leading coefficients.
Also assume that f(x) has a unique positive root x+ which is the root of f0(x).
By Lemma 4.17, the real roots of all ri(x) include x+2
and zz, where z is a complex root of ri(x).
Then the condition in Step 4.4 of the algorithm can be checked with the following result
based on real root isolation.
Corollary 4.18**.**
f* has a root z such that ∣z∣>x+ if and only if
some ri(x) has a positive root larger than x+2.*
It is easy to check whether −x+ is a root of fi:
since fi is irreducible, −x+ is a root of fi
if and only if fi(−x)=±fi(x).
If z is complex root of fi such that ∣z∣=x+,
then x+2,x+2=z.z,x+2=z.z are all roots of ri.
Then, we have the following result.
Corollary 4.19**.**
Let mi be the multiplicity of x+2 as a root of ri
and ni the multiplicity of −x+ as a root of fi
(the multiplicity is set to be zero if x+2 or −x+ is not a root).
Then #{z∣f0(z)=0,∣z∣=x+,z∈/R}=m0−n0−1 and
#{z∣fi(z)=0,∣z∣=x+,z∈/R}=mi−ni for i>0.
As usual, a representation of a complex root z is a pair
(p,B) where p is an irreducible polynomial and
B a box such that p(z)=0 and z is the only root of p in B.
A box is represented by its lower-left and upper-right
vertexes: ([xl,yl],[xt,xt]).
By the following lemma, we can find representations for
all roots z of f satisfying ∣z∣=x+.
Lemma 4.20**.**
Suppose fi has s roots z1,…,zs satisfying ∣zj∣=x+.
Then, we can find representations for zj.
Proof:
Since fi is irreducible, fi is the minimal polynomial for zi.
Suppose I=(a,b) is an isolation interval for x+.
By algorithms of complex root isolation and real root isolation,
we can simultaneously refine I and the
isolation boxes of the roots of fi
such that the number of isolation boxes
meet the region a<∣x∣<b will eventually becomes s.
These s boxes are the isolation boxes for z1,…,zs,
since fi has exactly s roots satisfying ∣z∣=x+.
Lemma 4.21**.**
Let z be a root of fk satisfying ∣z∣=x+.
Then, we can find a representation for z/x+.
Proof:
Let H(x)=Ru(f0(u),fk(ux))∈Z[x] and hi(x),i=1,…,s the irreducible factors of H.
From Lemma 4.17, H(z/x+)=0 and hc(z/x+)=0 for certain c
and we will show how to find hc.
Isolate the roots of hi,i=1,…,s and refine the isolation box B=([xl,yl],[xt,xt]) of z
and the isolation interval of x+=(l,r) simultaneously such that
([xl/r,yl/r],[xt/l,xt/l]) intersects only one of the isolation boxes of hi,i=1,…,s.
This box B1 should be the isolation box for z/x+.
If B1 contains a root of fc, then fc is the minimal polynomial for z/x+.
With the following lemma, we can check whether zm∈R>0 for some m.
Lemma 4.22**.**
Let z be a root of fk satisfying ∣z∣=x+
and q the minimal polynomial for z/x+.
Then we can decide whether there exists an m∈N such that (z/x+)m=1,
and if such an m exists, we can compute the minimal m.
Proof:
There exists an m∈N such that (z/x+)m=1
if and only if q(x) is a cyclotomic polynomial,
which we can be tested by the Graeffe method in [3].
The method also gives the m such that (z/x+)m=1.
The minimal m can be found easily.
Now, we consider Step 4.5. With Corollary 4.19 and Lemma 4.20,
we can find all the roots
z of f satisfying ∣z∣=x+. For each such z,
we can check whether there exists a δz∈N
such that (z/x+)δz=1 with Lemma LABEL:lm-cyc.
Hence the conditions of Step 4.5 can be checked.
Now, we consider Step 4.6.
The δ in Step 4.6 can be computed as δ=lcm{δz∣f(z)=0,∣z∣=x+,(z/x+)δz=1}.
With δ given, f∗ in Step 4.6 can be computed with Corollary 4.8.
From Corollary 4.8, the roots of f∗ are {zδ∣f(z)=0}.
As a consequence, when running Membership\Phi_{1}$$(f^{*}),
only Steps 1, 3, 4.7 will be executed,
and no further calls to Membership\Phi_{1}$$(f^{*}) are needed.
4.3 Compute the finite σ-Gröbner basis
Let f∈Φ1, we will show how to compute the finite σ-Gröbner basis
for If=sat(Pf) in (6).
Lemma 4.23**.**
Let f∈Φ1, h=fg∈Φ0 for a monic g∈Z[x],
and D=deg(h). Then
[TABLE]
and a Gröbner basis of ID is a σ-Gröbner basis of If.
Proof:
By the remark before Theorem 2.6, Pf is regular and coherent.
Then P∈ID if and only if prem(P,Pf)=0
which is equivalent to P∈asat(Pf,Pxf,…,PxD−deg(f)f)
[7], and (18) is proved.
By Corollary 3.4, a Gröbner basis of ID is a σ-Gröbner basis of If.
The Gröbner basis of ID, denoted as G(f,D),
can be computed with the following well-known fact
[TABLE]
where J=init(Pf) and z is a new indeterminate.
Therefore, in order to compute the σ-Gröbner basis of If,
it suffices to compute D. We thus have the following algorithm.
In the rest of this section, we will prove the correctness of the algorithm.
Step 1 follows Lemma 3.2.
For Step 2, by Lemma 3.14,
(x+1)Nff∈Z>0[x].
Following the proof of Lemma 3.6,
for a sufficiently large M∈N,
(x−M)(x+1)Nff∈Φ0.
Then, D=deg((x−M)(x+1)Nff)=Nf+deg(f)+1.
For Step 3.1, following Step 4.2 of Algorithm 1,
we have f∗(xδ)=f(x)g(x)=xδ−1 for some g.
Then D=δ.
For Step 3.2, following Step 4.3 of Algorithm 1,
f(x)(xδ−1)/(x−1)∈Φ0.
Then D=deg(f)+δ−1.
For Step 3.3, from the proof of Step 4.6 of Algorithm 1,
there exist three possibilities:
f∗(x)∈Φ0, f∗(x) has at least two positive roots,
or f∗ satisfies the conditions of Conjecture 4.16.
Since we already assumed f∗∈Φ1, only f∗(x)∈Φ0 is possible.
From f∗(xδ)=f(x)s(x), we have D=δdeg(f).
We now proved the correctness of Algorithm 2.
5 Approach based on integer programming and lower bound
Given an f∈Z[x], the existence of a monic polynomial
g∈Z[x] with deg(g)≤m, such that fg∈Φ0
can be reduced to an integer programming problem.
Based on this idea, a lower bound for deg(g) is given in certain cases.
Lemma 5.1**.**
Given a polynomial f(x)=anxn+⋯+a0∈Z[x] with an>0, there exists a monic polynomial g∈Z[x] with deg(g)≤m, such that fg∈Φ0 if and only if a (bm−1,⋯,b0)∈Zm satisfies
[TABLE]
Moreover such g has degree <m if and only if b0=0 for some feasible solution of the above inequalities.
Proof:
Let g(x)=xm+bm−1xm−1+⋯+b0. The leading coefficient of fg is an>0, and the coefficient of xk is the m+n−k-row of the left side of (32) for k=m+n−1,…,0.
If deg(g)<m, the coefficients of g1(x)=xm−deg(g)g(x) is a feasible solution with b0=0. If b0=0, (1,bm−1,⋯,b1) is a feasible solution of (32) for m=m−1.
The following result gives another criterion for the existence of g.
Lemma 5.2**.**
Given a polynomial f(x)=anxn+⋯+a0∈Z[x] with an>0, let (1/f)(x)≜λ0+⋯+λmxm+⋯∈Z[a0−1][[x]].
There exists a monic polynomial g∈Z[x] with deg(g)≤m and fg∈Φ0 if and only if there exists a (cm+n−1,⋯,c0)∈Nm+n such that
[TABLE]
Proof:
Extending the proof of Lemma 5.1, let bm+n−1≜(bm+n−1,⋯,b0)T. For the following special Jordan form
[TABLE]
By Lemma 5.1, fg∈Φ0 if and only if f(Jm+n)b∈Z≤0m+n for some (bm−1,⋯,b0)∈Zm with (bm+n−1,⋯,bm)=(0,⋯,0,1). Let c=(cm+n−1,⋯,c0)T≜−f(Jm+n)b∈Nm+n. Then we have f(Jm+n)−1c=(1/f)(Jm+n)c=−b, that is
[TABLE]
Since we need only to know the existence of ci, only the first n rows
are need, and the lemma is proved.
Note that a0i+1λi∈Z for any i∈N.
We can reduce the coefficient matrix in the above lemma into
an integer matrix.
Corollary 5.3**.**
Let f,g∈R[x], lc(f)>0, g monic, and
(1/f)(x)≜∑m=0∞λmxm∈R[[x]].
If lt(fg)=(fg)+, then deg(g)≥min{j∈N∣λj<0}.
Proof:
From the proof of Lemma 5.2,
there exists a monic g∈R[x] such that lt(fg)=(fg)+
if and only if (33) has a solution (cm+n−1,⋯,c0)∈R>0m+n.
If λ0,…,λm≥0, the last coordinate of (33) is ∑j=0mλjcm−j≥0, hence ∑j=0mλjcm−j=−1
and (33) has no solution in R>0m+n.
As a consequence, if lt(fg)=(fg)+, then deg(g)≥min{j∈N∣λj<0}
and the corollary is proved.
Corollary 5.4**.**
Let f(x)=ax2+bx+c∈R[x], a>0, b2−4ac<0, and z a root of f.
If fg∈Φ0 and g is monic, then deg(g)≥⌊π/∣Arg(z)∣⌋=⌊π/arctan(4ac−b2/b)⌋.
Proof:
Let f(x)=a(x−z)(x−zˉ), and z=reθi where r∈R>0
and θ=Arg(z)=kπ.
Without loss of generality, we can assume 0<θ<π. Then
[TABLE]
that is, λj=arj+2sinθsin((j+1)θ).
Since λ0=ar21>0,
min{j∈N∣λj<0}=min{j∈N∣(j+1)θ>π}=⌊π/θ−1⌋+1=⌊π/θ⌋.
By Corollary 5.3, deg(g)≥⌊π/θ⌋=⌊π/arctan(4ac−b2/b)⌋.
We can now give a lower bound for the degree of g such that fg∈Φ0
in certain case.
Theorem 5.5**.**
*If a polynomial f(x)∈Z[x] is of degree n and has at least one root
not in R, then
min{deg(g)∣g∈Z[x]is monic and fg∈Φ0}≥max{⌊π/∣Arg(z)∣⌋−n+2∣f(z)=0,z∈/R}.
*
Proof:
Since f(x)∈Z[x] has at least one root not in R, f=f1f2 where f2
is a quadratic polynomial in R[x] which has two complex roots.
Suppose there exists a monic g∈R[x] such that
lt(fg)=(fg)+ or lt(f1f2g)=(f1f2g)+.
By Corollary 5.4, deg(g)≥⌊π/∣Arg(z)∣⌋−deg(f1)=⌊π/∣Arg(z)∣⌋−n+2.
Then, min{deg(g)∣g∈Z[x]is monic and fg∈Φ0}≥min{deg(g)∣g∈R[x]is monic and lt(fg)=(fg)+}≥max{⌊π/∣Arg(z)∣⌋−n+2∣f(z)=0,z∈/R}.
The following result shows that the lower bound given in the preceding theorem
is also the upper bound for quadratic polynomials.
Proposition 5.6**.**
Let f(x)=a2x2+a1x+a0=a2(x−z)(x−zˉ) be a quadratic polynomial in Z[x]
with a root complex z=a+bi=reθi, where a2,b,r>0, 0<θ<π, zˉ=a−bi. Then min{deg(g)∣g∈Z[x] and monic,fg∈Φ0}=⌊π/θ⌋.
Proof:
If π/2<θ<π, then a1=−2a>0 and hence f∈N>0[x].
By the proof of Lemma 3.6, there exists an N such that (x−N)f∈Φ0
and hence deg(f)=1=⌊π/θ⌋.
If θ=π/2, then f=a2x2+a0. It is easy to check deg(f)=2=⌊π/θ⌋.
From now on, we assume 0<θ<π/2, so a>0 and a1<0.
Considering f1(x)=(x−a−bi)(x−a+bi)=x2−2ax+a2+b2∈Z[a2−1][x], we will solve the integer programming mentioned in Lemma 5.1:
[TABLE]
Let Δ1=−2a and r Δj+1=−2a−(a2+b2)/Δj for j>1.
Then
[TABLE]
Let m0=⌈π/θ⌉−1. Then we have Δj<0 for j=1,⋯,m0−1 but Δm0≥0.
We will do row transformations on (45) to relax its feasible region.
Let m=m0−1. We add (m+1)-th row multiplied by 1/(−Δ1)>0 to the m-th row. Then the −2a at the m-th row becomes Δ2=−2a−(a2+b2)/Δ1, and the 1 at the m-th row becomes [math].
Then add m-th row multiplied by 1/(−Δ2)>0 to the (m−1)-th row.
Repeat the above process until Δm0≥0, and we obtain a lower triangular matrix:
[TABLE]
If Δm0>0, the first coordinate of the left side of
[TABLE]
is Δm0>0. So the feasible region of (63) is empty and hence the feasible region of (45) is also empty.
Thus fg∈/Φ0 for any monic polynomial g of degree <m0 by Lemma 5.1.
Let m=m0. We have
[TABLE]
Similarly, we can obtain a quasi-upper trangular matrix from (45) by row transformations:
In (88), we need to show that there exists a rational number bj satisfying
[TABLE]
We need to show
[TABLE]
which is true from the first ‘<’ in (91) when j=j+1.
Then we can choose some rational number bm0−1,⋯,b0 satisfying (87) and (91), and then (1,bm0−1,⋯,b0) is a feasible solution of (45). Taking the common denominator N∈N≥1 of {bj∣j=0,⋯,m0−1}, we have
[TABLE]
[TABLE]
[TABLE]
[TABLE]
and then
[TABLE]
Then Δm0>0 implies deg(f)=m0=⌈π/θ⌉−1=⌊π/θ⌋.
2. 2.
If Δm0=0, π/θ=m0+1>2, z=reπi/(m0+1). Then e2πi/(m0+1) is a root of (x−1)−2Ru(f(x),f(ux))=a2a0x2+(2a2a0−a12)x+a2a0. Since e2πi/(m0+1) is integral over Z, we have a0a2∣(2a2a0−a12) or a0a2∣a12.
For 0<2π/(m0+1)<π, a2a0x2+(2a2a0−a12)x+a2a0 has no real roots,
and then we have (2a2a0−a12)2−4(a2a0)2<0, that is a12<4a0a2. Then we have m0=2 when a12=a0a2, m0=3 when a12=2a0a2 or m0=5 when a12=3a0a2.
(a)
If m0=2 and Δ2=0, f(x)=a2x2+a1x+a0, where a1=−a0a2. When solving (45) for m=3, we have
[TABLE]
In order for an integer b2 to satisfy these inequations, we need to assume
[TABLE]
Here b0<0 implies min{deg(g)∣fg∈Φ0}≥3, so deg(f)=3=π/θ=⌊π/θ⌋.
2. (b)
If m0=3 and Δ3=0, f(x)=a2x2+a1x+a0, where a1=−2a0a2. When we solve (45) for m=4, we have
[TABLE]
[TABLE]
When we want
[TABLE]
we only need
[TABLE]
Here b0≤−a02/a22<0 implies min{deg(g)∣fg∈Φ0}≥4, so deg(f)=4=π/θ=⌊π/θ⌋.
3. (c)
If m0=5 and Δ5=0, f(x)=a2x2+a1x+a0, where a1=−3a0a2. Rewriting a2f(x)=a22x2+a2a1x+3a12, When we solve (45) for a2f(x) for m=6, we get
[TABLE]
[TABLE]
Because b5<0 implies a2a1b4<3a223a1a2b4−a12b5, a2a1b4<b3 implies 3a22a1b3<3a223a1a2b3−a12b4, 3a22a1b3<b2 implies 2a2a1b2<3a223a1a2b2−a12b3, and 2a2a1b2<b1 implies 3a2a1b1<3a223a1a2b1−a12b2, there exists a feasible solution {b5,b4,b3,b2,b1,b0}∈Q<06, which is an inner point of the semi-algebraic set. Using the same notations in (92), let N∈N>1 be the common denominator of {b0,…,b5}, and we have f(x)(x6+N∑j=05bjxj)∈Φ0.
Here b0<0 implies min{deg(g)∣fg∈Φ0}≥6, so deg(f)=6=π/θ=⌊π/θ⌋.
We complete the proof.
The following example is used to illustrate the proof.
Example 5.7**.**
Let f=x2−x+2, Δ1=−1, Δ2=1>0, m0=2, deg(f)=2.
Here f∈/N[x] implies deg(f)>1, and (x2−x+2)(x2−5x−7)∈Φ0 implies deg(f)≤2.
Example 5.8**.**
Let f=x2−2x+2. By the effective Polya Theorem 3.14, we have d1=min{deg(g)∣g∈Z[x] and monic,fg∈Φ0}≤10.
However, we have min{deg(g)∣g∈Z[x] and monic,fg∈Φ0}=4 by proposition 5.6, where g=x4−2x2−4x−4 and fg=x6−2x5−8.
6 Conclusion
In this paper, we study when a σ-ideal has a finite σ-Göbner basis.
We focused on a special class of σ-ideals:
normal binomial σ-ideals which can be be described by
the Gröbner basis of a Z[x]-module.
We give a criterion for a univariate normal binomial σ-ideal to have a
finite σ-Gröbner basis.
When the characteristic set of the σ-ideal consists of one σ-polynomial,
we can give constructive criteria for the σ-ideal to have a finite σ-Gröbner basis
and an algorithm to compute the finite σ-Gröbner basis under these criteria.
One case is still not solved and we summary it as a conjecture.
Also, it is desirable to extend the criteria given in this paper to multivariate binomial
σ-ideals.
Example 2.9 shows that extending Theorem 3.1 to the multivariate case is
quite nontrivial.
For σ-Gröbner basis of general σ-ideals, the work on
monomial σ-ideals may be helpful [20].
Bibliography21
The reference list from the paper itself. Each links out to its DOI / PubMed record.
1[1] G.E. Collins and A.G. Akritas. Polynomial Real Root Isolation Using Descarte’s Rule of Signs, Proc. 1976 ACM Symposium on Symbolic and Algebraic Computation , 272-275.
2[2] R. Becker, M. Sagraloff, V. Sharma, J. Xu, C. Yap. Complexity Analysis of Root Clustering for a Complex Polynomial Proc. ISSAC ’16 , 71-78, 2016, ACM Press.
3[3] R. J. Bradford and J. H. Davenport, Effective Tests for Cyclotomic Polynomials, ISSAC 1988 , LNCS 358, 244-251, Springer, Berlin-New York,1989
4[4] B. Buchberger. Bruno Buchberger’s Ph D thesis 1965: An algorithm for finding the basis elements of the residue class ring of a zero dimensional polynomial ideal, Journal of Symbolic Computation , 41, 475-511, 2016.
5[5] R. M. Cohn. Difference Algebra , Interscience Publishers, New York, 1965.
6[6] D. Eisenbud and B. Sturmfels. Binomial Ideals, Duke Math. J. , 84(1), 1-45, 1996.
7[7] X. S. Gao, Z. Huang, C. M. Yuan. Difference Binomial Ideals, Journal of Symbolic Computation , 80(3), 665-706, 2017.
8[8] X. S. Gao, Y. Luo, C. M. Yuan. A Characteristic Set Method for Ordinary Difference Polynomial Systems, Journal of Symbolic Computation , 44(3), 242-260, 2009.