Higher degree S-lemma and the stability of quadratic modules

Philipp Jukic

arXiv:1701.07013·math.AG·January 26, 2017

Higher degree S-lemma and the stability of quadratic modules

Philipp Jukic

PDF

Open Access

TL;DR

This paper investigates a higher-degree generalization of the S-lemma related to Hilbert's theorem on ternary quartics, demonstrating its limitations through geometric and algebraic analysis within quadratic modules.

Contribution

It introduces new tools to analyze the non-existence of a higher-degree S-lemma generalization, linking geometric and algebraic perspectives.

Findings

01

Higher-degree S-lemma generalization is not possible without extra conditions.

02

Established a connection between geometric and algebraic reasons in quadratic modules.

03

Extended tools by Netzer to analyze positivity and stability in polynomial modules.

Abstract

In this work we will investigate a certain generalization of the so called S-lemma in higher degrees. The importance of this generalization is, that it is closely related to Hilbert's 1888 theorem about tenary quartics. In fact, if such a generalization exits, then one can state a Hilbert-like theorem, where positivity is only demanded on some semi-algebraic set. We will show that such a generalization is not possible, at least not without additional conditions. To prove this, we will use and generalize certain tools developed by Netzer ([Ne]). These new tools will allow us to conclude that this generalization of the S-lemma is not possible because of geometric reasons. Furthermore, we are able to establish a link between geometric reasons and algebraic reasons. This will be accomplished within the framework of quadratic modules.

Figures5

Click any figure to enlarge with its caption.

Equations90

S (f_{1}, \dots, f_{s}) = {x \in R^{n} : f_{1} (x) \geq 0, \dots, f_{s} (x) \geq 0}

S (f_{1}, \dots, f_{s}) = {x \in R^{n} : f_{1} (x) \geq 0, \dots, f_{s} (x) \geq 0}

aff (S) = {i = 1 \sum n λ_{i} s_{i} : s_{1}, \dots, s_{n} \in S, λ_{1}, \dots, λ_{n} \in R, i = 1 \sum n λ_{i} = 1, n \in N} .

aff (S) = {i = 1 \sum n λ_{i} s_{i} : s_{1}, \dots, s_{n} \in S, λ_{1}, \dots, λ_{n} \in R, i = 1 \sum n λ_{i} = 1, n \in N} .

relint (C) = {x \in C : \exists ε > 0 : B_{ε} (x) \cap aff (C) \subseteq C} .

relint (C) = {x \in C : \exists ε > 0 : B_{ε} (x) \cap aff (C) \subseteq C} .

ψ : P_{2, n} \to R^{2}, h \mapsto (tr (A_{f_{1}} A_{h}), tr (A_{f_{2}} A_{h})) .

ψ : P_{2, n} \to R^{2}, h \mapsto (tr (A_{f_{1}} A_{h}), tr (A_{f_{2}} A_{h})) .

\forall (u_{1}, u_{2}) \in C : α_{1} u_{1} + α_{2} u_{2} \geq 0 \forall x \in R^{n} : α_{1} f (x) + α_{2} g (x) \leq 0.

\forall (u_{1}, u_{2}) \in C : α_{1} u_{1} + α_{2} u_{2} \geq 0 \forall x \in R^{n} : α_{1} f (x) + α_{2} g (x) \leq 0.

f = f_{1} (x) + f_{2} (x) + c_{f} g = g_{1} (x) + g_{2} (x) + c_{g}

f = f_{1} (x) + f_{2} (x) + c_{f} g = g_{1} (x) + g_{2} (x) + c_{g}

\overline{f} = f_{1} (x) + y f_{2} (x) + y^{2} c_{f} \overline{g} = g_{1} (x) + y g_{2} (x) + y^{2} c_{g} .

\overline{f} = f_{1} (x) + y f_{2} (x) + y^{2} c_{f} \overline{g} = g_{1} (x) + y g_{2} (x) + y^{2} c_{g} .

\overline{f} (x, y) < 0 \overline{g} (x, y) \geq 0.

\overline{f} (x, y) < 0 \overline{g} (x, y) \geq 0.

f (\frac{x}{y}) = \frac{f ( x , y )}{y ^{2}} g (\frac{x}{y}) = \frac{g ( x , y )}{y ^{2}}

f (\frac{x}{y}) = \frac{f ( x , y )}{y ^{2}} g (\frac{x}{y}) = \frac{g ( x , y )}{y ^{2}}

\overline{f} (x, 0) < 0 \overline{g} (x, 0) \geq 0.

\overline{f} (x, 0) < 0 \overline{g} (x, 0) \geq 0.

x_{1} \to \infty lim (f (x_{1}, β x_{1}, 1) - t (x_{1}, β x_{1}, 1) g (x_{1}, β x_{1}, 1)) = - \infty,

x_{1} \to \infty lim (f (x_{1}, β x_{1}, 1) - t (x_{1}, β x_{1}, 1) g (x_{1}, β x_{1}, 1)) = - \infty,

x_{1} \to \infty lim (f (x_{1}, β x_{1}, 1) - σ_{2} (x_{1}, β x_{1}, 1) g (x_{1}, β x_{1}, 1)) \to - \infty,

x_{1} \to \infty lim (f (x_{1}, β x_{1}, 1) - σ_{2} (x_{1}, β x_{1}, 1) g (x_{1}, β x_{1}, 1)) \to - \infty,

f_{1} = z_{2}^{2} x_{1}^{2} + x_{1}^{3} + z_{2} x_{1}^{4} = E -component x_{1}^{2} V_{1}^{'} -component (z_{2}^{2} + x_{1} + z_{2} x_{1}^{2})

f_{1} = z_{2}^{2} x_{1}^{2} + x_{1}^{3} + z_{2} x_{1}^{4} = E -component x_{1}^{2} V_{1}^{'} -component (z_{2}^{2} + x_{1} + z_{2} x_{1}^{2})

g_{1} = x_{1} + z_{2} x_{1} + z_{2} x_{1}^{2} = E -component x_{1} V_{2}^{'} -component (1 + z_{2} + z_{2} x_{1}) .

g_{1} = x_{1} + z_{2} x_{1} + z_{2} x_{1}^{2} = E -component x_{1} V_{2}^{'} -component (1 + z_{2} + z_{2} x_{1}) .

i < j \sum a_{ij} x_{1}^{i} (\frac{x _{2}}{x _{1}})^{j} = \frac{a _{i^{'} j^{'}} x _{2}^{j^{'}} + \sum _{i < j, (i, j) \neq = (i^{'}, j^{'})} a _{ij} x _{2}^{j} x _{1}^{Δ_{ij}}}{x _{1}^{j^{'} - i^{'}}} .

i < j \sum a_{ij} x_{1}^{i} (\frac{x _{2}}{x _{1}})^{j} = \frac{a _{i^{'} j^{'}} x _{2}^{j^{'}} + \sum _{i < j, (i, j) \neq = (i^{'}, j^{'})} a _{ij} x _{2}^{j} x _{1}^{Δ_{ij}}}{x _{1}^{j^{'} - i^{'}}} .

h (0, x_{2}) := a_{i^{'} j^{'}} x_{2}^{j^{'}} + i < j, (i, j) \neq = (i^{'}, j^{'}), Δ_{ij} = 0 \sum a_{ij} x_{2}^{j}

h (0, x_{2}) := a_{i^{'} j^{'}} x_{2}^{j^{'}} + i < j, (i, j) \neq = (i^{'}, j^{'}), Δ_{ij} = 0 \sum a_{ij} x_{2}^{j}

n \to \infty lim (ϕ (f_{1}) (x_{n}) - ϕ (t) (x_{n}) ϕ (g_{1}) (x_{n})) = n \to \infty lim (f (x_{n}) - ϕ (t) (x_{n}) g (x_{n})) = - \infty.

n \to \infty lim (ϕ (f_{1}) (x_{n}) - ϕ (t) (x_{n}) ϕ (g_{1}) (x_{n})) = n \to \infty lim (f (x_{n}) - ϕ (t) (x_{n}) g (x_{n})) = - \infty.

ν (d) = {d - 2, if d \leq 6 d - \frac{d - 6}{2} - 2, if d > 6 .

ν (d) = {d - 2, if d \leq 6 d - \frac{d - 6}{2} - 2, if d > 6 .

de g (σ_{j} f_{j}) \leq de g (i \sum σ_{i} f_{i})

de g (σ_{j} f_{j}) \leq de g (i \sum σ_{i} f_{i})

A_{d}^{(z)} := ⎩ ⎨ ⎧ δ \in N^{n}, ⟨ z, δ ⟩ = d \sum c_{δ} x_{1}^{δ_{1}} \dots x_{n}^{δ_{n}} : c_{δ} \in R ⎭ ⎬ ⎫ .

A_{d}^{(z)} := ⎩ ⎨ ⎧ δ \in N^{n}, ⟨ z, δ ⟩ = d \sum c_{δ} x_{1}^{δ_{1}} \dots x_{n}^{δ_{n}} : c_{δ} \in R ⎭ ⎬ ⎫ .

A = d \in Z ⨁ A_{d}^{(z)}

A = d \in Z ⨁ A_{d}^{(z)}

T_{K, z} := {(λ^{z_{1}} x_{1}, \dots, λ^{z_{n}} x_{n} : λ \geq 1, x = (x_{1}, \dots, x_{n}) \in K)} .

T_{K, z} := {(λ^{z_{1}} x_{1}, \dots, λ^{z_{n}} x_{n} : λ \geq 1, x = (x_{1}, \dots, x_{n}) \in K)} .

D (q) = i \sum a_{ii} x_{i}^{2} .

D (q) = i \sum a_{ii} x_{i}^{2} .

\frac{1}{2 π i} \oint_{Γ_{i, y}} h (x, y) d x = N_{i} (y),

\frac{1}{2 π i} \oint_{Γ_{i, y}} h (x, y) d x = N_{i} (y),

\frac{1}{2 π i} \oint_{Γ_{i, y}} h (x, y) d x - \frac{1}{2 π i} \oint_{Γ_{i, y}} h (x, 0) d x \to 0

\frac{1}{2 π i} \oint_{Γ_{i, y}} h (x, y) d x - \frac{1}{2 π i} \oint_{Γ_{i, y}} h (x, 0) d x \to 0

T_{K, φ} = {(φ_{1} (λ) x_{1}, \dots, φ_{n} (λ) x_{n}) : λ \geq 1, φ_{1} (λ), \dots, φ_{n} (λ) is defined, x \in K}

T_{K, φ} = {(φ_{1} (λ) x_{1}, \dots, φ_{n} (λ) x_{n}) : λ \geq 1, φ_{1} (λ), \dots, φ_{n} (λ) is defined, x \in K}

S (L_{z} (f_{1}), \dots, L_{z} (f_{s})) \subseteq R^{n}

S (L_{z} (f_{1}), \dots, L_{z} (f_{s})) \subseteq R^{n}

\hat{L}_{z} (f_{i}) (\uplambda, x) = \frac{\sum _{⟨ α, z ⟩ = δ_{i}} c _{i, α} x ^{α} h _{1, α}}{h _{2}} .

\hat{L}_{z} (f_{i}) (\uplambda, x) = \frac{\sum _{⟨ α, z ⟩ = δ_{i}} c _{i, α} x ^{α} h _{1, α}}{h _{2}} .

χ_{i} (φ_{1} x_{1}, \dots, φ_{n} x_{n}) = j \sum a_{ij} φ_{j} x_{j} = j \sum a_{ij} x_{i}^{- 1} φ_{j} x_{j} x_{i} .

χ_{i} (φ_{1} x_{1}, \dots, φ_{n} x_{n}) = j \sum a_{ij} φ_{j} x_{j} = j \sum a_{ij} x_{i}^{- 1} φ_{j} x_{j} x_{i} .

T = {(\tilde{φ}_{1} (λ) y_{1}, \dots, \tilde{φ}_{n} (λ) y_{n}) : y \in \overline{U}, λ \geq 1} \subseteq S^{'} .

T = {(\tilde{φ}_{1} (λ) y_{1}, \dots, \tilde{φ}_{n} (λ) y_{n}) : y \in \overline{U}, λ \geq 1} \subseteq S^{'} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Optimization Algorithms Research · Polynomial and algebraic computation · Commutative Algebra and Its Applications

Full text

Higher degree S-lemma and the stability of quadratic modules

Philipp Jukic

Abstract

In this work we will investigate a certain generalization of the so called S-lemma in higher degrees. The importance of this generalization is, that it is closely related to Hilbert’s 1888 theorem about tenary quartics. In fact, if such a generalization exits, then one can state a Hilbert-like theorem, where positivity is only demanded on some semi-algebraic set. We will show that such a generalization is not possible, at least not without additional conditions. To prove this, we will use and generalize certain tools developed in [Ne]. In fact, these new tools will allow us to conclude that this generalization of the S-lemma is not possible because of geometric reasons. Furthermore, we are able to establish a link between geometric reasons and algebraic reasons. This will be accomplished within the framework of quadratic modules.

0 Introduction
1 The S-lemma
1 Preliminaries
2 Proof of the S-lemma
2 Higher degree S-lemma
1 Counterexample
2 Formulating a higher degree S-lemma
3 S4-conjecture in two variables
4 The S4-conjecture: A counterexample
5 Geometric analysis
6 A generalization of the counterexample
3 Quadratic modules and stability
1 Preliminaries
2 Stability and tentacles
3 Tentacles and the S4-conjecture
4 A non-geometric counterexample
5 Final thoughts

Chapter 0 Introduction

First of all, let us talk about the motivation of this article. In 1888 Hilbert showed in his work [Hi] that a ternary quartic $f$ , that is a 4-form in three variables, can be written as a sum of three squares of quadratic forms if and only if $f$ is non-negative on $\mathbb{R}^{3}$ . The question is: Can we find a Hilbert-like theorem in a more general setting? What does a more general setting mean in this context? Instead of considering non-negative ternary quartics, we consider a ternary quartic that needs to be non-negative on a semi-algebraic set $S\subseteq\mathbb{R}^{3}$ . Furthermore, the semi-algebraic set $S$ should also satisfy the following two conditions: First, there exists a quadratic form $g$ in three variables such that $S=S(g):=\{x\in\mathbb{R}^{3}:g(x)\geq 0\}$ . Second, the set $S$ has a non-empty interior. Of course, if $S\neq\mathbb{R}^{3}$ then $f$ can, in general, not be written as a sum of three squares of quadratic forms. In this case we need a sort of a correcting term. This correcting term should also satisfy some conditions. First, we demand that this term is of the form $-tg$ , where $t$ is a non-negative quadratic form. Second, $f-tg$ should be a non-negative ternary quartic. Thus a generalization of Hilbert’s theorem could look like the following: Let $g$ be a quadratic form such that there exists a point $x^{\prime}\in\mathbb{R}^{3}$ with $g(x^{\prime})>0$ . A ternary quartic $f$ is non-negative on the set $S(g)$ if and only if there exists a non-negative quadratic form $t$ such that $f-tg$ can be written as a sum of three squares of quadratic forms.

The interpretation of this statement is simple. If $g$ is non-negative, then this statement is equal to Hilbert’s statement. If $g$ is not non-negative, then $-tg$ measures ’how far away’ $f$ is from being a sum of three squares of quadratic forms.

Let us illustrate this statement by considering the two polynomials $g=\mathrm{x}_{1}^{2}-\mathrm{x}_{2}^{2}$ and $f=\mathrm{x}_{1}^{4}-\mathrm{x}_{2}^{4}$ . It is easy to see that $S(g)$ has a non-empty interior and that $f$ is non-negative on $S(g)$ . Furthermore, we have $f-2\mathrm{x}_{2}^{2}g=\mathrm{x}_{1}^{4}-2\mathrm{x}_{1}^{2}\mathrm{x}_{2}^{2}+\mathrm{x}_{2}^{4}=\left(\mathrm{x}_{1}^{2}-\mathrm{x}_{2}^{2}\right)^{2}$ . Thus $f-2\mathrm{x}_{2}^{2}g$ is a sum of three squares of quadratic forms: One quadratic form is given by $\mathrm{x}_{1}^{2}-\mathrm{x}_{2}^{2}$ , the other two are [math].

We are looking to clarify the following question: Can such a generalization of Hilbert’s theorem be made? It turns out that this question is closely related to the so called S-lemma resp. to a certain generalization of the S-lemma. Hence the first Chapter is all about the introduction and the proof of the S-lemma. The machinery presented in this chapter relies heavily on the work of [PT] and [Bar]. The results in the first chapter are all well known. Therefore there is nothing new in this part of the article. In this chapter and throughout the whole article no fancy knowledge will be required. One should be familiar with basic linear algebra, convex geometry, and real algebraic geometry. In the second Chapter we will formulate a generalization of the S-lemma. For the sake of simplicity we will refer to the generalization as the S4-conjecture. The importance of this S4-conjecture is the following: If the conjecture is true, then the generalization of Hilbert’s theorem is possible. If it is not true, then such a generalization is impossible. However, it turns out that it is impossible because we can find a counterexample for the S4-conjecture. Although we can find a counterexample, we will still refer to this mentioned generalization as the S4-conjecture. Next, we do some geometric investigations and finally generalize the counterexample to higher degrees. In the third and last chapter we use and generalize the machinery developed in [Ne] to further investigate the counterexample. It turns out that the tools presented in [Ne] are quite suitable in analyzing the S4-conjecture. In fact, by using these new methods we will see that the conjecture fails because of geometric reasons. Since [Ne] connects geometric properties and algebraic properties, we will see that there is an interesting link between the S4-conjecture and the stability of quadratic modules. Finally, this article will be concluded by presenting some new questions that should serve as a motivation for further studies.

We will use the following notation throughout this article:

•

$\mathbb{R}$ , $\mathbb{C}$ , $\mathbb{Z}$ , $\mathbb{N},\mathbb{N}_{0}$ : The real, complex, integer, natural numbers and the natural numbers with [math].

•

$\mathbb{R}^{n}$ , $\mathbb{C}^{n}$ , $\mathbb{Z}^{n}$ : The $0\leq n$ dimensional vector spaces $\mathbb{R}^{n}$ resp. $\mathbb{C}^{n}$ and the free $\mathbb{Z}$ -module $\mathbb{Z}^{n}$ .

•

$\langle\cdot,\cdot\rangle$ : The standard scalar product in $\mathbb{R}^{n}$ .

•

$\mathbb{K}\left[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}\right]$ : The polynomial ring over a field $\mathbb{K}$ in $n\geq 1$ variables. Polynomial variables will always be denoted by upright letters $\mathrm{x},\mathrm{y},\upbeta,\uplambda$ etc.

•

$\mathbb{K}\left[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}\right]_{d}$ : The set of all polynomials $f\in\mathbb{K}\left[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}\right]$ with $\deg(f)\leq d$ .

•

A polynomial $f\in\mathbb{R}[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}]$ is called non-negative if $\forall x\in\mathbb{R}^{n}:f(x)\geq 0$ . Negative, positive and non-positive polynomials are defined in the same manner.

•

The homogenization of a polynomial $f\in\mathbb{K}[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}]$ will be denoted by $\overline{f}$ . The dehomogenization of a homogeneous polynomial $g$ with $\tilde{g}$ .

•

$\mathbb{A}^{n}$ , $\mathbb{P}^{n}$ : The $n$ -dimensional affine space and the $n$ -dimensional projective space.

•

$\mathcal{V}(f_{1},\ldots,f_{s})$ : For polynomials $f_{1},\ldots,f_{s}\in\mathbb{K}[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}]$ the set $\mathcal{V}(f_{1},\ldots,f_{s})$ is defined to be the set of all solutions $x\in\overline{\mathbb{K}}^{n}$ ( $\overline{\mathbb{K}}$ denotes the algebraic closure of $\mathbb{K}$ ) of the polynomial equalities $f_{1}(x)=0,\ldots,f_{s}(x)=0$ . If $\mathbb{K}=\mathbb{R}$ then we will fix $\mathbb{C}$ as the algebraic closure of $\mathbb{R}$ . If $f_{1},\ldots,f_{s}$ are homogeneous, then we can interpret $\mathcal{V}(f_{1},\ldots,f_{s})$ as the set of all solutions in the projective space $\mathbb{P}^{n}$ .

•

Let $V$ be a variety defined over a field $\mathbb{K}$ and $\mathbb{L}|\mathbb{K}$ an algebraic extension of $\mathbb{K}$ . The $\mathbb{L}$ -rational points of $V$ are denoted by $V(\mathbb{L})$ .

•

$S(f_{1},\ldots,f_{s})$ : The basic closed semi-algebraic set

[TABLE]

defined by the polynomials $f_{1},\ldots,f_{s}\in\mathbb{R}[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}]$ .

•

$\mathrm{GL}_{n}$ , $\mathrm{O}_{n}$ : The general linear group over $\mathbb{R}$ and its orthogonal subgroup over $\mathbb{R}$ .

•

Let $X$ be a topological space and $A$ a subset. The interior of $A$ is denoted with $\mathrm{int}(A)$ and the closure with $\overline{A}$ .

Chapter 1 The S-lemma

In this chapter we will formulate and prove the so called S-lemma. Before doing this, however, it shall be noted that the S-lemma has many variations in the literature. While all versions are in fact equivalent, we will use a version that is closer to real algebraic geometry. Thus the original statement of the S-lemma made by Yakubovich, that can be found in the work of Polik and Terlaky [PT], will not be used. In the sense of real algebraic geometry the S-lemma is formulated in the following way:

Theorem 0.1.

S-lemma: Let $f,g$ be polynomials in $\mathbb{R}[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}]_{2}$ . If there exists a point $x^{\prime}\in\mathbb{R}^{n}$ with $g(x^{\prime})>0$ , then the following statements are equivalent:

(a)

The inclusion $S(g)\subseteq S(f)$ holds. 2. (b)

There exists a non-negative real number $t$ such that $f(x)-tg(x)\geq 0$ for all $x\in\mathbb{R}^{n}$

The aim of this chapter is to provide a proof for Theorem 0.1. Simultaneously, it should serve as an introduction in what is to come later. Before we are ready to prove Theorem 0.1, we need some preparatory results, which will be bundled together in the following section.

1 Preliminaries

First of all, it is worth mentioning that one could prove Theorem 0.1 directly, without any notable machinery. One such proof can be found in [PT, pp. 376-378]. The disadvantage is, however, that it needs quite a lot of computations. As already pointed out, we will use a different approach. For the proof of theorem 0.1 we will need the following definitions, lemmas, and propositions:

Definition 1.1.

Let $f=\sum_{i=1}^{n}\sum_{j=1}^{n}a_{ij}\mathrm{x}_{i}\mathrm{x}_{j}$ be a quadratic form in $\mathbb{R}[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}]$ , where all coefficients $a_{ij}$ of $f$ lie in $\mathbb{R}$ . The matrix that corresponds to $f$ is defined to be the symmetric matrix $A_{f}=\left(\frac{1}{2}(a_{ij}+a_{ji})\right)_{1\leq i\leq n,1\leq j\leq n}$ . If $f$ is an arbitrary form of degree $d\geq 0$ in $\mathbb{R}[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}]$ , then $f$ is said to be positive semi-definite resp. positive definite if $\forall x\in\mathbb{R}^{n}:f(x)\geq 0$ resp. $\forall x\in\mathbb{R}^{n}\backslash\{0\}:f(x)>0$ .

Remark 1.2.

A quadratic form $f$ is positive (semi-) definite if and only if the corresponding matrix $A_{f}$ is positive (semi-) definite.

In the following we will assume that the coefficients of a quadratic form in $\mathbb{R}[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}]$ lie in $\mathbb{R}$ .

Definition 1.3.

Let $P_{d,n}$ be the set of all forms of even degree $d>0$ in $\mathbb{R}[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}]$ . With $P_{d,n}^{+}$ we denote the subset of $P_{d,n}$ that consist of all positive semi-definite forms in $P_{d,n}$ .

Definition 1.4.

Let $V$ be finite dimensional $\mathbb{R}$ -vector space. A (convex) cone $C\subseteq V$ is a subset of $V$ that satisfies the following two conditions:

•

The set $C$ is not empty.

•

For any real number $\lambda\geq 0$ and any element $g\in C$ we have $\lambda g\in C$ .

We say that a cone $C\subseteq V$ is pointed, if the identity $C\cap-C=\{0\}$ holds.

Remark 1.5.

One can easily see that $P_{2,n}$ is a finite dimensional $\mathbb{R}$ -vector space. To be more precise, there is a vector-space isomorphism $P_{2,n}\xrightarrow{\,\smash{\raisebox{-2.15277pt}{$ \scriptstyle\sim $}}\,}\mathbb{R}^{\frac{n(n+1)}{2}}$ . For $f,g\in P_{2,n}$ the dot product on $P_{2,n}$ is defined by $\langle f,g\rangle=\mathrm{tr}(A_{f}A_{g})$ , which is just the pullback of the dot product in $\mathbb{R}^{\frac{n(n+1)}{2}}$ . Thus $P_{2,n}$ is an euclidean space. The same is also true for $P_{d,n}$ , where $d\geq 0$ .

Finally, it should be noted that this vector space $P_{2,n}$ has a more or less surprising upcoming in algebraic geometry: See [Sha I, Example 3, p. 44] about determinantal varieties.

Definition 1.6.

Let $V$ be a finite dimensional real vector space and $C\subseteq V$ a convex subset. A convex subset $F$ of $C$ is called a face of $C$ if the following statement holds: Suppose $u$ and $v$ are two points in $C$ . If there exists a $\lambda\in(0,1)$ such that $\lambda u+(1-\lambda)v\in F$ , then $u$ and $v$ lie already in $F$ .

A face $F$ of $C$ is called proper if $\varnothing\subsetneqq F\subsetneqq C$ holds. If there is a point $u\in C$ such that $\{u\}$ is a face of $C$ , then the point $u$ is called an extremal point. With $\mathrm{ex}(C)$ we will denote the set of all extremal points of $C$ .

Definition 1.7.

Let $V$ be a finite dimensional real vector space and $C\subseteq V$ a convex subset. Let $H$ be a hyper plane given by $H=\left\{x\in V:\ell(x)=0\right\}$ , where $\ell:V\rightarrow\mathbb{R}$ is a linear form. Set $\overline{H}_{+}=\left\{x\in V:\ell(x)\geq 0\right\}$ and $\overline{H}_{-}=\left\{x\in V:\ell(x)\leq 0\right\}$ . A face $F$ of $C$ is called exposed if there exists a linear form $\ell:V\rightarrow\mathbb{R}$ such that $C$ is contained in $\overline{H}_{+}$ or $\overline{H}_{-}$ and $F=C\cap H$ .

Definition 1.8.

Let $V$ be a finite dimensional real vector space and $S$ an arbitrary subset of $V$ . The affine hull $\mathrm{aff}(S)$ of $S$ is defined by

[TABLE]

Definition 1.9.

Let $V$ be a finite dimensional real vector space. The dimension of a convex set $C\subseteq V$ is defined to be the dimension of its affine hull. In short $\dim(C)=\dim(\mathrm{aff}(C))$ .

Definition 1.10.

Let $V$ be a finite dimensional real vector space and $C$ a convex subset of $V$ . Let $B_{\varepsilon}(x)$ denote the open ball in $x$ with radius $\varepsilon>0$ . The relative interior $\mathrm{relint}(C)$ of $C$ in $V$ is defined by

[TABLE]

Lemma 1.11.

(a): For every $x\in\mathbb{R}^{n}\backslash\{0\}$ the symmetric $n\times n$ -matrix $xx^{T}$ matrix is positive semi-definite of rank $1$ .

(b): Let $A$ be a positive semi-definite matrix. The rank of $A$ is the smallest natural number $r$ such that $A$ can be written as $A=\sum_{i=1}^{r}x_{i}x_{i}^{T}$ for some $x_{1},\ldots,x_{r}\in\mathbb{R}^{n}$ .

Proof: (a): Trivial.

(b): First of all, note that statement (b) is independent with respect to transformations $S^{T}AS$ , where $S\in\mathrm{O}_{n}$ . Indeed, set $D=S^{T}AS$ and assume that $A=\sum_{i=1}^{r}x_{i}x_{i}^{T}$ , where $r$ is minimal. Then we have $D=S^{T}\sum_{i=1}^{r}x_{i}x_{i}^{T}S=\sum_{i=1}^{r}S^{T}x_{i}x_{i}^{T}S=\sum_{i=1}^{r}S^{T}x_{i}(S^{T}x_{i})^{T}$ . It is clear that $r$ is also the minimal length of the sum for $D$ : Otherwise, $A$ could be written as a sum of smaller length, which would be a contradiction. Choose $S\in\mathrm{O}_{n}$ such that $D$ is a diagonal matrix. The diagonal of $D$ consists of the eigenvalues of $A$ , which are all non-negative. Thus it is easy to see that $D$ can be written as a sum $\sum_{i=1}^{r}x_{i}x_{i}^{T}$ for some $x_{1},\ldots,x_{n}\in\mathbb{R}^{n}$ and $r=\mathrm{rk}(A)$ . It remains to verify that $r$ is minimal. But this follows from $\mathrm{rk}\left(\sum_{i=1}^{r}x_{i}x_{i}^{T}\right)\leq r\mathrm{rk}(x_{i}x_{i}^{T})=r$ . $\boxempty$

Proposition 1.12.

Let $d\geq 0$ be an even number.

(a)

The set $P_{d,n}^{+}$ is a closed cone in $P_{d,n}$ . 2. (b)

The cone $P_{d,n}^{+}$ is pointed. 3. (c)

Let $L\subseteq\mathbb{R}^{n}$ be a subspace and $F_{L}:=\left\{f\in P_{2,n}^{+}:\forall x\in L:f(x)=0\right\}$ . The set $F_{L}$ is an exposed face with $\dim(F_{L})=\frac{r(r+1)}{2}$ and $r=n-\dim(L)$ . If $f\in P_{2,n}^{+}$ and $L=\mathrm{ker}\left(A_{f}\right)$ , then $f$ is in $\mathrm{relint}(F_{L})$ .

Proof: (a): We will show that $P:=P_{d,n}\backslash P_{d,n}^{+}$ is open. Take an element $f\in P$ . Since $f\in P$ , there exists a point $x\in\mathbb{R}^{n}$ such that $f(x)<0$ . Consider the evaluation homomorphism $\mathrm{ev}_{x}:P_{d,n}\rightarrow\mathbb{R},p\mapsto p(x)$ . Furthermore, $P_{d,n}$ is, as already stated in Remark 1.5, an euclidean space. Thus $\mathrm{ev}_{x}$ is continuous. Let $U\subseteq\mathbb{R}$ be an open neighborhood of $f(x)$ such that all elements of $U$ are negative real numbers. The set $U^{\prime}=\mathrm{ev}_{x}^{-1}\left(U\right)$ is an open neighborhood of $f$ that satisfies $U^{\prime}\subseteq P$ . Thus we proved that $P$ is open resp. that $P_{d,n}^{+}$ is closed. The second assertion that $P_{d,n}^{+}$ is a cone is trivial.

(b): Trivial.

(c): In the following we will just omit the trivial parts of the proof.111Pay attention to statements that begin with ’It is easy to see’. Let us begin with the easiest part, verifying that $\dim(F_{L})=\frac{r(r+1)}{2}$ , where $r=n-\dim(L)$ . This can be done by proving $\mathrm{aff}(F_{L})\cong\mathbb{R}^{\frac{r(r+1)}{2}}$ . Let $L^{\perp}$ be the orthogonal complement of $L$ in $\mathbb{R}^{n}$ . Without loss of generality we can identify $L$ with $\mathbb{R}^{n-r}$ and $L^{\perp}$ with $\mathbb{R}^{r}$ . Consider the cone $P_{2,r}^{+}$ . It is easy to see that $F_{L}$ can be identified with $P_{2,r}^{+}$ . Furthermore, $P_{2,r}^{+}$ has a non-empty interior. A well known result in convex geometry states that a cone with non-empty interior is full. This means that $\mathrm{aff}\left(P_{2,r}^{+}\right)=P_{2,r}$ . Thus $\mathrm{aff}\left(P_{2,r}^{+}\right)\cong\mathbb{R}^{\frac{r(r+1)}{2}}$ . Identifying $\mathrm{aff}(F_{L})$ with $\mathrm{aff}\left(P_{2,r}^{+}\right)$ proves the assertion. Next, we show that $F_{L}$ is an exposed face. A quadratic form $h_{1}\in P_{2,n-r}^{+}$ and a quadratic form $h_{2}\in P_{2,r}^{+}$ give rise to a quadratic form $h\in P_{2,n}^{+}$ in an obvious manner. In fact, the corresponding matrix $A_{h}$ of $h$ is given by $A_{h}=\left(\begin{array}[]{cc}A_{h_{1}}&0\\ 0&A_{h_{2}}\end{array}\right)$ . Fix $h_{1}=\mathrm{x}_{1}^{2}+\cdots+\mathrm{x}_{n-r}^{2}$ and define $\tilde{A}_{h_{1}}=\left(\begin{array}[]{cc}A_{h_{1}}&0\\ 0&0\end{array}\right)$ , $\tilde{A}_{h_{2}}=\left(\begin{array}[]{cc}0&0\\ 0&A_{h_{2}}\end{array}\right)$ . Then we can identify $F_{L}$ with $\left\{h\in P_{2,n}^{+}:\exists h_{2}\in P_{2,r}^{+}:A_{h}=\tilde{A}_{h_{2}}\right\}$ . It is easy to see that $F_{L}$ consists of all quadratic forms $h\in P_{2,n}^{+}$ that satisfy $\mathrm{tr}\left(A_{h}\tilde{A}_{h_{1}}\right)=0$ . Thus it is convenient to consider the linear form $\ell:P_{2,n}\rightarrow\mathbb{R},p\mapsto\mathrm{tr}\left(A_{p}\tilde{A}_{h_{1}}\right)$ and the hyper plane $H=\left\{p\in P_{2,n}:\ell(p)=0\right\}$ . So far, we know that $F_{L}=P_{2,n}^{+}\cap H$ . Finally, we just have to deal with the inclusion $P_{2,n}^{+}\subseteq\overline{H}_{+}$ . Let $h=\sum_{i,j}a_{ij}\mathrm{x}_{i}\mathrm{x}_{j}$ be a quadratic form in $P_{2,n}^{+}$ . Since $h$ is non-negative, the coefficients $a_{ii}$ must be non-negative for all $i=1,\ldots,n$ . Thus the diagonal of $A_{h}$ consists of non-negative real numbers. This implies $\ell(h)\geq 0$ . Altogether we proved that $F_{L}$ is an exposed face. Let us deal with the last statement in (c). Suppose $f\in P_{2,n}^{+}$ and $L=\mathrm{ker}(A_{f})$ . As before set $r=n-\dim(L)$ . Again we identify $F_{L}$ with $P_{2,r}^{+}$ and interpret $f$ as a quadratic form in $P_{2,r}^{+}$ . Then $f$ does only vanish at the origin in $\mathbb{R}^{r}$ . Hence $f$ lies in the interior of the cone $P_{2,r}^{+}$ resp. in $\mathrm{relint}\left(F_{L}\right)$ , which proves the assertion. $\boxempty$

Lemma 1.13.

Let $C\subset\mathbb{R}^{n}$ be a non-empty closed convex set which does not contain any straight line. Then $\mathrm{ex}(C)$ is a non-empty set.

Proof: See [Bar, Lemma 3.5, p. 53]. $\boxempty$

Because the next result is very important, it will be proven, although there exists a suitable reference.

Proposition 1.14.

Let $L$ be an affine subspace of $P_{2,n}$ such that $S=L\cap P_{2,n}^{+}$ is not empty. Suppose the inequality $\mathrm{codim}_{P_{2,n}}(L)<\frac{(r+2)(r+1)}{2}$ holds for some $r\in\mathbb{N}_{0}$ . Then there exists a quadratic form $f\in S$ such that the rank of $A_{f}$ is bounded by $r$ .

Proof[Bar, Proposition 13.1, p. 83]: According to Proposition 1.12 the cone $P_{2,n}^{+}$ is pointed and closed. This means that there is no way that the cone $P_{2,n}^{+}$ contains a straight line. If $P_{2,n}^{+}$ does not contain such a line so does not the subset $S$ of $P_{2,n}^{+}$ . By using Lemma 1.13 we get $\mathrm{ex}(S)\neq\varnothing$ . Choose an arbitrary $f\in S$ and let $A_{f}$ be its corresponding matrix of rank $m$ . Consider $W=\mathrm{ker}(A_{f})$ and the exposed face $F_{W}$ (Proposition 1.12). We want to show that $f$ is an element of the set $\mathrm{relint}(L\cap F_{W})$ . Since $f$ is an element of $\mathrm{relint}(F_{W})$ and $\mathrm{relint}(L)$ , it is enough to verify the inclusion $\mathrm{relint}(F_{W})\cap\mathrm{relint}(L)\subset\mathrm{relint}(L\cap F_{W})$ . Take a point $g\in\mathrm{relint}(F_{W})\cap\mathrm{relint}(L)$ . There exist $\varepsilon_{1},\varepsilon_{2}>0$ such that $B_{\varepsilon_{1}}(g)\cap\mathrm{aff}(F_{W})\subset F_{W}$ and $B_{\varepsilon_{2}}(g)\cap\mathrm{aff}(L)\subset L$ . By setting $\varepsilon=\min\{\varepsilon_{1},\varepsilon_{2}\}$ we get $B_{\varepsilon}(g)\cap\mathrm{aff}(L)\cap\mathrm{aff}(F_{W})\subset L\cap F_{W}$ . Since $\mathrm{aff}(L\cap F_{W})\subset\mathrm{aff}(L)\cap\mathrm{aff}(F_{W})$ , we have $B_{\varepsilon}(g)\cap\mathrm{aff}(L\cap F_{W})\subset L\cap F_{W}$ which implies the assertion.

We know that $f$ lies in both sets, $\mathrm{relint}(L\cap F_{W})$ and $\mathrm{ex}(L\cap F_{W})$ . This can only work if $\dim(L\cap F_{W})=0$ holds: Suppose $\dim(L\cap F_{W})>0$ . For every $\varepsilon>0$ we can find two different points $\delta_{1},\delta_{2}\in B_{\varepsilon}(f)$ such that $\delta_{1},\delta_{2}\neq f$ and $\delta_{1},\delta_{2},f\in\mathrm{aff}(L\cap F_{W})$ . Choose $\varepsilon>0$ such that $B_{\varepsilon}(f)\cap\mathrm{aff}(L\cap F_{W})\subset L\cap F_{W}$ holds. Now, $f$ is some point on the line segment that connects the two points $\delta_{1}$ and $\delta_{2}$ . But both points lie in $L\cap F_{W}$ . This contradicts the fact that $f$ is an extremal point of $L\cap F_{W}$ .

Since $\dim(L\cap F_{W})=0$ , we get $\dim(L)+\dim(F_{W})=\dim(L+F_{W})\leq\dim(P_{2,n})$ . This and Proposition 1.12 imply $\frac{m(m+1)}{2}=\dim(F_{W})\leq\dim(P_{2,n})-\dim(L)=\mathrm{codim}_{P_{2,n}}(L)<\frac{(r+1)(r+2)}{2}$ and thus $m<r+1$ . $\boxempty$

Corollary 1.15.

Let $f_{1},f_{2}\in P_{2,n}$ be two quadratic forms. The two equations $f_{1}(x)=\alpha_{1}$ and $f_{2}(x)=\alpha_{2}$ have a simultaneous solution $x\in\mathbb{R}^{n}$ if and only if there exists a quadratic form $q\in P_{2,n}^{+}$ such that $\mathrm{tr}\left(A_{f_{1}}A_{q}\right)=\alpha_{1}$ and $\mathrm{tr}\left(A_{f_{2}}A_{q}\right)=\alpha_{2}$ .

Proof[Bar, Corollary 13.2, p. 84]: $\Rightarrow$ : Choose $x\in\mathbb{R}^{n}$ such that $f_{1}(x)=\alpha_{1}$ and $f_{2}(x)=\alpha_{2}$ . Define $X=xx^{T}$ . Then $\mathrm{tr}\left(A_{f_{i}}X\right)=\mathrm{tr}\left(A_{f_{i}}xx^{T}\right)=x^{T}A_{f_{i}}x=f_{i}(x)=\alpha_{i}$ holds for $i=1,2$ .

$\Leftarrow$ : The map $\ell_{i}:P_{2,n}\rightarrow\mathbb{R},p\mapsto\mathrm{tr}\left(A_{f_{i}}A_{p}\right)$ is obviously a vector space homomorphism for $i=1,2$ . It is easy to see that $\ell_{i}^{-1}(\alpha_{i})=\mathrm{ker}(\ell_{i})+q$ holds for $i=1,2$ . Hence $\dim(\ell_{i}^{-1}(\alpha_{i}))=\dim(P_{2,n})-1$ for $i=1,2$ . This implies $\mathrm{codim}_{P_{2,n}}(L)<3=\frac{(1+1)(2+1)}{2}$ , where $L=\ell_{1}^{-1}(\alpha_{1})\cap\ell_{2}^{-1}(\alpha_{2})$ . According to Proposition 1.14 we can find a quadratic form $h$ in $L\cap P_{2,n}^{+}$ such that $\mathrm{tr}\left(A_{f_{i}}A_{h}\right)=\alpha_{i}$ and $\mathrm{rk}(A_{h})\leq 1$ for $i=1,2$ . Now, Lemma 1.11 tells us that there exists a point $x\in\mathbb{R}^{n}$ with $A_{h}=xx^{T}$ . Substituting $A_{h}$ through $xx^{T}$ in $\mathrm{tr}\left(A_{f_{i}}A_{h}\right)$ results in $f_{i}(x)=\alpha_{i}$ for $i=1,2$ . $\boxempty$

Corollary 1.16.

Consider the quadratic forms $f_{1},f_{2}\in P_{2,n}$ and the map $\varphi:\mathbb{R}^{n}\rightarrow\mathbb{R}^{2},x\mapsto(f_{1}(x),f_{2}(x))^{T}$ . Then the set $M=\varphi(\mathbb{R}^{n})$ is a convex subset of $\mathbb{R}^{2}$ .

Proof[Bar, Corollary 13.3, p. 84]: Consider the map

[TABLE]

Since $\psi$ is linear and $P_{2,n}^{+}$ is a cone (Proposition 1.12), the image of $P_{2,n}^{+}$ under $\psi$ is a cone. Finally, Corollary 1.15 implies the equality $\psi(P_{2,n}^{+})=M$ . $\boxempty$

Proposition 1.17.

Let $A$ and $B$ be two non-empty convex subsets of $\mathbb{R}^{n}$ such that $A\cap B=\varnothing$ . Then there exists a linear form $\ell:\mathbb{R}^{n}\rightarrow\mathbb{R}$ such that $\ell(x)\leq\ell(y)$ for all $x\in A$ and $y\in B$ .

Proof: See [Bar, Proposition 1.2, p. 106]. $\boxempty$

2 Proof of the S-lemma

Before actually proving theorem 0.1 we will consider a special case, from which the Theorem 0.1 will easily follow.

Proposition 2.1.

Homogeneous S-lemma: Let $f,g$ be quadratic forms in

$\mathbb{R}[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}]$ . If there exists a point $x^{\prime}\in\mathbb{R}^{n}$ with $g(x^{\prime})>0$ , then following statements are equivalent:

(a)

The inclusion $S(g)\subseteq S(f)$ holds. 2. (b)

There exists a non-negative real number $t\geq 0$ such that $f(x)-tg(x)\geq 0$ for all $x\in\mathbb{R}^{n}$

Proof[PT, Proposition 2.3, p. 377]: (b) $\Rightarrow$ (a): This implication is quite trivial: Suppose we could find a non-negative real number $t$ such that $f(x)-tg(x)\geq 0$ holds for all $x\in\mathbb{R}^{n}$ and a point $y\in\mathbb{R}^{n}$ with $g(y)\geq 0$ and $f(y)<0$ . It is clear that $f(y)-tg(y)$ would be negative, contradicting $f(x)-tg(x)\geq 0$ for all $x\in\mathbb{R}^{n}$ .

(a) $\Rightarrow$ (b): According to Corollary 1.16 the set $M=\left\{(f(x),g(x)):x\in\mathbb{R}^{n}\right\}$ is a convex subset of $\mathbb{R}^{2}$ . Define $C=\left\{(u_{1},u_{2}):u_{1}<0,u_{2}\geq 0\right\}$ . Because $S(g)\subseteq S(f)$ holds, the intersection between $M$ and the convex set $C$ is empty. According to Proposition 1.17 there exists a linear form $\ell:\mathbb{R}^{2}\rightarrow\mathbb{R}$ such that $\ell(x)\leq\ell(y)$ for all $x\in M$ and $y\in C$ . Since $0\in M$ and $0\in\overline{C}$ , we have $\ell(x)\leq 0\leq\ell(y)$ for all $x\in M$ and $y\in C$ . Choose $\alpha_{1},\alpha_{2}\in\mathbb{R}$ such that $\ell=\alpha_{1}\mathrm{x}_{1}+\alpha_{2}\mathrm{x}_{2}$ . Consider the two statements

[TABLE]

If we take the point $(-1,0)\in C$ and evaluate $\ell$ at $(-1,0)$ , we get $\ell(-1,0)=-\alpha_{1}\geq 0$ . Thus $\alpha_{1}$ must be a non-positive real number. For an arbitrary $\varepsilon>0$ the point $(-\varepsilon,1)$ lies in $C$ . Evaluating $\ell$ at the point $(-\varepsilon,1)$ leads to $\ell(-\varepsilon,-1)=-\alpha_{1}\varepsilon+\alpha_{2}\geq 0$ . If $\alpha_{2}\neq 0$ then $\alpha_{2}$ must be positive. Otherwise, we could choose $\varepsilon>0$ so small that the inequality $-\alpha_{1}\varepsilon+\alpha_{2}<0$ would hold. We know that there exists a point $x^{\prime}\in\mathbb{R}^{n}$ such that $g(x^{\prime})>0$ . From the inequality $\alpha_{1}f(x^{\prime})+\alpha_{2}g(x^{\prime})\leq 0$ we conclude that $\alpha_{1}$ cannot vanish. Set $\alpha=\frac{\alpha_{2}}{\alpha_{1}}$ . Without loss generality we can assume that $\alpha\neq 0$ . Otherwise, $f$ would be non-negative and therefore we could set $t=0$ . Since $\alpha<0$ , we have the inequality $\alpha\alpha_{1}f(x)+\alpha\alpha_{2}g(x)=\alpha_{2}(f(x)+\alpha g(x))\geq 0$ for all $x\in\mathbb{R}^{n}$ . Hence $f+\alpha g$ is a non-negative polynomial. Set $t=-\alpha$ and the assertion follows. $\boxempty$

Proof of theorem 0.1: (b) $\Rightarrow$ (a): This implication is trivial.

(a) $\Rightarrow$ (b): Without loss of generality we can assume that $x^{\prime}=0$ . Set $\mathrm{x}=(\mathrm{x}_{1},\ldots,\mathrm{x}_{n})$ . Let

[TABLE]

be the decompositions of $f$ and $g$ with respect to the standard grading in $\mathbb{R}[\mathrm{x}]$ , where $f_{1}$ resp. $g_{1}$ denotes the component of $f$ resp. $g$ that has degree $2$ , $f_{2}$ resp. $g_{2}$ denotes the component of $f$ resp. $g$ that has degree $1$ and finally, $c_{f}$ resp. $c_{g}$ denotes the constant component. Let $\overline{f},\overline{g}\in\mathbb{R}[\mathrm{x},\mathrm{y}]$ be given by:

[TABLE]

In fact, we just need to prove that $\overline{f}$ and $\overline{g}$ satisfy the condition (a) in Proposition 2.1: Since then, there would exist a non-negative real number $t$ such that $\overline{f}(x,y)-t\overline{g}(x,y)\geq 0$ for all $(x,y)\in\mathbb{R}^{n+1}$ and the assertion would follow from the dehomogenization of $\overline{f}(\mathrm{x},\mathrm{y})-t\overline{g}(\mathrm{x},\mathrm{y})$ .

Suppose we could find a point $(x,y)\in\mathbb{R}^{n+1}$ with $y\neq 0$ such that

[TABLE]

Then we would get a contradiction, since the two identities

[TABLE]

hold. Suppose we could find a point $(x,0)\in\mathbb{R}^{n+1}$ with

[TABLE]

This two inequalities imply that $f_{1}(x)<0$ and $g_{1}(x)\geq 0$ . Consider $f(\uplambda x)=\uplambda^{2}f_{1}(x)+\uplambda f_{2}(x)+c_{f}$ as a polynomial in the new variable $\uplambda$ . Since $f_{1}(x)<0$ , we get that $\lambda^{2}f_{1}(x)+\lambda f_{2}(x)+c_{f}$ converges to $-\infty$ as $|\lambda|$ converges to $\infty$ .

Acknowledging that $c_{g}>0$ , $g_{1}(x)\geq 0$ and treating $g(\uplambda x)=\uplambda^{2}g_{1}(x)+\uplambda g_{2}(x)+c_{g}$ as a polynomial in $\uplambda$ , leads us to the following distinctions

•

$g(\lambda x)\rightarrow\infty$ for $\lambda\rightarrow\infty$ if $g_{1}(x)>0$

•

$g(\lambda x)\rightarrow\infty$ for $\lambda\rightarrow\mathrm{sign}(g_{2}(x))\infty$ if $g_{1}(x)=0$ and $g_{2}(x)\neq 0$ .

This proves that no matter what, we can always find a suitable $\lambda\in\mathbb{R}$ such that $f(\lambda x)<0$ and $g(\lambda x)\geq 0$ are satisfied, which clearly contradicts our assumption. $\boxempty$

Chapter 2 Higher degree S-lemma

1 Counterexample

Let us revisit Proposition 2.1. The question here is, if there is a generalization of the mentioned proposition in higher degrees. To be more precise, can we give up the restriction that the degree of the two homogeneous polynomials in Proposition 2.1 is bounded by $2$ ? The answer is no, as the following simple example illustrates it.

Example 1.1.

Consider $g=\mathrm{x}_{1}^{2}-\mathrm{x}_{2}^{2}$ and $f=\mathrm{x}_{1}^{2}(\mathrm{x}_{1}^{2}-\mathrm{x}_{2}^{2})$ . Then there is no non-negative real number $t$ such that $f(x)-tg(x)\geq 0$ holds for all $x\in\mathbb{R}^{2}$ . It is easy to check that $g$ and $f$ satisfy the prerequisites and the condition (a) of Proposition 2.1. Suppose we could find a non-negative real number $t$ such that $f(x)-tg(x)\geq 0$ holds for all $x\in\mathbb{R}^{2}$ . Let $(k_{n})_{n}\subset\mathbb{R}$ be a sequence such that $k_{n}\rightarrow 0$ for $n\rightarrow\infty$ . The inequality $f(k_{n},0)-tg(k_{n},0)\geq 0$ implies $f(k_{n},0)\geq tg(k_{n},0)$ for all $n\in\mathbb{N}$ . But this is not true. Choose a natural number $N\in\mathbb{N}$ such that $k_{N}^{2}<t$ . Then $tg(k_{N},0)$ would be greater than $f(k_{N},0)$ , which would contradict our assumption.

{window}

[3, r, ,]

Remark 1.2.

Since we cannot generalize Proposition 2.1 to higher degrees (Example 1.1), the method we used to prove Proposition 2.1 should also fail in higher degrees. The interesting question is, where does it fail? In case of Lemma 1.1 it turns out that the set $M=\left\{(f(x),g(x)):x\in\mathbb{R}^{2}\right\}$ is not a convex subset of $\mathbb{R}^{2}$ anymore. We reprise that $g=\mathrm{x}_{1}^{2}-\mathrm{x}_{2}^{2}$ and $f=\mathrm{x}_{1}^{2}(\mathrm{x}_{1}^{2}-\mathrm{x}_{2}^{2})$ . Consider the half-line $H=\left\{\lambda(-1,1):\lambda\in\mathbb{R}_{\geq 0}\right\}$ . Since $S(g)\subseteq S(f)$ holds, the intersection $M\cap H$ can only consist of one point and this point is the origin $(0,0)$ . On the other hand, we can find two points $(x_{1},x_{2}),(y_{1},y_{2})\in M$ such that $x_{1}>0$ , $x_{2}>0$ , $y_{1}<0$ , $y_{2}<0$ and the line segment $L$ connecting the points $(x_{1},x_{2})$ , $(y_{1},y_{2})$ does not go through the origin. This means $L$ intersects $H$ in some other point than the origin. But this intersection point cannot be in $M$ . Thus $M$ is not convex.

{window}

[3, r, ,]

Example 1.3.

Consider the two homogeneous polynomials $q=\mathrm{x}_{1}^{3}\mathrm{x}_{2}-\mathrm{x}_{1}^{2}\mathrm{x}_{2}^{2}$ and $p=\mathrm{x}_{1}^{4}-\mathrm{x}_{1}\mathrm{x}_{2}^{3}$ . We want to show that $M=\left\{(p(x),q(x)):x\in\mathbb{R}^{2}\right\}$ is not convex, giving an example where $M$ fails to be convex even if both homogeneous polynomials have the same degree. Set $H=\left\{(0,x_{2}):x_{2}<0\right\}$ . There are points $(x_{1},x_{2}),(y_{1},y_{2})\in M$ with $x_{1}<0,x_{2}<0,y_{1}>0,y_{2}<0$ such that the segment line $L$ , connecting both points, doesn’t go through the origin. So the intersection point of $L$ and $H$ is not in $M$ . Thus $M$ is not convex.

2 Formulating a higher degree S-lemma

In the past section we have seen that there is no way to increase the degree in the homogeneous S-lemma (Proposition 2.1) and keep all other statement as they are. There are two ways one can proceed. One way would be to left the conditions (a) and (b) in the homogeneous S-lemma unchanged, and find some additional statements that might be plugged in into the homogeneous S-lemma such that the S-lemma remains true. Results in this sense can be found in [ZS].

But there is another way. Instead of keeping the statements (a) and (b) as they are, we could simply modify the condition (b) by giving up that $t$ should be a non-negative real number. We demand that $t$ should be a non-negative homogeneous polynomial. The advantage is that we do not need to make up some new statements. The disadvantage, however, is that we have less information about $t$ . This philosophy motivates to formulate

Conjecture 2.1.

S4-Conjecture: Let $f$ be a tenary quartic, that is a $4$ -form in $\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2},\mathrm{x}_{3}]$ , and let $g$ be a quadratic form in $\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2},\mathrm{x}_{3}]$ . Suppose there exists a point $x^{\prime}\in\mathbb{R}^{3}$ such that $g(x^{\prime})>0$ . Then the following statements are equivalent:

(a)

The inclusion $S(g)\subseteq S(f)$ holds. 2. (b)

There exists a non-negative homogeneous polynomial $t\in\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2},\mathrm{x}_{3}]_{2}$ such that $f(x)-t(x)g(x)\geq 0$ for all $x\in\mathbb{R}^{3}$ .

The statement 2.1 is originally a question posted in Mathoverflow and was answered by the author. See [MathOver].

Remark 2.2.

The homogeneous polynomial $t$ in the S4-conjecture is either a quadratic form or the zero-polynomial, which can be interpreted as a homogeneous polynomial of negative-infinite degree. The case $t=0$ can only occur if $f$ is a non-negative form. If $f$ is not a non-negative form, then $t$ must be of degree 2. This becomes clear if we take a point $x\notin S(f)$ . Then we get $f(\lambda x)-t(\lambda x)g(\lambda x)=\lambda^{4}f(x)-\lambda^{2}t(x)g(x)\geq 0$ for all $\lambda\geq 0$ (). Note that if $t$ is not of degree $2$ , then it must be a constant. But no matter what sign the constant $t$ has, we will always get $\lambda^{4}f(x)-\lambda^{2}t(x)g(x)\rightarrow-\infty$ for $\lambda\rightarrow\infty$ , which contradicts ().

Since $g$ is a quadratic form, we can take a look at the signature of $g$ . If the S4-conjecture fails, then the next obvious question is: Is there a counterexample for all non-trivial signatures $-1,0$ and $1$ . Note that if the signature of $g$ is $-2$ resp. $2$ , then $g$ is non-positive resp. non-negative. But it is obvious how to deal with the S4-conjecture if $g$ is non-positive or non-negative.

3 S4-conjecture in two variables

In this section we are going to prove the S4-conjecture in just two variables aka.

Theorem 3.1.

S4-conjecture in 2-variables: Let $f$ be a $4$ -form in $\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]$ and let $g$ be a quadratic form in $\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]$ . Suppose there exists a point $x^{\prime}\in\mathbb{R}^{2}$ such that $g(x^{\prime})>0$ . Then the following statements are equivalent:

(a)

The inclusion $S(g)\subseteq S(f)$ holds. 2. (b)

There exists a non-negative homogeneous polynomial $t\in\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]_{2}$ such that $f(x)-t(x)g(x)\geq 0$ for all $x\in\mathbb{R}^{2}$ .

Remark 3.2.

Theorem 3.1 is not true, without the condition that there exists a point $x^{\prime}\in\mathbb{R}^{2}$ such that $g(x^{\prime})>0$ ! Consider for example the polynomials $g=-\mathrm{x}_{1}^{2}$ and $f=\mathrm{x}_{1}^{4}+\mathrm{x}_{1}^{3}\mathrm{x}_{2}+\mathrm{x}_{1}\mathrm{x}_{2}^{3}$ . Obviously $g$ is a non-positive quadratic form and the equation $g(x_{1},x_{2})=0$ holds if and only if $x_{1}=0$ . But for $x_{1}=0$ we have $f(0,x_{2})=0$ , which implies $S(g)\subseteq S(f)$ . Take a homogeneous polynomial $t=a_{1}\mathrm{x}_{1}^{2}+a_{2}\mathrm{x}_{2}^{2}+a_{3}\mathrm{x}_{1}\mathrm{x}_{2}$ , where $a_{1},a_{2},a_{3}\in\mathbb{R}$ . Then $f(\mathrm{x}_{1},1)-t(\mathrm{x}_{1},1)g(\mathrm{x}_{1},1)=\mathrm{x}_{1}^{4}+\mathrm{x}_{1}^{3}+\mathrm{x}_{1}+a_{1}\mathrm{x}_{1}^{4}+a_{2}\mathrm{x}_{1}^{2}+a_{3}\mathrm{x}_{1}^{3}$ . But no matter what the coefficients $a_{1},a_{2},a_{3}$ are, the polynomial $f(\mathrm{x}_{1},1)-t(\mathrm{x}_{1},1)g(\mathrm{x}_{1},1)$ has a sign change at the origin. Thus the implication (a) $\Rightarrow$ (b) in Theorem 3.1 would be false.

Lemma 3.3.

Let $p$ and $q$ be polynomials in $\mathbb{R}[\mathrm{x}]$ such that $\deg(p)=4$ , $\deg(q)=2$ and $S(q)\subseteq S(p)$ . Suppose that there exists a point $x^{\prime}\in\mathbb{R}$ with $q(x^{\prime})>0$ . Then there exists a non-negative polynomial $t\in\mathbb{R}[\mathrm{x}]$ of degree at most $2$ such that $p(x)-t(x)q(x)\geq 0$ for all $x\in\mathbb{R}$ .

Proof: Without loss of generality we can assume that neither $p$ nor $q$ are non-negative polynomials. Otherwise we could just take $t=0$ . For the sake of simplicity and oversight we will devide the proof in several meaningful cases:

Case I: The polynomial $p$ has a real double root $y\in\mathbb{R}$ : In this situation $p$ is divided by $s=(\mathrm{x}-y)^{2}\in\mathbb{R}[\mathrm{x}]$ . Set $h=\frac{p}{s}\in\mathbb{R}[\mathrm{x}]$ . It is clear that $h$ is a polynomial of degree $2$ and that the inclusion $S(q)\subseteq S(h)$ holds: The only situation in which $S(q)\subseteq S(h)$ might fail is the one, where $y$ is an isolated point of $S(q)$ . But this would imply that $s|q$ , which means that $q$ is either a non-negative or a non-positive polynomial. Obviously non of these two cases can occur. According to the S-lemma (Theorem 0.1) there is a non-negative real number $t^{\prime}\geq 0$ such that $h(x)-t^{\prime}q(x)\geq 0$ for all $x\in\mathbb{R}$ . This implies that the inequality $0\leq s(x)(h(x)-t^{\prime}q(x))=p(x)-t^{\prime}s(x)q(x)$ holds for all $x\in\mathbb{R}$ . Set $t=t^{\prime}s$ and the assertion follows.

Case II: The polynomial $p$ has a complex root $c\in\mathbb{C}\backslash\mathbb{R}$ : If $c$ is a complex root of $p$ , then $p$ is divided by the polynomial $s=(\mathrm{x}-c)(\mathrm{x}-\overline{c})\in\mathbb{R}[\mathrm{x}]$ . Since $s$ has only complex roots, we conclude that $s$ is nowhere changing its sign. It is easy to see that $\lim_{x\rightarrow\infty}s(x)=\infty$ . Thus $s$ is non-negative. By defining $h=\frac{p}{s}$ and repeating the procedure in case I, we get the desired result.

Case III: The polynomial $p$ has four distinct real roots $x_{1}<x_{2}<x_{3}<x_{4}$ : Here we have two possibilities how $q$ may actually look like. If there exists a point $c\in[x_{1},x_{2}]$ such that $p(c)<0$ , then $q=\alpha(\mathrm{x}-\tilde{x}_{1})(\mathrm{x}-\tilde{x}_{4})$ for $\alpha>0$ and $\tilde{x}_{1}\leq x_{1}$ , $\tilde{x}_{4}\geq x_{4}$ . If there exists a point $c\in[x_{1},x_{2}]$ such that $p(c)>0$ , then $q=\alpha(\mathrm{x}-\tilde{x}_{1})(\mathrm{x}-\tilde{x}_{4})$ for $\alpha<0$ and $\tilde{x}_{1}\geq x_{1}$ , $\tilde{x}_{4}\leq x_{4}$ .

Consider the first possibility. Define $h=\alpha(\mathrm{x}-x_{1})(\mathrm{x}-x_{4})$ and $s_{1}=\frac{p^{\prime}(x_{1})}{h^{\prime}(x_{1})}$ , $s_{4}=\frac{p^{\prime}(x_{4})}{h^{\prime}(x_{4})}$ . Note that $h^{\prime}$ does not vanish at $x_{1}$ resp. $x_{4}$ . Thus $s_{1}$ and $s_{4}$ are well defined positive real numbers. Let $v\in\mathbb{R}[\mathrm{x}]$ be a positive polynomial of degree 2 such that $v(x_{1})=s_{1}$ and $v(x_{4})=s_{4}$ . For example, consider the polynomial $v=a(\mathrm{x}-x_{1})^{2}+s_{1}$ with $a=\frac{s_{4}-s_{1}}{(x_{4}-x_{1})^{2}}$ if $s_{1}\leq s_{4}$ resp. the polynomial $v=a(\mathrm{x}-x_{4})^{2}+s_{4}$ with $a=\frac{s_{1}-s_{4}}{(x_{1}-x_{4})^{2}}$ if $s_{1}>s_{4}$ . The polynomial $w=p-vh$ has two double roots, namely one double root at $x_{1}$ and the other at $x_{4}$ . This proves that $w$ is either non-negative or non-positive. Since $w(x_{3})>0$ , the polynomial $w$ is indeed non-negative. It is easy to see that the inclusion $S(q)\subseteq S(h)$ holds. According to the S-lemma there exists a real non-negative number $t^{\prime}$ such that $h(x)-t^{\prime}q(x)\geq 0$ holds for all $x\in\mathbb{R}$ . This implies $-h(x)\leq-t^{\prime}q(x)$ for all $x\in\mathbb{R}$ and therefore $w(x)\leq p(x)-t^{\prime}v(x)q(x)$ . Since $w$ is non-negative, we are done. The second possibility is considered nearly analogous: Without loss of generality we can assume that $x_{1}\leq\tilde{x}_{1}$ , $x_{2}\geq\tilde{x}_{4}$ and $h=\alpha(\mathrm{x}-x_{1})(\mathrm{x}-x_{2})$ . As before we define $s_{1}=\frac{p^{\prime}(x_{1})}{h^{\prime}(x_{1})}$ resp. $s_{2}=\frac{p^{\prime}(x_{2})}{h^{\prime}(x_{2})}$ . Let $v$ be a positive polynomial of degree $2$ such that $v(x_{1})=s_{1}$ and $v(x_{2})=s_{2}$ are satisfied. The polynomial $w=p-vh$ has two double roots at $x_{1}$ and $x_{2}$ . Since $w(x_{4})>0$ , we see that $w$ is non-negative. As before we can deduce from $S(q)\subseteq S(h)$ that there exists a positive real number $t^{\prime}$ with $h(x)-t^{\prime}q(x)\geq 0$ for all $x\in\mathbb{R}$ , implying that $w(x)\leq p(x)-t^{\prime}v(x)q(x)$ for all $x\in\mathbb{R}$ . Set $t=t^{\prime}v$ and we are done. It is clear that there are no cases left that can occur. Thus the lemma is proven. $\boxempty$

Lemma 3.4.

Let $q\in\mathbb{R}[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}]$ be a quadratic form such that there exists a point $x^{\prime}\in\mathbb{R}^{n}$ with $q(x^{\prime})>0$ . For every point $x\in S(q)$ and for every $\varepsilon>0$ the intersection between $B_{\varepsilon}(x)$ and $\mathrm{int}(S(q))$ is non-empty.

Proof: By using an appropriate change of coordinates we can rewrite $q$ as $q=a_{1}\mathrm{x}_{1}^{2}+\cdots+a_{n}\mathrm{x}_{n}^{2}$ , where $a_{1},\ldots,a_{n}\in\mathbb{R}$ . Define $I=\left\{i\in\{1,\ldots,n\}:a_{i}>0\right\}$ . Note that $I\neq\varnothing$ because of $q(x^{\prime})>0$ . Take a point $x\in\mathbb{R}^{n}$ with $q(x)\geq 0$ . Choose an index $j\in I$ , a real positive number $\varepsilon>0$ and consider $q(x_{1},\ldots,x_{j}+\varepsilon,\ldots,x_{n})$ if $x_{j}\geq 0$ resp. $q(x_{1},\ldots,x_{j}-\varepsilon,\ldots,x_{n})$ if $x_{j}<0$ . It is clear that $q(x_{1},\ldots,x_{j}+\varepsilon,\ldots,x_{n})$ resp. $q(x_{1},\ldots,x_{j}-\varepsilon,\ldots,x_{n})$ is positive. This means that the point $y=\begin{cases}(x_{1},\ldots,x_{j}+\varepsilon,\ldots,x_{n}),\,\text{if}\,x_{j}\geq 0\\ (x_{1},\ldots,x_{j}-\varepsilon,\ldots,x_{n}),\,\text{if}\,x_{j}<0\\ \end{cases}$ is lying in both, the interior of $S(q)$ and the ball $B_{\varepsilon}(x)$ . In other words, the assertion is proven. $\boxempty$

Lemma 3.5.

Let $p\in\mathbb{R}[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}]$ be a polynomial of even degree $m\in\mathbb{N}_{0}$ and $q\in\mathbb{R}[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}]$ a polynomial of degree $2$ . Suppose further that there is a point $x^{\prime}\in\mathbb{R}^{n}$ such that $q(x^{\prime})>0$ . Let $\overline{p},\overline{q}\in\mathbb{R}[\mathrm{x}_{1},\ldots,\mathrm{x}_{n+1}]$ denote their homogenizations. If $S(q)\subseteq S(p)$ then $S(\overline{q})\subseteq S(\overline{p})$ .

Proof: We have to show that $S(\overline{q}(\mathrm{x},0))\subseteq S(\overline{p}(\mathrm{x},0))$ holds. Without loss of generality we can assume that $S(\overline{q}(\mathrm{x},0))\neq\varnothing$ . Note that $q(x^{\prime})>0$ implies $\overline{q}(x^{\prime},1)>0$ . Thus we can use Lemma 3.4. Let $c$ be an arbitrary point in $S(\overline{q}(\mathrm{x},0))$ . The task is to verify $c\in S(\overline{p}(\mathrm{x},0))$ . Lemma 3.4 tells us that $B_{\varepsilon}(c)\cap\mathrm{int}(S(\overline{q}))\neq\varnothing$ for $\varepsilon>0$ . For every $\varepsilon>0$ we can find a point $y_{\varepsilon}\in B_{\varepsilon}(c)\cap\mathrm{int}(S(\overline{q}))$ such that the $n+1$ -th component of $y_{\varepsilon}$ does not vanish. We have $\overline{q}(y_{\varepsilon,1},\ldots,y_{\varepsilon,n+1})=\frac{1}{y_{\varepsilon,n+1}^{2}}\overline{q}\left(\frac{y_{\varepsilon,1}}{y_{\varepsilon,n+1}},\ldots,\frac{y_{\varepsilon,n}}{y_{\varepsilon,n+1}},1\right)$ , where $\overline{q}\left(\frac{y_{\varepsilon,1}}{y_{\varepsilon,n+1}},\ldots,\frac{y_{\varepsilon,n}}{y_{\varepsilon,n+1}},1\right)>0$ . Therefore $\overline{p}\left(\frac{y_{\varepsilon,1}}{y_{\varepsilon,n+1}},\ldots,\frac{y_{\varepsilon,n}}{y_{\varepsilon,n+1}},1\right)\geq 0$ . Since $m$ is even, we get $\overline{p}(y_{\varepsilon,1},\ldots,y_{\varepsilon,n+1})=\frac{1}{y_{\varepsilon,n+1}^{m}}\overline{p}\left(\frac{y_{\varepsilon,1}}{y_{\varepsilon,n+1}},\ldots,\frac{y_{\varepsilon,n}}{y_{\varepsilon,n+1}},1\right)\geq 0$ . This implies that $\mathrm{dist}(c,S(\overline{p}))<\varepsilon$ . By making $\varepsilon>0$ arbitrary small and using the fact that $S(\overline{p})$ is a closed set, we conclude that $c\in S(\overline{p})$ . $\boxempty$

Proof of Theorem 3.1 aka. S4-conjecture in 2 variables: Without loss of generality we can restrict ourselves to the case where $f$ and $g$ are both not non-negative.

(b) $\Rightarrow$ (a): Trivial.

(a) $\Rightarrow$ (b): Since we are talking about quadratic forms, and $g$ is a quadratic form, it helps to take a look at its diagonal-form. Let us take a matrix $A\in\mathrm{O}_{2}$ and consider the induced map $\psi:\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]\rightarrow\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}],p\mapsto p\circ A$ . We can choose $A$ in such a way that $\psi(g)$ is in diagonal form. Applying $\psi$ on $f$ and $g$ does not mess up the prerequisites of Theorem 3.1. Thus we can assume that $g$ is already in diagonal from. Since $g$ is neither non-negative nor non-positive, two real numbers $a_{11},a_{22}\neq 0$ with $\mathrm{sgn}(a_{11})\neq\mathrm{sgn}(a_{22})$ can be found such that $g=a_{11}\mathrm{x}_{1}^{2}+a_{22}\mathrm{x}_{2}^{2}$ . Furthermore, we can demand that $a_{11}>0$ and $a_{22}<0$ because otherwise we could apply the coordinate transformation $\mathbb{R}^{2}\rightarrow\mathbb{R}^{2},(x_{1},x_{2})\mapsto(x_{2},x_{1})$ . The proof is devided into two cases:

Case I: $\deg_{\mathrm{x}_{1}}(f)$ and $\deg_{\mathrm{x}_{2}}(f)<4$ : First of all, we show that the monomial $\mathrm{x}_{1}^{3}\mathrm{x}_{2}$ cannot appear in $f$ , while the monomials $\mathrm{x}_{1}^{2}\mathrm{x}_{2}^{2}$ and $\mathrm{x}_{1}\mathrm{x}_{2}^{3}$ must appear in $f$ . Suppose the monomial $\mathrm{x}_{1}^{3}\mathrm{x}_{2}$ would appear in $f$ . Consider the polynomial $f(\mathrm{x}_{1},1)\in\mathbb{R}[\mathrm{x}_{1}]$ . Since $S(g)\subseteq S(f)$ , we get the conditions $\lim_{x_{1}\rightarrow\infty}f(x_{1},1)=\infty$ and $\lim_{x_{1}\rightarrow-\infty}f(x_{1},1)=\infty$ . But the leading term of $f(\mathrm{x}_{1},1)$ is of the form $\alpha\mathrm{x}_{1}^{3}$ for $\alpha\neq 0$ .

$S(g(\mathrm{x}_{1},1))$$S(g(1,\mathrm{x}_{2}))$$S(g)$

Thus $f_{1}(\mathrm{x}_{1},1)$ cannot fulfill the two conditions and therefore we get a contradiction. On the other side, we cannot exclude the monomial $\mathrm{x}_{1}\mathrm{x}_{2}^{3}$ . In case of the monomial $\mathrm{x}_{2}\mathrm{x}_{1}^{3}$ we exploited that the set $S(g(\mathrm{x}_{1},1))$ is symmetric and unbounded. This is not the case when we consider $S(g(1,\mathrm{x}_{2}))$ . Instead we can state that since $f$ is non-negative on the set $S(g)$ , the polynomial $f$ cannot consist of just one monomial $\mathrm{x}_{1}\mathrm{x}_{2}^{3}$ or $\mathrm{x}_{1}^{2}\mathrm{x}_{2}^{2}$ alone. So, we can rewrite $f$ as $f=\gamma\mathrm{x}_{1}^{2}\mathrm{x}_{2}^{2}+\beta\mathrm{x}_{1}\mathrm{x}_{2}^{3}$ with $\gamma>0$ and $\beta\in\mathbb{R}\backslash\{0\}$ . Define $s=-\frac{1}{2}ba_{22}\mathrm{x}_{2}^{4}+\frac{1}{2}ba_{11}\mathrm{x}_{1}^{2}\mathrm{x}_{2}^{2}+\beta\mathrm{x}_{1}\mathrm{x}_{2}^{3}$ and $t=\frac{1}{2}b\mathrm{x}_{2}^{2}$ where $a_{11}b=\gamma>0$ . A simple computation shows $f=tg+s$ . We are done, if we can show that $s$ is a non-negative polynomial. Since $s$ is divided by $\mathrm{x}_{2}^{2}$ , it is sufficient to prove that $s^{\prime}=\frac{s}{\mathrm{x}_{2}^{2}}=-\frac{1}{2}ba_{22}\mathrm{x}_{2}^{2}+\beta\mathrm{x}_{1}\mathrm{x}_{2}+\frac{1}{2}ba_{11}\mathrm{x}_{1}^{2}$ is non-negative. The discriminant of $s^{\prime}$ is given by $\mathrm{disc}(s^{\prime})=\left(\beta^{2}+a_{22}a_{11}b^{2}\right)\mathrm{x}_{1}^{2}$ . Then we have the equivalence $\mathrm{disc}(s^{\prime})\leq 0$ for all $x_{1}\in\mathbb{R}$ $\Leftrightarrow$ $\beta^{2}+a_{22}a_{11}b^{2}\leq 0$ . It is sufficient to show that $\beta^{2}+a_{22}a_{11}b^{2}\leq 0$ . To prove this, take a point $y\in\partial S(g)$ such that $y_{1},y_{2}>0$ . Furthermore we assume that $\beta<0$ . Now $y\in\partial S(g)$ implies $y_{1}=y_{2}\sqrt{\left|\frac{a_{22}}{a_{11}}\right|}$ and thus $f(y)=\left(\gamma\left|\frac{a_{22}}{a_{11}}\right|+\beta\sqrt{\left|\frac{a_{22}}{a_{11}}\right|}\right)y_{2}^{4}\geq 0$ . This is only possible if $\gamma\left|\frac{a_{22}}{a_{11}}\right|+\beta\sqrt{\left|\frac{a_{22}}{a_{11}}\right|}\geq 0$ . The inequality $\gamma\left|\frac{a_{22}}{a_{11}}\right|+\beta\sqrt{\left|\frac{a_{22}}{a_{11}}\right|}\geq 0$ is equivalent to $\gamma\sqrt{\left|\frac{a_{22}}{a_{11}}\right|}\geq|\beta|$ . By substituting $\gamma$ through $a_{11}b$ , we get $\sqrt{|a_{11}a_{22}|}b\geq|\beta|$ and finally $\beta^{2}+a_{11}a_{22}b^{2}\leq 0$ , because $a_{11}a_{22}<0$ .

If $\beta>0$ then we simply consider another point $y\in\partial S(q)$ with $y_{1}>0$ , $y_{2}<0$ , and repeat the arguments above.

Case II: $\deg_{\mathrm{x}_{1}}(f)=4$ or $\deg_{\mathrm{x}_{2}}(f)=4$ : We are only considering the case $\deg_{\mathrm{x}_{1}}(f)=4$ . Let $\tilde{f}\in\mathbb{R}[\mathrm{x}_{1}]$ be the dehomogenization of the polynomial $f$ in the variable $\mathrm{x}_{2}$ . In the same manner, let $\tilde{g}\in\mathbb{R}[\mathrm{x}_{1}]$ be the dehomogenization of $g$ . It is easy to see that $\tilde{f}$ and $\tilde{g}$ fulfill the prerequisites of Lemma 3.3. Thus there exists a non-negative polynomial $\tilde{t}\in\mathbb{R}[\mathrm{x}_{1}]$ with $\deg(\tilde{t})\leq 2$ such that $\tilde{f}(x_{1})-\tilde{t}(x_{1})\tilde{g}(x_{1})\geq 0$ for all $x_{1}\in\mathbb{R}$ . If $\tilde{f}(x_{1})-\tilde{t}(x_{1})\tilde{g}(x_{1})=0$ holds for all $x_{1}\in\mathbb{R}$ , then the assertion follows immediately. Otherwise, Lemma 3.5 tells us that $S(\tilde{f}-\tilde{t}\tilde{g})=\mathbb{R}$ and $S(\tilde{t})=\mathbb{R}$ imply $S\left(\overline{\tilde{f}-\tilde{t}\tilde{g}}\right)=\mathbb{R}^{2}$ and $S(t)=\mathbb{R}^{2}$ , where $t=\begin{cases}\overline{\tilde{t}}\,\text{if}\,\deg(\tilde{t})=2\\ \mathrm{x}_{2}^{2}\tilde{t}\,\text{if}\,\deg(\tilde{t})=0\end{cases}$ . If $\deg(\tilde{t})=2$ then we get that $f-tg=\mathrm{x}_{2}^{4}\left(\overline{\tilde{f}-\tilde{t}\tilde{g}}\right)$ , $f-tg=\mathrm{x}_{2}^{2}\left(\overline{\tilde{f}-\tilde{t}\tilde{g}}\right)$ or $f-tg=\overline{\tilde{f}-\tilde{t}\tilde{g}}$ . This follows directly from the fact that $\overline{\tilde{f}-\tilde{t}\tilde{g}}$ is of even degree and that $\deg(f)=\deg(tg)$ . If $\deg(\tilde{t})=0$ then it is easy to see that $f-tg=\overline{\tilde{f}-\tilde{t}\tilde{g}}$ . Thus $f-tg$ is non-negative. $\boxempty$

4 The S4-conjecture: A counterexample

It turns out that the S4-conjecture stated in 2.1 is wrong. First of all, we are going to give a ’lucky’ counterexample and afterwards, that means in the next chapter, we will investigate why the S4-conjecture cannot work.

Example 4.1.

A counterexample for 2.1: Consider the polynomials $f=\mathrm{x}_{1}^{3}\mathrm{x}_{3}+\mathrm{x}_{1}^{3}\mathrm{x}_{2}+\mathrm{x}_{2}^{2}\mathrm{x}_{3}^{2}$ and $g=\mathrm{x}_{1}\mathrm{x}_{3}+\mathrm{x}_{2}\mathrm{x}_{3}+\mathrm{x}_{1}\mathrm{x}_{2}$ . One can show that the inclusion $S(g)\subseteq S(f)$ holds: For example, type in

Reduce[ForAll[{x1,x2,x3},Implies[x1x3+x2x3+x1x2>=0,x1^3x3+x1^3x2+x2^2x3^2>=0]]]

in [Mathematica].

But there is no non-negative homogeneous polynomial $t\in\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2},\mathrm{x}_{3}]$ of degree at most $2$ such that $f(y)-t(y)g(y)\geq 0$ for all $y\in\mathbb{R}^{3}$ : Suppose we could find such a polynomial $t=a_{1}\mathrm{x}_{1}^{2}+a_{2}\mathrm{x}_{2}^{2}+a_{3}\mathrm{x}_{1}\mathrm{x}_{2}+a_{4}\mathrm{x}_{1}\mathrm{x}_{3}+a_{5}\mathrm{x}_{2}\mathrm{x}_{3}+a_{6}\mathrm{x}_{3}^{2}$ , where $a_{1},\ldots,a_{6}\in\mathbb{R}$ . A simple computation shows that $f-tg=\mathrm{x}_{1}^{3}\mathrm{x}_{2}-a_{1}\mathrm{x}_{1}^{3}\mathrm{x}_{2}-a_{3}\mathrm{x}_{1}^{2}\mathrm{x}_{2}^{2}-a_{2}\mathrm{x}_{1}\mathrm{x}_{2}^{3}+\mathrm{x}_{1}^{3}\mathrm{x}_{3}-a_{1}\mathrm{x}_{1}^{3}\mathrm{x}_{3}-a_{1}\mathrm{x}_{1}^{2}\mathrm{x}_{2}\mathrm{x}_{3}-a_{3}\mathrm{x}_{1}^{2}\mathrm{x}_{2}\mathrm{x}_{3}-a_{4}\mathrm{x}_{1}^{2}\mathrm{x}_{2}\mathrm{x}_{3}-a_{2}\mathrm{x}_{1}\mathrm{x}_{2}^{2}\mathrm{x}_{3}-a_{3}\mathrm{x}_{1}\mathrm{x}_{2}^{2}\mathrm{x}_{3}-a_{5}\mathrm{x}_{1}\mathrm{x}_{2}^{2}\mathrm{x}_{3}-a_{2}\mathrm{x}_{2}^{3}\mathrm{x}_{3}-a_{4}\mathrm{x}_{1}^{2}\mathrm{x}_{3}^{2}-a_{4}\mathrm{x}_{1}\mathrm{x}_{2}\mathrm{x}_{3}^{2}-a_{5}\mathrm{x}_{1}\mathrm{x}_{2}\mathrm{x}_{3}^{2}-a_{6}\mathrm{x}_{1}\mathrm{x}_{2}\mathrm{x}_{3}^{2}+\mathrm{x}_{2}^{2}\mathrm{x}_{3}^{2}-a_{5}\mathrm{x}_{2}^{2}\mathrm{x}_{3}^{2}-a_{6}\mathrm{x}_{1}\mathrm{x}_{3}^{3}-a_{6}\mathrm{x}_{2}\mathrm{x}_{3}^{3}$ .

Thus $f(\mathrm{x}_{1},0,1)-t(\mathrm{x}_{1},0,1)g(\mathrm{x}_{1},0,1)=\mathrm{x}_{1}^{3}-a_{1}\mathrm{x}_{1}^{3}-a_{4}\mathrm{x}_{1}^{2}-a_{6}\mathrm{x}_{1}$ . We know that $f-tg$ is a non-negative polynomial. This is only possible if $a_{1}=1$ , $a_{6}=0$ , and $a_{4}\leq 0$ . Since $t$ is non-negative and $a_{6}=0$ , the two coefficients $a_{4}$ , $a_{5}$ must also vanish. The leading term of $f(0,\mathrm{x}_{2},1)-t(0,\mathrm{x}_{2},1)g(0,\mathrm{x}_{2},1)$ is $-a_{2}\mathrm{x}_{2}^{3}$ , and therefore $a_{2}=0$ . Now only $a_{3}$ is not determined. But it is easy to see that $a_{3}$ must also vanish. Thus $f-tg$ reduces to $f-tg=-\mathrm{x}_{1}^{2}\mathrm{x}_{2}\mathrm{x}_{3}+\mathrm{x}_{2}^{2}\mathrm{x}_{3}^{2}$ , which is obviously not non-negative.

Remark 4.2.

Under an appropriate linear change of coordinates, we can rewrite $g$ as $g=\mathrm{x}_{1}^{2}-\frac{1}{2}\mathrm{x}_{2}^{2}-\frac{1}{2}\mathrm{x}_{3}^{2}$ . Thus under this new coordinates $S(g)$ is a double cone and every slice with a plane, parallel to the $x_{2},x_{3}$ -plane, is compact. Fix $c>0$ and consider $S:=S\left(c^{2}-\frac{1}{2}\mathrm{x}_{2}^{2}-\frac{1}{2}\mathrm{x}_{3}^{2}\right)$ , which is a compact subset of $\mathbb{R}^{2}$ . A simple computation shows that $f(c,x_{2},x_{3})>0$ for all $x_{2},x_{3}\in S$ . Using the Positivstellensatz of Schmüdgen, we see that $f(c,\mathrm{x}_{2},\mathrm{x}_{3})\in T(g(c,\mathrm{x}_{2},\mathrm{x}_{3}))$ , where $T(g(c,\mathrm{x}_{2},\mathrm{x}_{3}))$ denotes the preordering generated by $g(c,\mathrm{x}_{2},\mathrm{x}_{3})$ . This, however, is not true for $f$ and $g$ , i.e $f\notin T(g)$ . See Lemma 5.1 and Remark 5.2.

5 Geometric analysis

In this section we are going to take a closer look at the counterexample. In particular we are interested in the geometric properties of $V_{1}=\mathcal{V}(f)$ and $V_{2}=\mathcal{V}(g)$ and what they have to do with the counterexample. In this section we will fix $f=\mathrm{x}_{1}^{3}\mathrm{x}_{3}+\mathrm{x}_{1}^{3}\mathrm{x}_{2}+\mathrm{x}_{2}^{2}\mathrm{x}_{3}^{2}$ and $g=\mathrm{x}_{1}\mathrm{x}_{3}+\mathrm{x}_{2}\mathrm{x}_{3}+\mathrm{x}_{1}\mathrm{x}_{2}$ .

Lemma 5.1.

Let $f$ and $g$ be as in Example 6. Then there is no non-negative homogeneous polynomial $t\in\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2},\mathrm{x}_{3}]$ of even degree $n$ such that $f(y)-t(y)g(y)\geq 0$ for all $y\in\mathbb{R}^{3}$ .

Proof: Without loss of generality we can assume that $n>2$ . Let $t$ be a non-negative homogeneous polynomial in $\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2},\mathrm{x}_{3}]$ of even degree $n>2$ . We are going to show that $f(\mathrm{x}_{1},\mathrm{x}_{2},1)-t(\mathrm{x}_{1},\mathrm{x}_{2},1)g(\mathrm{x}_{1},\mathrm{x}_{2},1)$ is not a non-negative polynomial in $\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]$ , which is a stronger statement than that in the lemma. Because $f(\mathrm{x}_{1},\mathrm{x}_{2},0)=\mathrm{x}_{1}^{3}\mathrm{x}_{2}$ is not non-negative, we can assume that $t(\mathrm{x}_{1},\mathrm{x}_{2},0)\neq 0$ . Thus we have $\deg(t(\mathrm{x}_{1},\mathrm{x}_{2},0))=n$ .

Write $t(\mathrm{x}_{1},\mathrm{x}_{2},1)=\sum_{\alpha\in\mathbb{N}^{2},|\alpha|\leq n}c_{\alpha}\mathrm{x}^{\alpha}$ and define $I=\left\{\alpha\in\mathbb{N}^{2}:|\alpha|=n,c_{\alpha}\neq 0\right\}$ . Note that $I$ is not empty, since $\deg(t(\mathrm{x}_{1},\mathrm{x}_{2},0))=n$ . Without loss of generality we can assume that there is an element $\alpha\in I$ such that $\alpha_{2}>0$ . Otherwise, we could simply interchange the variables $\mathrm{x}_{1}$ and $\mathrm{x}_{2}$ . Let $\alpha^{\prime}$ be the uniquely determined element of $I$ that satisfies $\alpha_{2}^{\prime}>\alpha_{2}$ for all $\alpha\in I\backslash\{\alpha^{\prime}\}$ . There exists a real number $\beta>0$ such that $c_{\alpha^{\prime}}\beta^{\alpha_{2}^{\prime}}+\sum_{\alpha\in I\backslash\{\alpha^{\prime}\}}c_{\alpha}\beta^{\alpha_{2}}\neq 0$ : Indeed, $\sum_{\alpha\in I}c_{\alpha}\upbeta^{\alpha_{2}}$ is a polynomial in $\mathbb{R}[\upbeta]$ of degree $\alpha_{2}^{\prime}>0$ and therefore does not vanish. In fact, $\sum_{\alpha\in I}c_{\alpha}\beta^{\alpha_{2}}$ is the leading coefficient and $\sum_{\alpha\in I}c_{\alpha}\beta^{\alpha_{2}}\mathrm{x}_{1}^{\alpha_{1}+\alpha_{2}}=\sum_{\alpha\in I}c_{\alpha}\beta^{\alpha_{2}}\mathrm{x}_{1}^{n}$ the leading term of the polynomial $t(\mathrm{x}_{1},\beta\mathrm{x}_{1},1)$ . So, the real positive number $\beta$ is needed to make sure that this leading term does not vanish. Because $t(\mathrm{x}_{1},\beta\mathrm{x}_{2},1)$ is non-negative, the coefficient $\sum_{\alpha\in I}c_{\alpha}\beta^{\alpha_{2}}$ is positive. The leading term of $tg$ is $\left(\sum_{\alpha\in I}c_{\alpha}\beta^{\alpha_{2}+1}\right)\mathrm{x}_{1}^{n+2}$ with a positive coefficient $\sum_{\alpha\in I}c_{\alpha}\beta^{\alpha_{2}+1}$ . This implies that the leading term of $f-tg$ is $-\left(\sum_{\alpha\in I}c_{\alpha}\beta^{\alpha_{2}+1}\right)\mathrm{x}_{1}^{n+2}$ . Therefore we get

[TABLE]

which proves the lemma. $\boxempty$

Remark 5.2.

Lemma 5.1 combined with the Counterexample 4.1 proves that $f\notin T(g)$ . Suppose we could find sums of squares $\sigma_{1}$ and $\sigma_{2}$ in $\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2},\mathrm{x}_{3}]$ such that $f=\sigma_{1}+\sigma_{2}g$ . Then we have $\tilde{f}=\tilde{\sigma}_{1}+\tilde{\sigma}_{2}\tilde{g}$ , where we dehomogenize with respect to $\mathrm{x}_{3}$ . Without loss of generality we can assume that $\deg(\tilde{\sigma}_{2})\geq 2$ . We distinguish between two cases:

•

We have $\deg\left(\tilde{\sigma}_{2}\tilde{g}\right)=\deg(\tilde{f})=4$ : Under this condition, we have $f-\overline{\tilde{\sigma}_{2}}g=\mathrm{x}_{3}^{n}\left(\overline{\tilde{f}-\tilde{\sigma}_{2}\tilde{g}}\right)$ , where $n\leq 4$ is a even number. By using Lemma 3.5 we see that $\overline{\tilde{f}-\tilde{\sigma}_{2}\tilde{g}}$ is non-negative. Thus $f-\overline{\tilde{\sigma}_{2}}g$ is non-negative. Since $\overline{\tilde{\sigma}_{2}}$ is a non-negative polynomial of degree $2$ , we get a contradiction.

•

We have $\deg\left(\tilde{\sigma}_{2}\tilde{g}\right)>\deg(\tilde{f})=4$ : In this case we proceed as in Lemma 5.1: Choose a real positive number $\beta$ such that the leading monomial of $f(\mathrm{x}_{1},\beta\mathrm{x}_{1},1)-\sigma_{2}(\mathrm{x}_{1},\beta\mathrm{x}_{1},1)g(\mathrm{x}_{1},\beta\mathrm{x}_{1},1)$ is $-L\beta\mathrm{x}_{1}^{2}$ , where $L$ denotes the leading monomial of $\sigma_{2}(\mathrm{x}_{1},\beta\mathrm{x}_{1},1)$ . Since $\sigma_{2}(\mathrm{x}_{1},\beta\mathrm{x}_{1},1)$ is a sum of squares in $\mathbb{R}[\mathrm{x}_{1}]$ , the polynomial $L$ is a sum of squares and therefore $L\beta\mathrm{x}_{1}^{2}$ is non-negative. Thus

[TABLE]

which contradicts our assumption.

Lemma 5.3.

The two $\mathbb{R}$ -varieties $\mathcal{V}(g)$ and $\mathcal{V}(f)$ are both geometrically irreducible. Furthermore, the set $H=\left\{(0,x_{2},0):x_{2}\in\mathbb{R}\right\}$ is the set of all $\mathbb{R}$ -rational singularities of $\mathcal{V}(g)$ and $\mathcal{V}(f)$ .

Proof: First, $\mathcal{V}(f)$ resp. $\mathcal{V}(g)$ is irreducible if and only if $\mathcal{V}(\tilde{f})$ resp. $\mathcal{V}(\tilde{g})$ is irreducible, where $\tilde{f}=\mathrm{x}_{1}^{3}+\mathrm{x}_{1}^{3}\mathrm{x}_{2}+\mathrm{x}_{2}^{2}$ and $\tilde{g}=\mathrm{x}_{1}+\mathrm{x}_{2}+\mathrm{x}_{1}\mathrm{x}_{2}$ . This statement is a well known fact. Consider the polynomial $\tilde{g}$ as an element of the ring $\mathbb{C}[\mathrm{x}_{1}][\mathrm{x}_{2}]$ . Furthermore, $\tilde{g}$ is a primitive polynomial. According to [Bo, Satz 2, p. 68] we know that $\tilde{g}$ is irreducible in $\mathbb{C}[\mathrm{x}_{1}][\mathrm{x}_{2}]$ if the image of $\tilde{g}$ in $\left(\mathbb{C}[\mathrm{x}_{1}]/\mathrm{x}_{1}\mathbb{C}[\mathrm{x}_{1}]\right)[\mathrm{x}_{2}]$ is irreducible, which is easy to verify. Hence $\mathcal{V}(\tilde{g})$ is an irreducible $\mathbb{C}$ -variety. Let us consider $\tilde{f}$ as an element of the polynomial ring $\mathbb{C}[\mathrm{x}_{1}][\mathrm{x}_{2}]$ . The polynomial $\tilde{f}$ cannot be divided by any irreducible polynomial in $\mathbb{C}[\mathrm{x}_{1}]$ : Indeed, $\mathbb{C}[\mathrm{x}_{1}][\mathrm{x}_{2}]/\mathrm{x}_{1}\mathbb{C}[\mathrm{x}_{1}][\mathrm{x}_{2}]\cong\left(\mathbb{C}[\mathrm{x}_{1}]/\mathrm{x}_{1}\mathbb{C}[\mathrm{x}_{1}]\right)[\mathrm{x}_{2}]$ and the image of $\tilde{f}$ in $\left(\mathbb{C}[\mathrm{x}_{1}]/\mathrm{x}_{1}\mathbb{C}[\mathrm{x}_{1}]\right)[\mathrm{x}_{2}]$ is not a unit. Suppose an irreducible factor $h$ of $\tilde{f}$ has the same degree in $\mathrm{x}_{2}$ as $\tilde{f}$ . Then $h$ must coincide with $\tilde{f}$ : If there would exist another irreducible factor $v$ , then $v$ must lie in $\mathbb{C}[\mathrm{x}_{1}]$ . But this would be a contradiction, since $v\nmid\tilde{f}$ . It is impossible that $\tilde{f}$ factors into more than one component in $\mathbb{C}[\mathrm{x}_{1}][\mathrm{x}_{2}]$ : Suppose we could write $\tilde{f}=h_{1}h_{2}$ , where $h_{1},h_{2}\in\mathbb{C}[\mathrm{x}_{1}][\mathrm{x}_{2}]$ are polynomials of degree $1$ in $\mathrm{x}_{2}$ . The polynomials $h_{1}$ and $h_{2}$ can be written as $h_{1}=\mathrm{x}_{2}-v_{1}$ and $h_{2}=\mathrm{x}_{2}-v_{2}$ , where $v_{1},v_{2}\in\mathbb{C}[\mathrm{x}_{1}]$ . Then $\tilde{f}=\mathrm{x}_{2}^{2}-\mathrm{x}_{2}v_{2}-\mathrm{x}_{2}v_{1}+v_{1}v_{2}$ . Therefore $v_{1}v_{2}=\mathrm{x}_{1}^{3}$ and $-\mathrm{x}_{2}(v_{1}+v_{2})=\mathrm{x}_{2}\mathrm{x}_{1}^{3}$ , which is utterly impossible. Hence $\mathcal{V}(\tilde{f})$ is an irreducible $\mathbb{C}$ -variety. It remains to verify the statement about the singularities. Consider a point $x\in\mathcal{V}(\tilde{f})$ such that $\nabla\tilde{f}(x)=0$ . Then we have $\nabla f(x^{\prime})=0$ for $x^{\prime}=(x,1)$ . On the other hand, suppose there is a point $x^{\prime}\in V_{1}$ with $x_{3}^{\prime}\neq 0$ such that $\nabla f(x^{\prime})=0$ . Then the point $x=\left(\frac{x_{1}^{\prime}}{x_{3}^{\prime}},\frac{x_{2}^{\prime}}{x_{3}^{\prime}}\right)$ will satisfy $\nabla\tilde{f}(x)=0$ . This shows that the singular points $x^{\prime}\in V_{1}$ with $x_{3}^{\prime}\neq 0$ ’come from’ the singular points of $\mathcal{V}(\tilde{f})$ . Thus it suffices to show that $\mathcal{V}(\tilde{f})$ has only one $\mathbb{R}$ -rational singularity at the origin and that all other $\mathbb{R}$ -rational singularities of $V_{1}$ are in $H$ .

The equation $\nabla\tilde{f}(x)=\left(\begin{array}[]{c}3x_{1}^{2}+3x_{1}^{2}x_{2}\\ x_{1}^{3}+2x_{2}\end{array}\right)=0$ has only one real solution $x=(0,0)$ in $\mathcal{V}(\tilde{f})$ , proving the first assertion of the last statement.

Finally, consider the equation $\nabla f(x_{1},x_{2},0)=\left(\begin{array}[]{c}3x_{1}^{2}x_{2}\\ x_{1}^{3}\\ x_{1}^{3}\end{array}\right)=0$ , where the set of all solutions in $\mathbb{R}^{3}$ is exactly $H$ . Note that $H$ is a subset of $V(f)(\mathbb{R})$ and $V(g)(\mathbb{R})$ . The same argumentation applied to $g$ gives the same result. Thus the lemma is proven. $\boxempty$

Remark 5.4.

A standard theorem in algebraic geometry states that a irreducible variety $V$ over $\mathbb{C}$ is connected with respect to the norm topology. For a proof see [Sha II, Theorem 1, p. 126]. In 5.3 we proved that $V_{2}=V(g)$ is irreducible. Thus $V_{2}\subseteq\mathbb{C}^{2}$ is a connected set. But it is easy to see that $V_{2}(\mathbb{R})$ is not connected. This implies that [Sha II, Theorem 1, p. 126] is not true if we just consider the $\mathbb{R}$ -rational points.

Lemma 5.5.

Let $(q_{n})_{n}$ be a convergent sequence of quadratic forms in $\mathbb{R}[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}]$ and $(p_{n})_{n}$ a convergent sequence of forms in $\mathbb{R}[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}]$ of degree $d$ such that $S(q_{n})\subseteq S(p_{n})$ for every $n\in\mathbb{N}$ . If $q$ and $p$ are the limits of the sequences $(q_{n})_{n}$ and $(p_{n})_{n}$ , then $S(q)$ is a subset of $S(p)$ if $q\neq 0$ .

Proof: First of all we are going to prove the assertion under the assumption that $\mathrm{int}(S(q))\neq\varnothing$ . Under an appropriate change of coordinates we can assume that $q=a_{1}\mathrm{x}_{1}^{2}+\cdots+a_{n}\mathrm{x}_{n}^{2}$ , where $a_{1},\ldots,a_{n}\in\mathbb{R}$ . For every point $y\in S(q)$ there exists a sequence $(y_{n})_{n}\subset\mathrm{int}(S(q))$ such that $\lim_{n\rightarrow\infty}y_{n}=y$ . Since $\mathrm{int}(S(q))$ is not empty, we can handle this statement with Lemma 3.4. Consider a point $x$ lying in the interior of $S(q)$ . Then there exists a number $N\in\mathbb{N}$ such that $q_{n}(x)>0$ for all $n\geq N$ , implying that $\lim_{n\rightarrow\infty}q_{n}(x)\geq 0$ . Since $S(q_{n})$ is a subset of $S(p_{n})$ , we get $\lim_{n\rightarrow\infty}p_{n}(x)\geq 0$ resp. $x\in S(p)$ . If $x$ lies in $\partial S(q)$ , then there exists a sequence $(x_{n})\subset\mathrm{int}(S(q))$ such that $\lim_{n\rightarrow\infty}x_{n}=x$ . But we showed above that this sequence also lies in $S(p)$ . Thus $x$ lies in $S(p)$ , since $S(p)$ is closed.

Finally, the case $\mathrm{int}(S(q))=\varnothing$ must be considered. This is only possible if the coefficients of $q=a_{1}\mathrm{x}_{1}^{2}+\cdots+a_{n}\mathrm{x}_{n}^{2}$ are all non-positive and at least one of them is negative. If all coefficients are negative, then we are done, since any form in $\mathbb{R}[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}]$ is non-negative at the origin. Therefore we can assume that not all coefficients are negative. Thus the set $I^{\prime}=\left\{i\in\{1,\ldots,n\}:a_{i}=0\right\}$ is not empty. Set $H=\prod_{i\notin I^{\prime}}\mathbb{R}\times\prod_{i\in I^{\prime}}\{0\}$ , $H^{\prime}=\prod_{i\in I^{\prime}}\mathbb{R}\times\prod_{i\notin I^{\prime}}\{0\}$ and consider $q^{\prime}=q|_{H}$ , $p^{\prime}=p|_{H}$ . Then we have $S(q^{\prime})=\{0\}$ by assumption and therefore $S(q^{\prime})\subseteq S(p^{\prime})$ . Since $S(q)=S(q^{\prime})\cup H^{\prime}$ , we have to make sure that $p^{\prime\prime}=p|_{H^{\prime}}$ is non-negative. Consider $q_{n}^{\prime\prime}=q_{n}|_{H^{\prime}}$ , $q^{\prime\prime}=q|_{H^{\prime}}$ and $p_{n}^{\prime\prime}=p_{n}|_{H^{\prime}}$ as polynomials in $\mathbb{R}\left[\mathrm{x}_{i}:i\in I^{\prime}\right]$ . By using the facts that $S(q^{\prime\prime})$ has a non-empty interior and $S(q_{n}^{\prime\prime})\subseteq S(p_{n}^{\prime\prime})$ for all $n\in\mathbb{N}$ , we can apply the result made in the first part to deduce that $S(p^{\prime\prime})\supseteq S(q^{\prime\prime})$ . Thus the lemma is proven. $\boxempty$

Proposition 5.6.

Let

Set $S=\left\{(q,p)\in\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2},\mathrm{x}_{3}]_{2}\times\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2},\mathrm{x}_{3}]_{4},q\,\text{quadratic form},\,p\,\text{4-form}\right\}$ and let $S_{4}$ be the set of all $(q,p)\in S$ that satisfy the following condition:

•

There exists a non-negative homogeneous polynomial $t\in\mathbb{R}[x_{1},x_{2},x_{3}]_{2}$ such that $p(y)-t(y)q(y)\geq 0$ for all $y\in\mathbb{R}^{3}$ .

Then the set $S_{4}$ is a closed subset of $S$ .

Proof: Let $P_{4,3}\subset\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2},\mathrm{x}_{3}]_{4}$ be the set of all non-negative 4-forms and $P_{2,3}\subset\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2},\mathrm{x}_{3}]_{2}$ the set of all non-negative quadratic forms. It is well known that $P_{4,3}$ and $P_{2,3}$ are closed cones (see Proposition 1.12).

Let $(q_{n},p_{n})_{n}\subset S_{4}$ be a convergent sequence in $S$ . For every $n\in\mathbb{N}$ there is a $t_{n}\in P_{2,3}$ such that $p_{n}-t_{n}q_{n}\in P_{4,3}$ . Or in other words, there exists a sequence $(t_{n})_{n}\subset P_{2,3}$ such that $(p_{n}-t_{n}q_{n})_{n}\subset P_{4,3}$ . Since $P_{2,3}$ and $P_{4,3}$ are closed, we get $t=\lim_{n\rightarrow\infty}t_{n}\in P_{2,3}$ and $\lim_{n\rightarrow\infty}(p_{n}-t_{n}q_{n})=p-tq\in P_{4,3}$ , where $p=\lim_{n\rightarrow\infty}p_{n}\in P_{4,3}$ and $q=\lim_{n\rightarrow\infty}q_{n}\in P_{2,3}$ .

Lemma 5.5 tells us that $S(q)$ is a subset of $S(p)$ if $q\neq 0$ . If $q=0$ then $f-tq\in P_{4,3}$ implies that $f\in P_{4,3}$ , which leads straight to $S(q)=S(f)=\mathbb{R}^{3}$ . Altogether we have that $(q,p)\in S_{4}$ and therefore $S_{4}$ is a closed subset of $S$ . $\boxempty$

Let us consider the following statement:

Conjecture 5.7.

Dehomogenized S4-Conjecture: Let $f$ be a polynomial fo degree $4$ in $\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]$ and let $g$ be a polynomial of degree $2$ in $\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]$ . Suppose there exists a point $x^{\prime}\in\mathbb{R}^{2}$ such that $g(x^{\prime})>0$ . Then the following statements are equivalent:

(a)

The inclusion $S(g)\subseteq S(f)$ holds. 2. (b)

There exists a non-negative polynomial $t\in\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]_{2}$ such that $f(x)-t(x)g(x)\geq 0$ for all $x\in\mathbb{R}^{2}$ .

Note that Lemma 3.5 makes sure that if we find a counterexample for 5.7 we have a counterexample for the original S4-conjecture by homogenization:

If $p,q\in\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2},\mathrm{x}_{3}]$ satisfy the S4-conjecture, then $\tilde{p}\in\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]$ and $\tilde{q}\in\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]$ will satisfy the dehomogenized S4-conjecture. Suppose $\tilde{p},\tilde{q}\in\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]$ satisfy the dehomogenized S4-conjecture. Then $\tilde{p}-\tilde{t}\tilde{q}$ is non-negative. Lemma 3.5 tells us that $\overline{\tilde{p}-\tilde{t}\tilde{q}}$ and $t:=\begin{cases}\overline{\tilde{t}},\,\text{if}\,\deg(\tilde{t})=2\\ \mathrm{x}_{3}^{2}\tilde{t},\,\text{if}\,\deg(\tilde{t})=0\end{cases}$ are non-negative. Let $p$ and $q$ be the homogenizations of $\tilde{p}$ and $\tilde{q}$ . Then we have $f-tg=\mathrm{x}_{3}^{4}\left(\overline{\tilde{p}-\tilde{t}\tilde{q}}\right)$ , $f-tg=\mathrm{x}_{3}^{2}\left(\overline{\tilde{p}-\tilde{t}\tilde{q}}\right)$ or $f-tg=\overline{\tilde{p}-\tilde{t}\tilde{q}}$ , which implies the non-negativity of $p-tq$ . It is easy to see that the polynomials $\tilde{f}=\mathrm{x}_{1}^{3}+\mathrm{x}_{1}^{3}\mathrm{x}_{2}+\mathrm{x}_{2}^{2}$ and $\tilde{g}=\mathrm{x}_{1}+\mathrm{x}_{2}+\mathrm{x}_{1}\mathrm{x}_{2}$ form a counterexample for 5.7. What is the point with the dehomogenized S4-conjecture?

We want to use Proposition 5.6 to prove that for a small $\varepsilon>0$ the two homogeneous polynomials $f_{\varepsilon}=f+\varepsilon\mathrm{x}_{3}^{4}$ and $g_{\varepsilon}=g+\varepsilon\mathrm{x}_{3}^{2}$ form still a counterexample to the S4-conjecture. First of all, we must make sure that $S(g_{\varepsilon})\subseteq S(f_{\varepsilon})$ . But it is easy to see that $S\left(\tilde{g}_{\varepsilon}\right)\subseteq S\left(\tilde{f}_{\varepsilon}\right)$ holds for small $\varepsilon>0$ . Using Lemma 3.5 we can deduce that $S(g_{\varepsilon})\subseteq S(f_{\varepsilon})$ .

By using Proposition 5.6 and shrinking $\varepsilon>0$ further if necessary, we can achieve $(g_{\varepsilon},f_{\varepsilon})\notin S_{4}$ . Thus we are getting a counterexample for the S4-conjecture.

But if we consider the geometry of $\mathcal{V}(g_{\varepsilon})$ and $\mathcal{V}(f_{\varepsilon})$ , then not much has changed. A simple computation shows that these two varieties have the same $\mathbb{R}$ -rational singularities as $\mathcal{V}(g)$ and $\mathcal{V}(f)$ . But $\mathcal{V}(\tilde{g}_{\varepsilon})$ and $\mathcal{V}(\tilde{f}_{\varepsilon})$ have not just no $\mathbb{R}$ -rational singularities, they are indeed non-singular varieties. While the geometric situation has not change in the homogeneous situation, the situation concerning the dehomogenized S4-conjecture is obviously different. But as we mentioned at the beginning of this investigation, $\tilde{f}$ and $\tilde{g}$ resp. $\tilde{f}_{\varepsilon}$ and $\tilde{g}_{\varepsilon}$ will fail the dehomogenized S4-conjecture. The point with dehomogenized S4-conjecture is, that it reflects the geometric differences between $\tilde{f},\tilde{g}$ and $\tilde{f}_{\varepsilon},\tilde{g}_{\varepsilon}$ in a better way than its homogeneous counterpart. In fact this result tells us, that the reason $f,g$ and $f_{\varepsilon},g_{\varepsilon}$ are failing the S4-conjecture must lie in some other geometric properties.

6 A generalization of the counterexample

Before continuing our investigation, it is worth to prove a generalization of the counterexample . We can extend this result by using some simple results in algebraic geometry. Let us consider the polynomials $f=\mathrm{x}_{1}^{3}+\mathrm{x}_{1}^{3}\mathrm{x}_{2}+\mathrm{x}_{2}^{2}$ and $g=\mathrm{x}_{1}+\mathrm{x}_{2}+\mathrm{x}_{1}\mathrm{x}_{2}$ which are the dehomogenizations of the polynomials in Example 6. Instead of introducing new symbols for the dehomogenization of the polynomials in Example 6, we will refer to them with the same symbols in this section.

Definition 6.1.

(Blow-up of $\mathbb{A}^{2}$ ): Let $Y$ be the variety that is defined by the equation $x_{1}z_{2}=x_{2}z_{1}$ , $(x_{1},x_{2};z_{1}:z_{2})\in\mathbb{A}^{2}\times\mathbb{P}^{1}$ . The restriction $\sigma:Y\rightarrow\mathbb{A}^{2}$ of the projection $\mathbb{A}^{2}\times\mathbb{P}^{1}\rightarrow\mathbb{A}^{2}$ onto $\mathbb{A}^{2}$ is called the blow-up of $\mathbb{A}^{2}$ centered at the origin.

Remark 6.2.

Definition 6.1 might suggest that a blow-up centered at a regular point $x$ of a variety is unique in nature. But this is not the fact. In case of Definition 6.1 it is true. But in case of an arbitrary quasi-projective variety $X$ , where $X$ is not projective, we have just uniqueness up to ’isomorphism’ [Sha I, Lemma, p. 117].

Proposition 6.3.

Let $V$ be a irreducible curve in $\mathbb{A}^{2}$ and $\sigma:Y\rightarrow\mathbb{A}^{2}$ the blow-up of $\mathbb{A}^{2}$ centered at the origin. Consider the curve $V^{\prime}=\overline{\sigma^{-1}\left(V\backslash\{0\}\right)}$ , where the bar denotes the Zariski-closure of $\sigma^{-1}\left(V\backslash\{0\}\right)$ . Then we have the following statements about $V$ and $V^{\prime}$ :

(a)

If $0\notin V$ then there is an isomorphism $V\xrightarrow{\,\smash{\raisebox{-2.15277pt}{$ \scriptstyle\sim $}}\,}V^{\prime}$ . 2. (b)

If $0\in V$ then $\sigma^{-1}(V)$ decomposes into two irreducible components $E=\{0\}\times\mathbb{P}^{1}$ (exceptional curve) and $V^{\prime}$ (birational transform).

Proof: See [Sha I, Theorem 1, p.118], though the statement is much more general. $\boxempty$

Let us return to our polynomials $f$ and $g$ . Set $V_{1}=\mathcal{V}(f)$ and $V_{2}=\mathcal{V}(g)$ . If we want to know more about the behaviour of $V_{1}$ resp. $V_{2}$ under the blow-up of $\mathbb{A}^{2}$ centered at the origin, we have to make sure that $f$ and $g$ are irreducible polynomials in $\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]$ . But this has been done in Lemma 5.3.

Consider the variety $Y$ given by the equation $x_{1}z_{2}=x_{2}z_{1}$ , $(x_{1},x_{2};z_{1}:z_{2})\in\mathbb{A}^{2}\times\mathbb{P}^{1}$ . Suppose $z_{1}\neq 0$ . Then we can choose $z_{1}=1$ and therefore we are getting $x_{2}=x_{1}z_{2}$ . In other words, we are considering $Y$ on the affine piece $\mathbb{A}^{2}\times\mathbb{A}^{1}$ . Substituting $\mathrm{x}_{2}$ through $\mathrm{x}_{1}\mathrm{z}_{2}$ leads to

[TABLE]

resp.

[TABLE]

We can make the following statement about $f_{1}$ and $g_{1}$ :

Proposition 6.4.

There is no non-negative polynomial $t\in\mathbb{R}[\mathrm{x}_{1},\mathrm{z}_{2}]$ such that $f_{1}(y)-t(y)g_{1}(y)\geq 0$ for all $y\in\mathbb{R}^{2}$ .

Proof: Suppose there is such a polynomial $t$ . Consider the homomorphism $\phi:\mathbb{R}[\mathrm{x}_{1},\mathrm{z}_{2}]\rightarrow\mathbb{R}\left[\mathrm{x}_{1},\frac{\mathrm{x}_{2}}{\mathrm{x}_{1}}\right],p(\mathrm{x}_{1},\mathrm{z}_{2})\mapsto p\left(\mathrm{x}_{1},\frac{\mathrm{x}_{2}}{\mathrm{x}_{1}}\right)$ . The next step is to verify that $\phi(t)$ is a polynomial in $\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]$ . Write $\phi(t)=\sum_{i,j}a_{ij}\mathrm{x}_{1}^{i}\left(\frac{\mathrm{x}_{2}}{\mathrm{x}_{1}}\right)^{j}=\sum_{i<j}a_{ij}\mathrm{x}_{1}^{i}\left(\frac{\mathrm{x}_{2}}{\mathrm{x}_{1}}\right)^{j}+\sum_{j\leq i}a_{ij}\mathrm{x}_{1}^{i}\left(\frac{\mathrm{x}_{2}}{\mathrm{x}_{1}}\right)^{j}$ . Suppose $\phi(t)\notin\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]$ , i.e $\sum_{i<j}a_{ij}\mathrm{x}_{1}^{i}\left(\frac{\mathrm{x}_{2}}{\mathrm{x}_{1}}\right)^{j}\neq 0$ . Choose $(i^{\prime},j^{\prime})\in\mathbb{N}^{2}_{0}$ such that the following conditions are satisfied:

•

We have $j^{\prime}>i^{\prime}$ and $a_{i^{\prime}j^{\prime}}\neq 0$ .

•

Define $\Delta_{ij}=(j^{\prime}-i^{\prime})-(j-i)$ , where $j,i\in\mathbb{N}_{0}$ . The inequality $\Delta_{ij}\geq 0$ holds for all $(i,j)\in\mathbb{N}^{2}_{0}$ with $a_{ij}\neq 0$ and $j>i$ .

•

We have $j^{\prime}>j$ for all $(i,j)\in\mathbb{N}^{2}_{0}$ with $j>i$ , $a_{ij}\neq 0$ and $\Delta_{ij}=0$ .

By using $\mathrm{x}_{1}^{j^{\prime}-i^{\prime}}$ as a common denominator for $\sum_{i<j}a_{ij}\mathrm{x}_{1}^{i}\left(\frac{\mathrm{x}_{2}}{\mathrm{x}_{1}}\right)^{j}$ we get

[TABLE]

Note that the polynomial

[TABLE]

has degree $j^{\prime}>0$ in the variable $\mathrm{x}_{2}$ . Take a point $x^{\prime}=(0,c)\in\mathbb{R}^{2}$ that satisfies $(0,c)\in\mathrm{int}(S(g))$ . All points on the $x_{2}$ -axis, but the origin, are inner points of $S(g)$ . This means that $c$ is just a positive real number. Since $\deg_{\mathrm{x}_{2}}(h(0,\mathrm{x}_{2}))>0$ , we can choose $c>0$ such that $h(x^{\prime})\neq 0$ . Consider the sequence $(x_{n})_{n\in\mathbb{N}}=\left(\left(\frac{1}{n},c\right)\right)_{n\in\mathbb{N}}$ . Then the limit $\lim_{n\rightarrow\infty}\phi(t)(x_{n})$ is not finite. While the nominator tends to a finite value $h(x^{\prime})\neq 0$ , the denominator tends to zero. Thus the limit cannot be finite. Since $t\in\mathbb{R}[\mathrm{x}_{1},\mathrm{z}_{2}]$ is non-negative, we see that $\lim_{n\rightarrow\infty}\phi(t)(x_{n})=\infty$ . But then

[TABLE]

This contradicts $f_{1}(y)-t(y)g_{1}(y)\geq 0$ for all $y\in\mathbb{R}^{2}$ . We just proved $\phi(t)\in\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]$ . We can deduce that $f(y)-\phi(t)(y)g(y)\geq 0$ for all $y\in\mathbb{R}^{2}$ , since $f_{1}(y)-t(y)g_{1}(y)\geq 0$ for all $y\in\mathbb{R}^{2}$ . But this contradicts Lemma 5.1. Thus the proposition is proven. $\boxempty$

Set $f_{1}^{\prime}=\mathrm{z}_{2}^{2}+\mathrm{x}_{1}+\mathrm{z}_{2}\mathrm{x}_{1}^{2}$ and $g_{1}^{\prime}=1+\mathrm{z}_{2}+\mathrm{z}_{2}\mathrm{x}_{1}$ . Repeat the same procedure done to $V_{1}$ and $V_{2}$ with $\mathcal{V}(f_{1}^{\prime})$ and $\mathcal{V}(g_{1}^{\prime})$ . Therefore we get two new polynomials $f_{2}$ and $g_{2}$ . Substituting $f_{1}^{\prime}$ resp. $g_{1}^{\prime}$ in $f_{1}=\mathrm{x}_{1}^{2}f_{1}^{\prime}$ resp. $g_{1}=\mathrm{x}_{1}g_{1}^{\prime}$ by $f_{2}$ and $g_{2}$ gives us the polynomials $\mathrm{x}_{1}^{2}f_{2}$ and $\mathrm{x}_{1}g_{2}$ . So, what’s the point in doing that? The degree of $f_{1}$ resp. $g_{1}$ compared to the degree of $f$ resp. $g$ has increased by one. The same is also true for $\mathrm{x}_{1}^{2}f_{2}$ and $\mathrm{x}_{1}g_{2}$ with respect to $f_{1}$ and $g_{1}$ . Let $f_{i}^{\prime}$ and $g_{i}^{\prime}$ denote the equations of the birational transformations of $\mathcal{V}(f_{i-1}^{\prime})$ and $\mathcal{V}(f_{i-1}^{\prime})$ on the affine piece $\mathbb{A}^{2}\times\mathbb{A}^{1}$ for $i\geq 2$ . By repeating this procedure we get the polynomials $\mathrm{x}_{1}^{2}f_{i}=\mathrm{x}_{1}^{3}f_{i}^{\prime}$ , where $f_{i}^{\prime}=\mathrm{x}_{1}^{2(i-2)+1}\mathrm{z}_{i}^{2}+1+\mathrm{z}_{i}\mathrm{x}_{1}^{2+i}$ and $\mathrm{z}_{i}=\mathrm{x}_{1}\mathrm{z}_{i-1}$ for $i\geq 2$ . By using Proposition 6.3 we can see immediately that the polynomial $f_{i}^{\prime}$ is irreducible for $i\geq 2$ . On the other hand, we get $\mathrm{x}_{1}g_{i}=\mathrm{x}_{1}g_{i}^{\prime}$ , where $g_{i}^{\prime}=1+\mathrm{z}_{i}+\mathrm{z}_{i}\mathrm{x}_{1}^{i}$ for $i\geq 2$ . Hence $\deg(\mathrm{x}_{1}^{2}f_{i})=\deg(\mathrm{x}_{1}^{2}f_{i-1})+2$ and $\deg(\mathrm{x}_{1}g_{i})=\deg(\mathrm{x}_{1}g_{i-1})+1$ for $i>2$ . The only specifics we used about the polynomials $f_{1}$ and $g_{1}$ in the proof of Proposition 6.4, was that $f_{1}$ resp. $g_{1}$ emerged from $f$ resp. $g$ by blowing up $V_{1}$ resp. $V_{2}$ and that there is no non-negative polynomial $t\in\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]$ such that $f-tg$ is non-negative. Hence we can make the same statement with respect to the polynomials $\mathrm{x}_{1}^{2}f_{i}$ and $\mathrm{x}_{1}g_{i}$ , where $i\geq 2$ . Thus we get a counterexample for the dehomogenized S4-conjecture in higher degrees. Finally, we can state:

Proposition 6.5.

For any natural number $d\in\{2n:n\geq 4\}\cup\{4,5,6\}$ there are polynomials $f$ and $g$ in $\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]$ that satisfy the following statements:

•

The degree of $f$ is $d$ and the degree of $g$ is $\nu(d)$ , where

[TABLE]

•

There is a point $x^{\prime}\in\mathbb{R}^{2}$ such that $g(x^{\prime})>0$ .

•

The inclusion $S(g)\subseteq S(f)$ holds.

•

There is no non-negative polynomial $t\in\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]$ such that $f(y)-t(y)g(y)\geq 0$ for all $y\in\mathbb{R}^{2}$ .

Of course, by applying the blow-up procedure to other counterexamples the result in Proposition 6.5 can be refined. As a hint one could start with the polynomials $\mathrm{x}_{1}^{2}\mathrm{x}_{2}-\mathrm{x}_{1}^{2}+1$ and $-\mathrm{x}_{1}^{2}+\mathrm{x}_{2}$ . But since a refinement of Proposition 6.5 is not our aim, we will not further pursue it.

Chapter 3 Quadratic modules and stability

The aim of this chapter is to clarify the reasons why Example 4.1 does form a counterexample for the S4-conjecture. In the first part of this chapter we will introduce the necessary tools to answer this question. This tools will be based on the article [Ne]. Finally, the second part is meant to deal with the question mentioned at the beginning, by answering it through a geometric criterion.

1 Preliminaries

The following definitions and theorems can be found in [Ne]. The aim is to provide a list of basic tools for later needs.

Definition 1.1.

A subset $M\subseteq\mathbb{R}[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}]=:A$ is called a quadratic module, if $1\in M$ , $M+M\subseteq M$ , and $A^{2}\cdot M\subseteq M$ holds, where $A^{2}$ denotes the set of squares in $A$ and $\Sigma A^{2}$ denotes the sum of squares in $A$ . Furthermore, $\mathrm{QM}(f_{1},\ldots,f_{s})=\left\{\sigma_{0}+\sigma_{1}f_{1}+\cdots+\sigma_{s}f_{s}:\sigma_{0},\ldots,\sigma_{s}\in\Sigma A^{2}\right\}$ is called the quadratic module generated by $f_{1},\ldots,f_{s}\in A$ .

Throughout this chapter $A$ will denote the polynomial ring $\mathbb{R}[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}]$ .

Definition 1.2.

Let $A=\bigoplus_{\gamma\in\Gamma}A_{\gamma}$ be a grading and let $M\subseteq A$ be a finitely generated quadratic module. $M$ is totally stable with respect to the grading if $\deg(f)\leq\deg(f+g)$ holds for all $f,g\in M$ . This is equivalent to the fact that there are generators $f_{1},\ldots,f_{s}$ of $M$ such that

[TABLE]

holds for all $\sigma_{j}\in\Sigma A^{2}$ . Any finite set of generators of $M$ fulfills this condition then.

Definition 1.3.

For $z\in\mathbb{Z}$ and $d\in\mathbb{Z}^{n}$ we define

[TABLE]

Then

[TABLE]

is a grading that we will call the $z$ -grading of $A$ . For an element $f\in A$ we define $L_{z}(f)$ to be the degree component (component with the highest degree) of $f$ with respect to the $z$ -grading of $A$ .

Remark 1.4.

In the literature the polynomials that lie in $A^{(z)}_{d}$ are called quasi-homogeneous polynomials of type $z$ and degree $d$ .

Definition 1.5.

For a compact set $K\subseteq\mathbb{R}^{n}$ with non-empty interior, we define the tentacle of $K$ in direction of $z\in\mathbb{Z}^{n}$ in the following way:

[TABLE]

Theorem 1.6.

Let $f_{1},\ldots,f_{s}$ be polynomials in the graded polynomial algebra $A=\bigoplus_{d\in\mathbb{Z}}A_{d}^{(z)}$ , where $z\in\mathbb{Z}^{n}$ . If the set $S(f_{1},\ldots,f_{s})\subseteq\mathbb{R}^{n}$ contains a tentacle $T_{K,z}$ , then the quadratic module $M=\mathrm{QM}(f_{1},\ldots,f_{s})$ is totally stable with respect to the $z$ -grading. If $M$ is closed under multiplication, then $S(f_{1},\ldots,f_{s})$ must contain such a tentacle for $M$ to be totally stable.

Proof: See [Ne][Theorem 5.2]. $\boxempty$

2 Stability and tentacles

Definition 2.1.

Let $q=\sum_{i,j}a_{ij}\mathrm{x}_{i}\mathrm{x}_{j}$ be a quadratic form in $A$ . The diagonal part $\mathcal{D}(q)$ of $q$ is defined by

[TABLE]

Definition 2.2.

Let $f$ be a polynomial in $A$ . The set $\mathcal{T}_{0}(f)$ is defined to be the set of all $z\in\mathbb{Z}^{n}$ , under which the quadratic module $\mathrm{QM}(f)$ is totally stable.

Proposition 2.3.

Let $f=\sum_{i=0}^{n}a_{i}\mathrm{x}^{i}$ be a polynomial of degree $n>0$ in $\mathbb{C}[\mathrm{x}]$ with distinct roots $x_{1},\ldots,x_{r}\in\mathbb{C}$ , where $r\leq n$ . Furthermore we define $f(\mathrm{x},y)=\sum_{i=0}^{n}(a_{i}+y_{i})\mathrm{x}^{i}$ for a point $y\in\mathbb{C}^{n+1}$ . For every $0<\varepsilon$ there exists a $\delta>0$ such that all distinct roots of $f(\mathrm{x},y)$ lie in $\bigcup_{i=1}^{r}B_{\varepsilon}(x_{i})$ for all $\left\|y\right\|_{2}<\delta$ .

Proof: Define $h(\mathrm{x},y)=\frac{f^{\prime}(\mathrm{x},y)}{f(\mathrm{x},y)}$ and $d=\begin{cases}\frac{1}{2},\,\text{if}\,r=1\\ \frac{1}{2}\min_{i>j}|x_{i}-x_{j}|,\,\text{otherwise}\end{cases}$ . Without loss of generality we can assume that $0<\varepsilon<d$ . It is therefore easy to see that we can find a simple closed, null-homologous path $\Gamma_{i,y}$ in $B_{\varepsilon}(x_{i})$ such that $x_{i}$ is in the interior of $\Gamma_{i,y}$ and $f(\mathrm{x},y)$ does not vanish on $\Gamma_{i,y}$ . For example, choose $\Gamma_{i,y}$ to be a circle around $x_{i}$ such that $f(\mathrm{x},y)$ does not vanish on this circle. Take an arbitrary $i=1,\ldots,r$ . According to a consequence of the residual theorem [RS][Proposition 13.2.3, p. 350] we have

[TABLE]

where $N_{i}(y)$ denotes the number of roots with multiplicity of $f(\mathrm{x},y)$ in $B_{\varepsilon}(x_{i})$ . It is easy to see that

[TABLE]

as $\left\|y\right\|_{2}\rightarrow 0$ . Since both integrals are integer numbers, there exists a real positive number $\delta_{i}$ such that $\frac{1}{2\pi i}\oint_{\Gamma_{i,y}}h(x,y)\mathrm{d}x=\frac{1}{2\pi i}\oint_{\Gamma_{i,y}}h(x,0)\mathrm{d}x$ for all $\left\|y\right\|_{2}<\delta_{i}$ . This implies that $N_{i}(y)=N_{i}(0)$ , where $N_{i}(0)$ is simply the multiplicity of the root $x_{i}$ of $f$ . Setting $\delta=\min\left\{\delta_{i}:i=1,\ldots,r\right\}$ concludes the proof. $\boxempty$

Corollary 2.4.

Suppose $f$ and $g$ are polynomials of degree $n_{1}>0$ resp. $n_{2}>0$ in $\mathbb{C}[\mathrm{x}]$ with distinct roots $x_{1},\ldots,x_{r_{1}}\in\mathbb{C}$ , where $r_{1}\leq n_{1}$ resp. $x_{1}^{\prime},\ldots,x_{r_{2}}^{\prime}\in\mathbb{C}$ , where $r_{2}\leq n_{2}$ . For every $y\in\mathbb{C}^{n+1}$ let $f(\mathrm{x},y)$ resp. $g(\mathrm{x},y)$ be defined as in the preceding proposition. Then for every $0<\varepsilon$ there exists a $\delta>0$ such that all roots and poles of $\frac{f(\mathrm{x},y)}{g(\mathrm{x},y)}$ lie in $\bigcup_{i=1}^{r_{1}}B_{\varepsilon}(x_{i})\cup\bigcup_{i=1}^{r_{2}}B_{\varepsilon}(x_{i}^{\prime})$ for all $\left\|\delta\right\|_{2}<\delta$ .

Theorem 2.5.

For a quadratic form $q\in A\backslash\{0\}$ the following statements hold:

(a)

Suppose $\mathcal{D}(q)$ is negative-definite. Then $\mathcal{T}_{0}(q)\subsetneqq\mathbb{Z}^{n}$ . 2. (b)

If $\mathcal{D}(q)$ is non-negative, then $\mathcal{T}_{0}(q)=\mathbb{Z}^{n}$ .

Proof: We will write $q=\sum_{i,j}a_{ij}\mathrm{x}_{i}\mathrm{x}_{j}$ in this proof.

(a): We have to show that $\mathcal{T}_{0}(q)\subsetneqq\mathbb{Z}^{n}$ . But this is quiet easy: Because $\mathcal{D}(q)$ is negative-definite there is a coefficient $a_{ii}$ for some $i\in\{1,\ldots,n\}$ such that $a_{ii}<0$ . Take $z\in\mathbb{Z}^{n}$ such that all components but the $i$ -th vanish and let the $i$ -th component be a large positive number. It is clear that $z\notin\mathcal{T}_{0}(q)$ .

(b): Suppose that $\mathcal{D}(q)=0$ . We have to show that $\mathcal{T}_{0}(q)=\mathbb{Z}^{n}$ . Because $\mathcal{T}_{0}(q)$ cannot be any bigger that $\mathbb{Z}^{n}$ , it is enough to prove that the inclusion $\mathcal{T}_{0}(q)\supseteq\mathbb{Z}^{n}$ holds. For a given $z\in\mathbb{Z}^{n}$ let $I$ be the set of all $(i_{1},j_{1})\in\mathbb{N}^{2}$ with $a_{i_{1}j_{1}}+a_{j_{1}i_{1}}\neq 0$ such that there is no $(i_{2},j_{2})\in\mathbb{N}^{2}$ with $a_{i_{2}j_{2}}+a_{j_{2}i_{2}}\neq 0$ and $z_{i_{2}}+z_{j_{2}}>z_{i_{1}}+z_{j_{1}}$ . Take an arbitrary $z\in\mathbb{Z}^{n}\backslash\{0\}$ and take $(i,j)\in I$ . Without loss generality we can demand that $a_{ij}+a_{ji}>0$ . Otherwise substitute the variable $\mathrm{x}_{i}$ through $-\mathrm{x}_{i}$ and define $\tilde{a}_{ij}=-a_{ij}$ resp. $\tilde{a}_{ji}=-a_{ji}$ as the new coefficient of $\mathrm{x}_{i}\mathrm{x}_{j}$ resp. $\mathrm{x}_{j}\mathrm{x}_{i}$ . In the next step we prove that there exists a point $x^{\prime}\in\mathbb{R}^{n}$ such that

•

$(a_{ij}+a_{ji})x_{i}^{\prime}x_{j}^{\prime}>0$

•

$q(x^{\prime})>0$

•

$x_{1}^{\prime},\ldots,x_{n}^{\prime}\neq 0$ .

•

$(a_{ij}+a_{ji})x_{i}^{\prime}x_{j}^{\prime}>2\left|\sum_{(i^{\prime},j^{\prime})\in I\backslash\{(i,j),(j,i)\}}a_{i^{\prime}j^{\prime}}x_{i^{\prime}}^{\prime}x_{j^{\prime}}^{\prime}\right|$

Let us start with a point $x\in\mathbb{R}$ that satisfies $x_{1},\ldots,x_{n}\neq 0$ and $\mathrm{sign}(x_{i})=\mathrm{sign}(x_{j})\neq 0$ . Without loss of generality we can assume that $q(x)\leq 0$ . Let us modify the point $x$ . Consider $q(x_{1},\ldots,\uplambda x_{i},\ldots,\uplambda x_{j},\ldots,x_{n})\in\mathbb{R}[\uplambda]$ . The leading term of this polynomial in $\uplambda$ is $(a_{ij}+a_{ji})x_{i}x_{j}\uplambda^{2}$ . Since $(a_{ij}+a_{ji})x_{i}x_{j}>0$ , we have $q(x_{1},\ldots,\lambda x_{i},\ldots,\lambda x_{j},\ldots,x_{n})\rightarrow\infty$ for $\lambda\rightarrow\infty$ . Choosing a large $\lambda\in\mathbb{R}$ and a new point $x^{\prime}\in\mathbb{R}$ with $x^{\prime}_{k}=x_{k}$ for $k\neq i,j$ and $x_{i}^{\prime}=\lambda x_{i}$ , $x_{j}^{\prime}=\lambda x_{j}$ leads to $q(x^{\prime})>0$ . Finally, we can achieve $(a_{ij}+a_{ji})x_{i}^{\prime}x_{j}^{\prime}>2\left|\sum_{(i^{\prime},j^{\prime})\in I\backslash\{(i,j),(j,i)\}}a_{i^{\prime}j^{\prime}}x_{i^{\prime}}^{\prime}x_{j^{\prime}}^{\prime}\right|$ by enlarging $\lambda$ further if necessary. Next, we want to find an appropriate point $x^{\prime}\in\mathbb{R}^{n}$ and a neighborhood $U$ of $x^{\prime}$ such that $T_{\overline{U},z}\subseteq S(q)$ . Take $(i,j)\in I$ and a point $x^{\prime}\in\mathbb{R}^{n}$ satisfying the four conditions mentioned above. Then we have $\sum_{(r,s)\in I}a_{rs}x_{r}^{\prime}x_{s}^{\prime}>0$ . Furthermore, $\sum_{(r,s)\in I}a_{rs}x_{r}^{\prime}x_{s}^{\prime}$ is the leading coefficient of the polynomial $\hat{q}(\uplambda,x^{\prime})=q\left(\uplambda^{z_{1}}x_{1}^{\prime},\ldots,\uplambda^{z_{n}}x_{n}^{\prime}\right)\in\mathbb{R}[\uplambda]$ . Consider $\hat{q}(\lambda,x^{\prime})$ for $\lambda\geq 1$ . So far, we have shown that the leading coefficient of $\hat{q}(\lambda,x^{\prime})$ is positive. Therefore $\hat{q}(\lambda,x^{\prime})\rightarrow\infty$ for $\lambda\rightarrow\infty$ . This implies the existence of a $\lambda^{\prime}\geq 1$ such that $\hat{q}(\lambda,x^{\prime\prime})=q\left(\lambda^{z_{1}}x_{1}^{\prime\prime},\ldots,\lambda^{z_{n}}x_{n}^{\prime\prime}\right)>0$ for all $\lambda\geq 1$ , where $x^{\prime\prime}\in\mathbb{R}^{n}$ is defined by $x_{i}^{\prime\prime}=\lambda^{\prime z_{i}}x_{i}^{\prime}$ for $i=1,\ldots,n$ . Let $U$ be a ’small’ neighborhood of $x^{\prime\prime}$ such that $\sum_{\alpha\in I}a_{\alpha}y^{\alpha}>0$ for all $y\in U$ . Interpreting $\hat{q}(\uplambda,x^{\prime\prime})$ as a polynomial in $\mathbb{C}[\uplambda]$ and using Proposition 2.3, we see that no real root of $\hat{q}(\uplambda,y)$ can be greater than $1$ , if $U$ is small enough. This implies that $\hat{q}(\lambda,y)$ is positive for all $\lambda\geq 1$ and all $y\in U$ . Hence $T_{\overline{U},z}\subseteq S(q)$ . Let us assume that $\mathcal{D}(q)$ is positive definite. We need to verify that $\mathcal{T}_{0}(q)=\mathbb{Z}^{n}$ . Consider the quadratic form $\tilde{q}=q-\mathcal{D}(q)$ . Suppose that $\tilde{q}=0$ . Since $q(x)\geq\tilde{q}(x)$ holds for all $x\in\mathbb{R}^{n}$ , we see that $S(\tilde{q})$ is a subset of $S(q)$ . This implies $S(q)=\mathbb{R}^{n}$ and $\mathcal{T}_{0}(q)=\mathbb{Z}^{n}$ . If $\tilde{q}\neq 0$ then we get $\mathcal{T}_{0}(\tilde{q})=\mathbb{Z}^{n}$ , since $\mathcal{D}(\tilde{q})=0$ . The inclusion $S(\tilde{q})\subseteq S(q)$ leads straight to $\mathcal{T}_{0}(q)=\mathbb{Z}^{n}$ . $\boxempty$

Definition 2.6.

Let $\varphi=(\varphi_{1},\ldots,\varphi_{n})\in\mathbb{R}(\uplambda)^{n}$ be a tuple of rational fractions $\varphi_{1},\ldots,\varphi_{n}\neq 0$ . Under a rational tentacle we understand the set

[TABLE]

where $K\subseteq\mathbb{R}^{n}$ is an compact set with non-empty interior. Furthermore, we denote by $\mathcal{T}_{0}(S)$ resp. $\mathcal{T}(S)$ the set of all tentacles resp. rational tentacles that are contained in a semi-algebraic set $S\subseteq\mathbb{R}^{n}$ .

Remark 2.7.

We want to establish a link between rational tentacles and the $z$ -gradings of $A$ . Let $f_{1},\ldots,f_{s}$ be polynomials in $A$ and $\mathcal{T}$ the set of all rational tentacles $T$ such that $T\subseteq S(f_{1},\ldots,f_{s})$ . We say that a tentacle $T\in\mathcal{T}$ is of degree $z\in\mathbb{Z}^{n}$ if there exists compact set $K\subseteq\mathbb{R}^{n}$ with non-empty interior and a tuple of rational fractions $\varphi\in\mathbb{R}(\uplambda)^{n}$ such that $T=T_{K,\varphi}$ and $(\deg(\varphi_{1}),\ldots,\deg(\varphi_{n}))=z$ , where $\deg$ is defined to be the negative degree valuation $-v_{\infty}$ . Therefore we can assign each $T$ a tuple in $\mathbb{Z}^{n}$ by $D(\varphi)=(\deg(\varphi_{1}),\ldots,\deg(\varphi_{n}))$ . However, this assignment is not unique. Thus a rational tentacle may have more degrees than merely just one.

We are now able to generalize Proposition [Ne, Proposition 5.1] and Theorem 1.6:

Proposition 2.8.

Let $f_{1},\ldots,f_{s}$ be polynomials in the graded polynomial algebra $A=\bigoplus_{d\in\mathbb{Z}}A_{d}^{(z)}$ , where $z\in\mathbb{Z}^{n}$ . Then the set

[TABLE]

is Zariski-dense in $\mathbb{R}^{n}$ if and only if the set $S(f_{1},\ldots,f_{s})\subseteq\mathbb{R}^{n}$ contains a rational tentacle $T_{K,\varphi}$ of degree $z$ for some compact set $K\subseteq\mathbb{R}^{n}$ with non-empty interior.

Proof: $\Rightarrow$ : The same proof as in [Ne, Proposition 5.1].

$\Leftarrow$ : For each $k=1,\ldots,s$ let $f_{k}$ be given by $f_{k}=\sum_{\alpha}a_{k,\alpha}\mathrm{x}_{1}^{\alpha_{1}}\cdots\mathrm{x}_{n}^{\alpha_{n}}$ , where $a_{k,\alpha}\in\mathbb{R}$ . Suppose there exists a rational tentacle $T:=T_{K,\varphi}$ such that $T\subseteq S(f_{1},\ldots,f_{s})$ and $D(\varphi)=z$ .

We are going to show that there exists a point $x\in\mathrm{int}(K)$ and an open neighborhood $U$ of $x$ such that the component of $\hat{f}_{i}(\uplambda,x)=f_{i}(\varphi_{1}x_{1},\ldots,\varphi_{n}x_{n})=\sum_{\alpha}c_{i,\alpha}x^{\alpha}\varphi_{1}^{\alpha_{1}}\cdots\varphi_{n}^{\alpha_{n}}$ with the highest degree is $\hat{L}_{z}(f_{i})(\uplambda,x)=\sum_{\langle\alpha,z\rangle=\delta_{i}}c_{i,\alpha}x^{\alpha}\varphi_{1}^{\alpha_{1}}\cdots\varphi_{n}^{\alpha_{n}}$ for all $i=1,\ldots,s$ and $x\in U$ , where $\delta_{i}=\max\left\{\langle\alpha,z\rangle:c_{i,\alpha}\neq 0\right\}$ . In other words, we have to show that $\deg_{\uplambda}\left(\hat{L}_{z}(f_{i})(\uplambda,x)\right)=\delta_{i}$ for all $i=1,\ldots,s$ and all $x\in U$ . There are polynomials $h_{1,\alpha},h_{2}\in\mathbb{R}[\uplambda]$ such that $\hat{L}_{z}(f_{i})(\uplambda,x)$ can be rewritten as

[TABLE]

Let $m_{\alpha}$ be the leading coefficient of $h_{1,\alpha}$ if no other $h_{1,\alpha^{\prime}}$ ( $c_{i,\alpha^{\prime}}\neq 0$ ), appearing in the sum above, has a higher degree. Otherwise, we set $m_{\alpha}=0$ . The only situation in which the degree of $\hat{L}_{z}(f_{i})(\uplambda,x)$ in $\uplambda$ is smaller than $\delta_{i}$ , is the one, where $\sum_{\alpha}c_{i,\alpha}m_{\alpha}x^{\alpha}=0$ . Since not all $m_{\alpha}$ can vanish, the sum $\sum_{\alpha}c_{i,\alpha}m_{\alpha}\mathrm{x}_{1}^{\alpha_{1}}\cdots\mathrm{x}_{n}^{\alpha_{n}}$ interpreted as an element of $\mathbb{R}[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}]$ is not the zero polynomial. Since $\mathrm{int}(K)$ is not empty, we can find a point $x\in\mathrm{int}(K)$ such that $\sum_{\alpha}c_{i,\alpha}m_{\alpha}x^{\alpha}\neq 0$ . Additionally, we can find a neighborhood $U_{i}$ of $x$ , where $\sum_{\alpha}c_{i,\alpha}m_{\alpha}y^{\alpha}\neq 0$ for all $y\in U_{i}$ . We just proved that for every $y\in U_{i}$ , the degree of $\hat{L}_{z}(f_{i})(\uplambda,y)$ is exactly $\delta_{i}$ .

Let us construct the following subset $U$ of $\mathrm{int}(K)$ : Start with a point $x_{1}\in\mathrm{int}(K)$ and an open neighborhood $U_{1}$ of $x_{1}$ such that $\deg_{\uplambda}\left(\hat{L}_{z}(f_{1})(\uplambda,y)\right)=\delta_{1}$ for all $y\in U_{1}$ . Since $U_{1}$ is open, we can find another point $x_{2}\in U_{1}$ and an open neighborhood $U_{2}\subseteq U_{1}$ of $x_{2}$ such that $\deg_{\uplambda}\left(\hat{L}_{z}(f_{2})(\uplambda,y)\right)=\delta_{2}$ for all $y\in U_{2}$ . By repeating this procedure for the remaining polynomials $f_{3},\ldots,f_{s}$ , we get the open neighborhoods $U_{3},\ldots,U_{s}$ . Set $U=\bigcap_{k=1}^{s}U_{k}$ . Hence $\deg_{\uplambda}\left(\hat{L}_{z}(f_{i})(\uplambda,x)\right)=\delta_{i}$ for all $i=1,\ldots,s$ and all $x\in U$ .

Fix a point $x^{\prime}\in U$ with $x^{\prime}_{k}\neq 0$ for all $k=1,\ldots,n$ and consider again the rational fraction $\hat{L}_{z}(f_{i})(\uplambda,x^{\prime})$ and $\hat{f}_{i}(\uplambda,x^{\prime})$ . Since $T\subseteq S(f_{1},\ldots,f_{s})$ , there exists a $\lambda_{i}\geq 1$ such that

•

$\hat{f}_{i}(\lambda,x^{\prime})>0$

•

$\hat{L}_{z}(f_{i})(\lambda,x^{\prime})>0$

•

$\varphi_{1}(\lambda),\ldots,\varphi_{n}(\lambda)\neq 0$

•

$\varphi_{1}(\lambda),\ldots,\varphi_{n}(\lambda)$ is defined

for all $\lambda\geq\lambda_{i}$ . If we take a small neighborhood $\widetilde{U}_{i}\subseteq U$ of $x^{\prime}$ , the inequalities $\hat{f}_{i}(\lambda,y)>0$ and $\hat{L}_{z}(f_{i})(\lambda,y)>0$ will still hold for all $\lambda\geq\lambda_{i}$ and all $y\in\widetilde{U}_{i}$ : To be more precise, we take an open neighborhood $\widetilde{U}_{i}\subseteq U$ of $x^{\prime}$ , such that for every point $y\in\widetilde{U}_{i}$ no component of $y$ vanishes. According to Corollary 2.4 we can choose $\widetilde{U}_{i}$ so small that $\hat{f}_{i}(\lambda,y)$ and $\hat{L}_{z}(f_{i})(\lambda,y)$ have no poles or roots for all $\lambda\geq\lambda_{i}$ and all $y\in\widetilde{U}_{i}$ . Thus if $\widetilde{U}_{i}$ is small enough, all real roots of $\hat{f}_{i}(\lambda,y)$ and $\hat{L}_{z}(f_{i})(\lambda,y)$ will be smaller than $\lambda_{i}$ for $y\in\widetilde{U}_{i}$ and therefore $\hat{L}_{z}(f_{i})(\lambda,y)$ resp. $\hat{f}_{i}(\lambda,y)$ will be positive for all $\lambda\geq\lambda_{i}$ and all $y\in\widetilde{U}_{i}$ . Thus if $\widetilde{U}_{i}$ is a small neighborhood of $x^{\prime}$ , we get $\hat{f}_{i}(\lambda,y)>0$ and $\hat{L}_{z}(f_{i})(\lambda,y)>0$ for all $\lambda\geq\lambda_{i}$ and all $y\in\widetilde{U}_{i}$ . Set $\widetilde{U}=\bigcap_{i=1}^{s}\widetilde{U}_{i}$ , $\lambda^{\prime}=\max_{i\in\{1,\ldots,s\}}\lambda_{i}$ and consider the map $\psi:\widetilde{U}\rightarrow\mathbb{R}^{n},u\mapsto(\varphi_{1}(\lambda^{\prime})u_{1},\ldots,\varphi_{n}(\lambda^{\prime})u_{n})$ . So far, we have shown that $\psi\left(\widetilde{U}\right)\subseteq S(f_{1},\ldots,f_{s})$ and $\psi\left(\widetilde{U}\right)\subseteq S(L_{z}(f_{1}),\ldots,L_{z}(f_{s}))$ . Since $\mathrm{int}\left(\psi\left(\widetilde{U}\right)\right)\neq\varnothing$ , it is clear that both $S(f_{1},\ldots,f_{s})$ and especially $S(L_{z}(f_{1}),\ldots,L_{z}(f_{s}))$ are Zariski-dense in $\mathbb{R}^{n}$ . $\boxempty$

Theorem 2.9.

Let $A=\bigoplus_{d\in\mathbb{Z}}A_{d}^{(z)}$ be a $z$ -grading and $M$ a finitely generated quadratic module in $A$ .

If for a set of generators $f_{1},\ldots,f_{s}$ of $M$ the set $S(L_{z}(f_{1}),\ldots,L_{z}(f_{s}))\subseteq\mathbb{R}^{n}$ is Zariski dense, then $M$ is totally stable with respect to the $z$ -grading. If $M$ is closed under multiplication, then total stability implies the Zariski denseness for any finite set of generators of $M$ .

Proof: See [Ne, Theorem 4.3]. $\boxempty$

Theorem 2.10.

Let $f_{1},\ldots,f_{s}$ be polynomials in the graded polynomial algebra $A=\bigoplus_{d\in\mathbb{Z}}A_{d}^{(z)}$ , where $z\in\mathbb{Z}^{n}$ . If the set $S(f_{1},\ldots,f_{s})\subseteq\mathbb{R}^{n}$ contains some rational tentacle $T_{K,\varphi}$ , then the quadratic module $M=\mathrm{QM}(f_{1},\ldots,f_{s})$ is totally stable with respect to the $z$ -grading. If $M$ is closed under multiplication, then $S(f_{1},\ldots,f_{s})$ must contain a tentacle $T_{K,z}$ for $M$ to be totally stable.

Proof: Combine Proposition 2.8 and Theorem 2.9. $\boxempty$

Remark 2.11.

If someone is interested in stability and the quadratic module $M=\mathrm{QM}(f_{1},\ldots,f_{s})$ is closed under multiplication, then there is no point using Theorem 2.10 over Theorem 1.6. In general, however, this is not true as the next remark will illustrate it. Another advantage of the tentacle is that it is more flexible than an ordinary tentacle. A tentacle may loose its property of being a tentacle even by small manipulations, while it is harder doing so with respect to a rational tentacle.

Remark 2.12.

Stability under isomorphism: Let $\chi:\mathbb{R}^{n}\xrightarrow{\,\smash{\raisebox{-2.15277pt}{$ \scriptstyle\sim $}}\,}\mathbb{R}^{n}$ be given by a matrix in $\mathrm{GL}_{n}$ . Consider a basic closed semi-algebraic set $S=S(f_{1},\ldots,f_{s})\subseteq\mathbb{R}^{n}$ . Set $S^{\prime}=\chi(S)$ . Then we have $S^{\prime}=S\left(f_{1}\circ\chi^{-1},\ldots,f_{s}\circ\chi^{-1}\right)$ . Hence $S^{\prime}$ is again a semi-algebraic set. In the following we write $\chi_{i}=\sum_{j}a_{ij}\mathrm{x}_{j}$ , where $a_{ij}\in\mathbb{R}$ . Reprise that $\mathcal{T}_{1}:=\mathcal{T}(S)$ resp. $\mathcal{T}_{2}:=\mathcal{T}\left(S^{\prime}\right)$ is the set of all rational tentacles $T$ such that $T\subseteq S$ resp. $T\subseteq S^{\prime}$ . Let $D_{1}$ resp. $D_{2}$ denote the set of all degrees of all tentacles in $\mathcal{T}_{1}$ resp. $\mathcal{T}_{2}$ . We are interested in the relationship between $D_{1}$ and $D_{2}$ . Take a rational tentacle $T:=T_{K,\varphi}\in\mathcal{T}_{1}$ of degree $z\in\mathbb{Z}^{n}$ . For the sake of simplicity let us assume that $\varphi$ is defined on the set $[1,\infty)$ . Let the $i$ -th component of $\chi\left(\varphi_{1}\mathrm{x}_{1},\ldots,\varphi_{n}\mathrm{x}_{n}\right)$ be given by

[TABLE]

Choose a point $x\in\mathrm{int}(K)$ that satisfies the following two conditions:

•

For each $i=1,\ldots,n$ the $i$ -th component of $x$ and $\chi(x)$ does not vanish.

•

For each $i=1,\ldots,n$ the degree of $\chi_{i}\left(\varphi_{1}x_{1},\ldots,\varphi_{n}x_{n}\right)\in\mathbb{R}[\uplambda]$ is equal to $\tilde{z}_{i}:=\max\left\{z_{j}:a_{ij}\neq 0,j=1,\ldots,n\right\}$ .

Set $\tilde{\varphi}_{i}=\sum_{j}a_{ij}x_{i}^{-1}\varphi_{j}x_{j}$ . There is a small neighborhood $U\subseteq\mathrm{int}(K)$ of $x$ such that $\left(\tilde{\varphi}_{1}(\lambda)y_{1},\ldots,\tilde{\varphi}_{n}(\lambda)y_{n}\right)$ lies in $S^{\prime}$ for all $y\in\overline{U}$ and all $\lambda\geq 1$ . Let us prove this assertion. Set $\tilde{a}_{ij}=a_{ij}x_{i}^{-1}y_{i}$ . Thus every $y\in U$ defines a small perturbation of the coefficients $a_{ij}$ . Hence we can write $\tilde{a}_{ij}=a_{ij}+\varepsilon_{ij}$ , where $\varepsilon_{ij}\in\mathbb{R}$ . Note that $\varepsilon_{ij}=0$ if $a_{ij}=0$ . Hence for small $\varepsilon_{ij}$ we get $\sum_{j}\tilde{a}_{ij}\varphi_{j}(\lambda)x_{j}=\sum_{j}a_{ij}\varphi_{j}(\lambda)x_{j}+\sum_{j}\varepsilon_{ij}\varphi_{j}(\lambda)x_{j}\geq 0$ for all $i=1,\ldots,n$ and all $\lambda\geq 1$ . Therefore

[TABLE]

By the continuity of $\chi$ , an open neighborhood $U^{\prime}\subseteq U$ of $x$ can be found such that

[TABLE]

Now, the identity

[TABLE]

proves that $T^{\prime}$ is in $\mathcal{T}_{2}$ .

The degree of $T^{\prime}$ is given by $\left(\deg\left(\tilde{\varphi}_{1}\right),\ldots,\deg\left(\tilde{\varphi}_{n}\right)\right)$ , which is nothing more than

$\tilde{z}:=\left(\tilde{z}_{1},\ldots,\tilde{z}_{n}\right)$ . This gives us a map $\xi_{1}:D_{1}\rightarrow D_{2},z\mapsto\tilde{z}$ . On the other hand, we can start with a tentacle $T^{\prime}\in\mathcal{T}_{2}$ and repeat the same argumentation done so far by replacing $\chi$ with $\chi^{-1}$ . This gives us a map $\xi_{2}:D_{2}\rightarrow D_{1}$ .

If $\chi=\mathrm{id}_{\mathbb{R}^{n}}$ then it is clear that $\xi_{1}$ and $\xi_{2}$ are the identity maps. Suppose $\chi\in\mathrm{GL}_{n}$ is not the identity map. Even under this circumstances neither $\xi_{1}$ nor $\xi_{2}$ need to be linear or inverse to each other.

By using Theorem 2.10 we see that if $M=\mathrm{QM}(f_{1},\ldots,f_{s})$ is stable with respect to a $z$ -grading, then $M^{\prime}=\mathrm{QM}\left(f_{1}\circ\chi^{-1},\ldots,f_{s}\circ\chi^{-1}\right)$ is stable with respect to a $\xi_{1}(z)$ -grading. On the other hand, if $M^{\prime}$ is stable with respect to a $z^{\prime}$ -grading, then $M$ is stable with respect to a $\xi_{2}(z^{\prime})$ -grading. Note that this result is impossible by just using the ordinary tentacle defined in 1.5 and Theorem 1.6.

3 Tentacles and the S4-conjecture

Let $n$ be a natural number. For a subset $I\subseteq\{1,\ldots,n\}$ we define an involution $\pi_{I}:\mathbb{R}^{n}\rightarrow\mathbb{R}^{n}$ by $\pi_{I}(x)=x^{\prime}$ where $x_{j}^{\prime}=x_{j}$ for $j\notin I$ and $x_{j}^{\prime}=-x_{j}$ for $j\in I$ . This kind of maps form a group $G$ . Furthermore every $\pi\in G$ maps a rational tentacle $T_{K,\varphi}$ to another rational tentacle $\pi_{I}(T_{K,\varphi})=T_{\pi_{I}(K),\varphi}$ . Let $f$ be a polynomial in $A$ . In the following we denote by $L_{\mathrm{lex}}(f)$ the leading term of $f$ with respect to the lexicographical ordering. Finally, set $\mathbb{N}^{n}_{1}:=\left\{z\in\mathbb{N}^{n}:\forall i\in\{1,\ldots,n\}:z_{1}\geq z_{i}\right\}$ . Then we can state:

Theorem 3.1.

Let $S_{1}=S(q)$ and $S_{2}=S(p)$ be two semi-algebraic sets in $\mathbb{R}^{n}$ . Suppose the following conditions are satisfied:

(a)

We have $L_{\mathrm{lex}}(p)\notin L_{\mathrm{lex}}(q)A$ . 2. (b)

For every $z\in\mathbb{N}^{n}_{1}$ there exists a rational tentacle $T\in\mathcal{T}(S_{1})$ of degree $z$ and an element $\pi\in G$ such that $\pi(T)\notin\mathcal{T}(S_{2})$ . Furthermore, all unbounded $T^{\prime}\in\mathcal{T}(S_{1})$ with $\pi(T^{\prime})\subseteq\pi(T)$ satisfy $\pi(T^{\prime})\notin\mathcal{T}(S_{2})$ .

Then there is no non-negative polynomial $t\in A$ such that $p(y)-t(y)q(y)\geq 0$ for all $y\in\mathbb{R}^{n}$ .

Proof: Without loss of generality we can assume that $S_{1}\subseteqq S_{2}$ . Suppose that there exists a non-negative polynomial $t\in A$ such that $p(y)-t(y)q(y)\geq 0$ for all $y\in\mathbb{R}^{n}$ . The prove is divided in several steps:

(i): Let $L_{\mathrm{lex}}(q)=a_{\alpha}\mathrm{x}_{1}^{\alpha_{1}}\cdots\mathrm{x}_{n}^{\alpha_{n}}$ , $L_{\mathrm{lex}}(p)=b_{\beta}\mathrm{x}_{1}^{\beta_{1}}\cdots\mathrm{x}_{n}^{\beta_{n}}$ and $L_{\mathrm{lex}}(t)=d_{\gamma}\mathrm{x}_{1}^{\gamma_{1}}\cdots\mathrm{x}_{n}^{\gamma_{n}}$ denote the leading terms of $q$ , $p$ , and $t$ with respect to the lexicographical ordering. Furthermore, we define $I(q)=\left\{\delta\in\mathbb{N}_{0}^{n}:a_{\delta}\neq 0,\delta\neq\alpha\right\}$ , $I(p)=\left\{\delta\in\mathbb{N}_{0}^{n}:b_{\delta}\neq 0,\delta\neq\beta\right\}$ and $I(t)=\left\{\delta\in\mathbb{N}_{0}^{n}:d_{\delta}\neq 0,\delta\neq\gamma\right\}$ . We are going to show that there is a tuple $z\in\mathbb{N}^{n}_{1}$ such that

•

$\langle\alpha,z\rangle>\langle\delta,z\rangle$ for all $\delta\in I(q)$ .

•

$\langle\beta,z\rangle>\langle\delta,z\rangle$ for all $\delta\in I(p)$ .

•

$\langle\gamma,z\rangle>\langle\delta,z\rangle$ for all $\delta\in I(t)$ .

Let us start with $z^{\prime}=(1,\ldots,1)$ . Consider the set $N(z^{\prime})=\left\{\delta\in I(t):\langle\gamma,z^{\prime}\rangle\leq\langle\delta,z^{\prime}\rangle\right\}\cup\left\{\delta\in I(p):\langle\beta,z^{\prime}\rangle\leq\langle\delta,z^{\prime}\rangle\right\}\cup\left\{\delta\in I(q):\langle\alpha,z^{\prime}\rangle\leq\langle\delta,z^{\prime}\rangle\right\}$ and the numbers

[TABLE]

and $r(z^{\prime})=\max\{r_{1}(z^{\prime}),r_{2}(z^{\prime}),r_{3}(z^{\prime})\}$ . Suppose that $r(z^{\prime})=r_{1}(z^{\prime})$ . We see that $n>r_{1}$ , since $\gamma\succ_{\mathrm{lex}}\delta$ for all $\delta\in N(z^{\prime})\cap I(t)$ with respect to the lexicographical ordering. Now take $\delta\in N(z^{\prime})\cap I(t)$ whose components $1,\ldots,r_{1}(z^{\prime}):=r_{1}$ are identical to those of $\gamma$ . Since $\gamma\succ_{\mathrm{lex}}\delta$ , the inequality $\gamma_{r_{1}+1}>\delta_{r_{1}+1}$ must hold. Now we can enlarge the $r_{1}+1$ -th component of $z^{\prime}$ in such a way that $\langle\gamma,z^{\prime}\rangle>\langle\delta,z^{\prime}\rangle$ . In fact, we can achieve $\langle\gamma,z^{\prime}\rangle>\langle\delta,z^{\prime}\rangle$ for all $\delta\in N(z^{\prime})\cap I(t)$ whose first $r_{1}$ components are identical to those of $\gamma$ . If $r_{2}(z^{\prime})=r_{1}(z^{\prime})$ or $r_{3}(z^{\prime})=r_{1}(z^{\prime})$ we, if necessary, enlarge the $r_{1}+1$ -th component of $z^{\prime}$ further such that both inequalities $\langle\beta,z^{\prime}\rangle>\langle\delta_{1},z^{\prime}\rangle$ , $\langle\alpha,z^{\prime}\rangle>\langle\delta_{2},z^{\prime}\rangle$ hold for all $\delta_{1}\in N(z^{\prime})\cap I(p)$ , $\delta_{2}\in N(z^{\prime})\cap I(q)$ whose $1,\ldots,r_{1}$ components are identically to whose of $\beta$ resp. $\alpha$ . If $r(z^{\prime})=r_{2}(z^{\prime})$ or $r(z^{\prime})=r_{3}(z^{\prime})$ just use the same argumentation again. That is, replace $r_{1}(z^{\prime})$ by $r_{2}(z^{\prime})$ or $r_{3}(z^{\prime})$ and simply repeat the argumentation done in this matter.

Let $z^{\prime\prime}$ denote $z^{\prime}$ with the enlarged $r_{1}+1$ -th component and consider $N(z^{\prime\prime})$ resp. $r(z^{\prime\prime})$ . Then it is clear that $r(z^{\prime\prime})\leq r(z^{\prime})-1$ . Start over again with the new data $N(z^{\prime\prime})$ and $r(z^{\prime\prime})$ and notice that after each finished repetition the value $r(z^{\prime\prime})$ will decrease at least by one.

Thus after $k\leq r(z^{\prime})$ repetitions we finally get a tuple $z:=z^{(k)}$ that will satisfy $N(z)=\varnothing$ resp. $r(z)=0$ , which is the same thing as saying that $z$ will satisfy all three inequalities mentioned above. It is obvious that we can choose $z$ in such a way that the first component is the largest. To be more precise, if the first component of $z$ is not the largest, then we can enlarge it without violating the three inequalities.

(ii): According to condition (b) in the theorem, there exists a rational tentacle $T\in\mathcal{T}(S_{1})$ of degree $z$ and an involution $\pi\in G$ such that all statements in (b) are satisfied. Write $T_{K,\varphi}$ for $T$ , where as usual $K\subseteq\mathbb{R}^{n}$ is a compact set with non-empty interior and $\varphi\in\mathbb{R}(\uplambda)^{n}$ . For any polynomial $f\in A$ , we define $\hat{f}(\uplambda,x^{\prime})$ to be the rational fraction $\hat{f}(\uplambda,x^{\prime})=f\left(\varphi_{1}(\uplambda)x_{1}^{\prime},\ldots,\varphi_{n}(\uplambda)x_{n}^{\prime}\right)\in\mathbb{R}(\uplambda)$ , where $x^{\prime}\in\mathrm{int}(K)$ . We know from condition (b) and (a) that there is a point $x^{\prime}\in\mathrm{int}(K)$ with $x_{i}^{\prime}\neq 0$ for all $i=1,\ldots,n$ such that

[TABLE]

resp.

[TABLE]

holds. Set $x=\pi(x^{\prime})$ and $\varrho(\hat{q},x)=\left\{\lambda\in\mathbb{R}_{\geq 1}:\hat{q}(\lambda,x)\geq 0,\hat{q}(\lambda,x)\,\text{is defined}\right\}$ .

We are going to exclude that $\varrho(\hat{p},x)$ is unbounded. Suppose the opposite would be the case. Then there exists a $\lambda^{\prime}\geq 1$ such that the rational fractions $\varphi_{1},\ldots,\varphi_{n}$ have no poles and no roots, and that $\hat{p}(\lambda,x)$ is positive for all $\lambda\geq\lambda^{\prime}$ .

By taking a small neighborhood $U\subseteq\pi(K)$ of $x$ we can make sure that $\hat{p}(\lambda,y)$ will be positive for all $\lambda\geq\lambda^{\prime}$ and all $y\in U$ . Next, we define the rational fractions $\hat{\varphi}_{i}=\varphi_{i}(\uplambda+\uplambda^{\prime}-1)$ for $i=1,\ldots,n$ . Set $\hat{\varphi}=(\hat{\varphi}_{1},\ldots,\hat{\varphi}_{n})$ . Then the rational tentacle $T_{\overline{U},\hat{\varphi}}$ lies in $\mathcal{T}(S_{2})$ . According to our construction, $\pi^{-1}\left(T_{\overline{U},\hat{\varphi}}\right)$ is a subset of $T_{K,\varphi}$ : This follows from $q(\hat{\varphi}_{1}(\lambda)y_{1}^{\prime},\ldots,\hat{\varphi}_{n}(\lambda)y_{n}^{\prime})=\hat{q}(\lambda+\lambda^{\prime}-1,y^{\prime})\geq 0$ for all $\lambda\geq 1$ and $y^{\prime}\in\pi^{-1}\left(\overline{U}\right)\subseteq K$ . Furthermore, this implies that $T_{\overline{U},\hat{\varphi}}$ is a subset of $\pi(T_{K,\varphi})$ . Hence it is easy to see that while $T_{\overline{U},\hat{\varphi}}$ lies in $\mathcal{T}(S_{2})$ , the other rational tentacle $\pi^{-1}\left(T_{\overline{U},\hat{\varphi}}\right)$ lies in $\mathcal{T}(S_{1})$ . Since the degree of $T_{K,\varphi}$ is $z$ , we get $\lim_{\lambda\rightarrow\infty}|\varphi_{1}(\lambda)|=\infty$ . Thus $\lim_{\lambda\rightarrow\infty}|\hat{\varphi}_{1}(\lambda)|=\infty$ and therefore $T_{\overline{U},\hat{\varphi}}$ is not bounded. But that contradicts (b). In fact, the rational tentacles $T_{\overline{U},\hat{\varphi}}$ and $\pi^{-1}\left(T_{\overline{U},\hat{\varphi}}\right)$ must behave like depicted in Figure 1 for a suitable $\lambda^{\prime}\geq 1$ and neighborhood $U\subseteq\pi(K)$ of $x$ . In other words, we just saw that $S_{2}\backslash S_{1}$ is just too small to contain a rational tentacle that would allow $\varrho(\hat{p},x)$ to be unbounded.

(iii): In (ii) we showed that $\varrho(\hat{p},x)$ is bounded resp. that $\varrho(-\hat{p},x)$ is unbounded. Now, we want the same thing for $\varrho(\hat{q},x)$ resp. $\varrho(-\hat{q},x)$ . Without loss of generality, we can assume that $\pi(T)$ is not contained in $\mathcal{T}(S_{1})$ . Otherwise, we get $\hat{p}(\lambda,x)-\hat{t}(\lambda,x)\hat{q}(\lambda,x)<0$ for $\lambda>0$ big enough and therefore we are done. Thus $\varrho(-\hat{q},x)$ must be unbounded, since an infinite part of $\pi(T)$ must lie in $\mathbb{R}^{n}\backslash S_{1}$ .

(iv): We know that $\varrho(-\hat{q},x)$ and $\varrho(-\hat{p},x)$ are unbounded. Thus there exists a real number $\lambda_{0}\geq 1$ such that $\hat{p}(\lambda,x)$ and $\hat{q}(\lambda,x)$ are defined for all $\lambda\geq\lambda_{0}$ . There is a positive real number $\hat{\lambda}\geq\lambda_{0}$ such that $\hat{L}_{\mathrm{lex}}(p)(\lambda,x)<0$ , $\hat{L}_{\mathrm{lex}}(q)(\lambda,x)<0$ and $\hat{p}(\lambda,x)-\hat{t}(\lambda,x)\hat{q}(\lambda,x)>0$ for all $\lambda\geq\hat{\lambda}$ . This implies $|\hat{L}_{\mathrm{lex}}(p)(\lambda,x)|<|\hat{L}_{\mathrm{lex}}(t)(\lambda,x)\hat{L}_{\mathrm{lex}}(q)(\lambda,x)|$ for all $\lambda\geq\hat{\lambda}$ , if $\hat{\lambda}$ is large enough.

The same inequality $|\hat{L}_{\mathrm{lex}}(p)(\lambda,x^{\prime})|<|\hat{L}_{\mathrm{lex}}(t)(\lambda,x^{\prime})\hat{L}_{\mathrm{lex}}(q)(\lambda,x^{\prime})|$ holds for $x^{\prime}=\pi^{-1}(x)$ and all $\lambda\geq\hat{\lambda}$ . But here we have $\hat{L}_{\mathrm{lex}}(p)(\lambda,x^{\prime})>0$ , $\hat{L}_{\mathrm{lex}}(q)(\lambda,x^{\prime})>0$ for all $\lambda\geq\hat{\lambda}$ . Thus $\hat{L}_{\mathrm{lex}}(p)(\lambda,x^{\prime})-\hat{L}_{\mathrm{lex}}(t)(\lambda,x^{\prime})\hat{L}_{\mathrm{lex}}(q)(\lambda,x^{\prime})<0$ for all $\lambda\geq\hat{\lambda}$ . If we choose an appropriately large $\lambda\geq\lambda_{0}$ , we will get $\hat{p}(\lambda,x)-\hat{t}(\lambda,x)\hat{q}(\lambda,x)<0$ . However, this contradicts our assumption that $p(y)-t(y)q(y)\geq 0$ for all $y\in\mathbb{R}^{n}$ . $\boxempty$

Proposition 3.2.

Let $S_{1}=S(q)$ and $S_{2}=S(p)$ be two semi-algebraic sets in $\mathbb{R}^{n}$ . Suppose the following conditions are satisfied:

(a)

We have $L_{\mathrm{lex}}(p)\notin L_{\mathrm{lex}}(q)A$ . 2. (b)

The quadratic modules $\mathrm{QM}(q),\mathrm{QM}(-p)$ are totally stable with respect to any $z$ -grading in $\mathbb{N}_{1}^{n}$ and neither $q=0$ nor $p=0$ .

Then there is no non-negative polynomial $t\in A$ such that $p(y)-t(y)q(y)\geq 0$ for all $y\in\mathbb{R}^{n}$ .

Proof: Let us start where part (i) in the proof of Theorem 3.1 ended. Unfortunately we need some new arguments, since condition (b) of Proposition 3.2 differs from that of Theorem 3.1. This is where the new part (ii’) comes in. It serves as a link between part (i) and (ii) of Theorem 3.1, with the purpose that we can use the arguments already developed in the preceding theorem. For the sake of simplicity let us assume that $S_{1}\subseteqq S_{2}$ .

(ii’): Let $z\in\mathbb{N}_{1}^{n}$ be same tuple we used in part (ii). According to Theorem 1.6, we can find two compact sets $K,K^{\prime}\subseteq\mathbb{R}^{n}$ with non-empty interior such that $T_{K,z}\in\mathcal{T}_{0}(S(q))$ and $T_{K^{\prime},z}\in\mathcal{T}_{0}(S(-p))$ . Furthermore, we can find two points $x\in\mathrm{int}(K)$ and $x^{\prime}\in\mathrm{int}(K)$ with non-vanishing components. Note that $\hat{L}_{\mathrm{lex}}(q)(\uplambda,x)=a_{\alpha}x^{\alpha}\uplambda^{\langle\alpha,z\rangle}$ , $\hat{L}_{\mathrm{lex}}(q)(\uplambda,x^{\prime})=a_{\alpha}x^{\prime\alpha}\uplambda^{\langle\alpha,z\rangle}$ , $\hat{L}_{\mathrm{lex}}(p)(\uplambda,x)=b_{\beta}x^{\beta}\uplambda^{\langle\beta,z\rangle}$ and $\hat{L}_{\mathrm{lex}}(p)(\uplambda,x^{\prime})=b_{\beta}x^{\prime\beta}\uplambda^{\langle\beta,z\rangle}$ . It is obvious that there are two positive real numbers $\lambda_{1}$ and $\lambda_{2}$ such that

[TABLE]

holds for all $\lambda\geq\lambda_{1}$ resp.

[TABLE]

holds for all $\lambda\geq\lambda_{2}$ .

Set $M(\alpha)=\left\{i:\mathrm{sgn}\left(x_{i}^{\alpha_{i}}\right)\neq\mathrm{sgn}\left(x_{i}^{\prime\alpha_{i}}\right)\right\}$ and $M(\beta)=\left\{i:\mathrm{sgn}\left(x_{i}^{\beta_{i}}\right)\neq\mathrm{sgn}\left(x_{i}^{\prime\beta_{i}}\right)\right\}$ . Then both sets are not empty, because otherwise we would get

[TABLE]

or

[TABLE]

for all $\lambda\geq 1$ , which would result in a contradiction. The intersection $M(\alpha)\cap M(\beta)$ is not empty, since $S(q)\subseteq S(p)$ :

Suppose that the intersection would be empty. Take an element $i\in M(\beta)$ . Set $y_{1}=(1,\ldots,1,x_{i},1,\ldots,1)$ and $y_{2}=(1,\ldots,1,x_{i}^{\prime},1,\ldots,1)$ . Then $\hat{L}_{\mathrm{lex}}(p)(\lambda,y_{1})$ and $\hat{L}_{\mathrm{lex}}(p)(\lambda,y_{2})$ have different signs for all $\lambda\geq 1$ , while $\hat{L}_{\mathrm{lex}}(q)(\lambda,y_{1})$ and $\hat{L}_{\mathrm{lex}}(q)(\lambda,y_{2})$ have the same sign for all $\lambda\geq 1$ . Thus $\hat{q}(\lambda,y_{1})$ and $\hat{q}(\lambda,y_{2})$ are both negative or positive, while $\hat{p}(\lambda,y_{1})$ and $\hat{p}(\lambda,y_{2})$ have different signs for all $\lambda$ large enough. This can only work if $\hat{q}(\lambda,y_{1})$ and $\hat{q}(\lambda,y_{2})$ are negative for all sufficiently large $\lambda$ . Thus $a_{\alpha}<0$ . On the other side, we get $b_{\beta}>0$ by repeating the same arguments with $j\in M(\alpha)$ . It is not hard to see that this cannot work. Set

[TABLE]

In fact, $\hat{q}(\lambda,\tilde{y})$ is positive for all $\lambda>1$ large enough, while $\hat{p}(\lambda,\tilde{y})$ is negative, since $a_{\alpha}\tilde{y}^{\alpha}>0$ and $b_{\beta}\tilde{y}^{\beta}<0$ . But this contradicts $S(q)\subseteq S(p)$ . Take a natural number $k$ out of the set $M(\alpha)\cap M(\beta)$ and let $\pi_{k}$ denote the map $\mathbb{R}^{n}\rightarrow\mathbb{R}^{n},(x_{1},\ldots,x_{k},\ldots,x_{n})\mapsto(x_{1},\ldots,-x_{k},\ldots,x_{n})$ . Then we can find a two positive real numbers $c_{1}$ and $c_{2}$ such that the following two equations $c_{1}\hat{L}_{\mathrm{lex}}(q)(\uplambda,\pi_{k}(x^{\prime}))=\hat{L}_{\mathrm{lex}}(q)(\uplambda,x)$ and $c_{2}\hat{L}_{\mathrm{lex}}(p)(\uplambda,\pi_{k}(x^{\prime}))=\hat{L}_{\mathrm{lex}}(p)(\uplambda,x)$ hold. Hence it is easy to see that there is a positive real number $\lambda_{1}^{\prime}$ such that

[TABLE]

for all $\lambda\geq\lambda_{1}^{\prime}$ . On the other side, we still have

[TABLE]

for all $\lambda\geq\lambda_{2}$ . We are now ready to construct the rational tentacles needed for the second part of Theorem 1.6. For each $i=1,\ldots,n$ we define $\varphi_{i}=(\uplambda+\uplambda_{3}-1)^{z_{i}}\in\mathbb{R}[\uplambda]$ , where $\lambda_{3}=\max\{\lambda_{1}^{\prime},\lambda_{2}\}$ . As usual set $\varphi=(\varphi_{1},\ldots,\varphi_{n})$ . By what we have done so far and by using the fact that, if we substitute $x$ by some nearby other point $y$ , none of those inequalities used in this proof will be affected (see Proposition 2.3), we see that there is an open neighborhood $U$ of $x$ such that $T_{\overline{U},\varphi}\in\mathcal{T}(S(q))$ and $\pi_{k}(T_{\overline{U},\varphi})\notin\mathcal{T}(S(p))$ . Repeating part (ii)-(iv) of Theorem 1.6 concludes the proof. $\boxempty$

Remark 3.3.

Let $n$ be a natural number greater than $2$ . Suppose $f$ and $g$ are two different irreducible homogeneous polynomials in $\mathbb{R}[\mathrm{x}_{1},\ldots,\mathrm{x}_{n}]$ of odd degree. If $f$ and $g$ satisfy the condition (a) of Theorem 3.1, then it has some interesting geometric consequences for $V_{1}=\mathcal{V}(f)$ and $V_{2}=\mathcal{V}(g)$ . Let $\Lambda_{1}$ resp. $\Lambda_{2}$ denote all singular points of $V_{1}(\mathbb{R})$ resp. $V_{2}(\mathbb{R})$ . According to [Sha I, Theorem 1, p. 239] the intersection $V_{1}(\mathbb{R})\cap V_{2}(\mathbb{R})$ is not empty. Consider a point $x\in V_{1}(\mathbb{R})\cap V_{2}(\mathbb{R})$ . The following cases may occur:

•

The point $x$ is in $\Lambda_{1}$ : If $x$ is a local minimum of $g$ , then is must also a local minimum of $f$ . Thus $x$ is a point in $\Lambda_{2}$ .

•

The point $x$ is not in $\Lambda_{1}$ : It is easy to see that $x$ is a boundary point of $S(g)$ . If $x$ is in $\Lambda_{2}$ , then $x$ is either a local minimum of $f$ or it is a saddle point of $f$ . If $x$ is not in $\Lambda_{2}$ , then $V_{1}$ and $V_{2}$ intersect non-transversely at $x$ .

{window}

[3, r, ,]

Example 3.4.

Consider the polynomials $g=\mathrm{x}_{1}+\mathrm{x}_{2}+\mathrm{x}_{1}\mathrm{x}_{2}^{3}$ and $f=\mathrm{x}_{1}^{5}+\mathrm{x}_{1}^{5}\mathrm{x}_{2}+\mathrm{x}_{2}^{2}$ . Let us check if $f$ and $g$ satisfy all conditions of Theorem 3.1.

By using [Mathematica] as we did in 4.1, we see that $S(g)\subseteq S(f)$ .

(a): Obviously, $L_{\mathrm{lex}}(g)=\mathrm{x}_{1}\mathrm{x}_{2}^{3}$ and $L_{\mathrm{lex}}(f)=\mathrm{x}_{1}^{5}\mathrm{x}_{2}$ . Since $\deg_{\mathrm{x}_{2}}\left(L_{\mathrm{lex}}(g)\right)>\deg_{\mathrm{x}_{2}}\left(L_{\mathrm{lex}}(f)\right)$ , the polynomial $L_{\mathrm{lex}}(f)$ cannot lie in $L_{\mathrm{lex}}(g)\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]$ .

(b): Set $y^{\prime}=(5,5)$ , $y=(5,-5)$ , and consider $\hat{g}(\uplambda,y^{\prime})=g\left(\uplambda^{z_{1}}y_{1}^{\prime},\uplambda^{z_{2}}y_{2}^{\prime}\right)=5\uplambda^{z_{1}}+5\uplambda^{z_{2}}+625\uplambda^{z_{1}+3z_{2}}$ , $\hat{f}(\uplambda,y^{\prime})=f\left(\uplambda^{z_{1}}y_{1}^{\prime},\uplambda^{z_{2}}y_{2}^{\prime}\right)=3125\uplambda^{5z_{1}}+25\uplambda^{2z_{2}}+15625\uplambda^{5z_{1}+z_{2}}$ , $\hat{g}(\uplambda,y)=g\left(\uplambda^{z_{1}}y_{1},\uplambda^{z_{2}}y_{2}\right)=5\uplambda^{z_{1}}-5\uplambda^{z_{2}}-625\uplambda^{z_{1}+3z_{2}}$ , $\hat{f}(\uplambda,y)=f\left(\uplambda^{z_{1}}y_{1},\uplambda^{z_{2}}y_{2}\right)=3125\uplambda^{5z_{1}}+25\uplambda^{2z_{2}}-15625\uplambda^{5z_{1}+z_{2}}$ , where $(z_{1},z_{2})\in\mathbb{Z}^{2}$ . It is easy to see that if we take $(z_{1},z_{2})\in\mathbb{N}_{1}$ , then $\hat{g}(\lambda,y^{\prime}),\hat{f}(\lambda,y^{\prime})$ are positive for all $\lambda\geq 1$ , while $\hat{g}(\lambda,y^{\prime}),\hat{f}(\lambda,y^{\prime})$ are negative for all $\lambda\geq 1$ . By taking a small compact neighborhood $U^{\prime}$ of $y^{\prime}$ resp. $U$ of $y$ , we get a tentacle $T_{U^{\prime},z}$ belonging to $\mathcal{T}(S(g))$ and another one $T_{U,z}$ belonging to $\mathcal{T}(S(-f))$ . Finally, Theorem 3.1 implies that there is no non-negative polynomial $t\in\mathbb{R}[x_{1},x_{2}]$ such that $f(x)-t(x)g(x)\geq 0$ for all $x\in\mathbb{R}^{2}$ .

{window}

[8, r, ,]

Example 3.5.

Let us revisit the counterexample $f=\mathrm{x}_{1}^{3}+\mathrm{x}_{1}^{3}\mathrm{x}_{2}+\mathrm{x}_{2}^{2}$ and $g=\mathrm{x}_{1}+\mathrm{x}_{2}+\mathrm{x}_{1}\mathrm{x}_{2}$ . Unfortunately, condition (a) of Theorem 3.1 is violated. We have $L_{\mathrm{lex}}(g)=\mathrm{x}_{1}\mathrm{x}_{2}$ and $L_{\mathrm{lex}}(f)=\mathrm{x}_{1}^{3}\mathrm{x}_{2}$ . Thus $L_{\mathrm{lex}}(f)=\mathrm{x}_{1}^{2}L_{\mathrm{lex}}(g)$ . On the other side, it is easy to see that the condition (b) of Theorem 3.1 is satisfied. Can we still make use of Theorem 3.1?

Let $t\in\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]$ be a non-negative polynomial. If $L_{\mathrm{lex}}(t)\neq\mathrm{x}_{1}^{2}$ then we can just repeat the arguments of Theorem 3.1. However, the problematic case is if $L_{\mathrm{lex}}(t)=\mathrm{x}_{1}^{2}$ holds. Interchange the variables $\mathrm{x}_{1}$ and $\mathrm{x}_{2}$ in $f$ and $g$ , giving $f=\mathrm{x}_{2}^{3}+\mathrm{x}_{2}^{3}\mathrm{x}_{1}+\mathrm{x}_{1}^{2}$ and $g=\mathrm{x}_{1}+\mathrm{x}_{2}+\mathrm{x}_{1}\mathrm{x}_{2}$ . Now, we are out for some suitable $z$ -grading such that $L_{z}(f)\notin L_{z}(g)\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]$ . Choose $z\in\mathbb{N}_{1}^{2}$ with $3z_{2}=z_{1}$ . Then we have $L_{z}(g)=\mathrm{x}_{1}\mathrm{x}_{2}$ , $L_{z}(f)=\mathrm{x}_{2}^{3}\mathrm{x}_{1}+\mathrm{x}_{1}^{2}$ resp. $\hat{L}_{z}(g)(\uplambda,\mathrm{x})=\mathrm{x}_{1}\mathrm{x}_{2}\uplambda^{z_{1}+z_{2}}$ , $\hat{L}_{z}(f)(\uplambda,\mathrm{x})=\left(\mathrm{x}_{2}^{3}\mathrm{x}_{1}+\mathrm{x}_{1}^{2}\right)\uplambda^{2z_{1}}$ . Obviously, $L_{z}(f)$ does not lie in $L_{z}(g)\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]$ . Let us check condition (b). Set $x=(5,5)$ . Then $\hat{L}_{z}(g)(\uplambda,x)=25\uplambda^{z_{1}+z_{2}}$ and $\hat{L}_{z}(f)(\uplambda,x)=5^{4}\uplambda^{2z_{1}}+5^{2}\uplambda^{2z_{1}}$ . On the other hand, if we set $x=(-5,5)$ we get $\hat{L}_{z}(g)(\uplambda,x)=-25\uplambda^{z_{1}+z_{2}}$ and $\hat{L}_{z}(f)(\uplambda,x)=-5^{4}\uplambda^{2z_{1}}+5^{2}\uplambda^{2z_{1}}$ . This means that if we take an appropriate neighborhood $U$ of $x$ and define $\pi:\mathbb{R}^{2}\rightarrow\mathbb{R}^{2},(x_{1},x_{2})\mapsto(-x_{1},x_{2})$ we get $T_{U,z}\in\mathcal{T}(S(g))$ and $\pi(T_{U,z})\in\mathcal{T}(S(-f))$ . By using the here defined $z$ -grading, instead of the one defined in the proof of Theorem 3.1, and repeating the arguments in part (ii)-(iv) we get that there is no non-negative polynomial $t\in\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]$ such that $f(y)-t(y)g(y)\geq 0$ for all $y\in\mathbb{R}^{2}$ . In fact we can state:

Theorem 3.6.

Let $S_{1}=S(q)$ and $S_{2}=S(p)$ be two semi-algebraic sets in $\mathbb{R}^{n}$ . Suppose the following condition is satisfied:

There is a $z\in\mathbb{N}_{1}^{n}$ such that $L_{z}(p)\notin L_{z}(q)A$ , a rational tentacle $T\in\mathcal{T}(S_{1})$ of degree $z$ and an element $\pi\in G$ such that $\pi(T)\notin\mathcal{T}(S_{2})$ . Furthermore, all unbounded $T^{\prime}\in\mathcal{T}(S_{1})$ with $\pi(T^{\prime})\subseteq\pi(T)$ satisfy $\pi(T^{\prime})\notin\mathcal{T}(S_{2})$ .

Then there is no non-negative polynomial $t\in A$ such that $p(y)-t(y)q(y)\geq 0$ for all $y\in\mathbb{R}^{n}$ .

Proof: Simply repeat the same arguments in part (ii)-(iv) of Theorem 3.1. $\boxempty$

In the same manner we can modify Proposition 3.2:

Proposition 3.7.

Let $S_{1}=S(q)$ and $S_{2}=S(p)$ be two semi-algebraic sets in $\mathbb{R}^{n}$ , where neither $q=0$ nor $p=0$ . Suppose the following condition is satisfied:

There is a $z\in\mathbb{N}_{1}^{n}$ such that $L_{z}(p)\notin L_{z}(q)A$ and the quadratic modules $\mathrm{QM}(q)$ and $\mathrm{QM}(-p)$ are totally stable with respect to the $z$ -grading.

Then there is no non-negative polynomial $t\in A$ such that $p(y)-t(y)q(y)\geq 0$ for all $y\in\mathbb{R}^{n}$ .

Remark 3.8.

The difference between Theorem 3.1 and Theorem 3.6 is simple. In Theorem 3.1 we demanded that $L_{\mathrm{lex}}(p)\notin L_{\mathrm{lex}}(q)A$ . Then we constructed a special $z$ -grading, where $z\in\mathbb{N}_{1}^{n}$ . Under this $z$ -grading we had $L_{z}(p)=L_{\mathrm{lex}}(p)$ resp. $L_{z}(q)=L_{\mathrm{lex}}(q)$ and therefore $L_{z}(p)\notin L_{z}(q)A$ . Let us refer to the $z$ -gradings satisfy $z\in\mathbb{N}_{1}^{n}$ and $L_{z}(p)\notin L_{z}(q)A$ as special gradings. The difference between Theorem 3.6 and Theorem 3.1 is, that the special $z$ -grading constructed in part (i) of the proof of Theorem 3.1, is already given in the prerequisites of Theorem 3.6. The disadvantage of Theorem 3.6 compared to Theorem 3.1 is, that one must find such a special $z$ -grading for Theorem 3.6 to work, while Theorem 3.1 does not require such a procedure. The advantage of Theorem 3.6 is, that it allows a wider range of gradings as Example 3.5 illustrates it. In fact, we have an two explanations why we used $g=\mathrm{x}_{1}\mathrm{x}_{3}+\mathrm{x}_{2}\mathrm{x}_{3}+\mathrm{x}_{1}\mathrm{x}_{2}$ . The first explanation is a geometric one. According to Example 3.5 the polynomial $g$ has all the necessary properties for Theorem 3.6 to work. The second explanation is an algebraic one. According to Theorem 2.5 quadratic forms $q$ that have a vanishing diagonal part give rise to quadratic module $\mathrm{QM}(q)$ that is totally stably with respect to any $z$ -grading. Thus it is (was) convenient to choose $g=\mathrm{x}_{1}\mathrm{x}_{3}+\mathrm{x}_{2}\mathrm{x}_{3}+\mathrm{x}_{1}\mathrm{x}_{2}$ .

4 A non-geometric counterexample

{window}

[2, r, ,] In the last chapter we saw, that Counterexample 6 proved the S4-conjecture wrong because of geometric reasons. The straightforward question is obvious: Can we find for all counterexamples a geometric reason? At least,we will give a counterexample that does not work because of a arithmetical reason. From now on, we set $g=(-3+\mathrm{x}_{1}-\mathrm{x}_{2})(3+\mathrm{x}_{1}-\mathrm{x}_{2})$ , $p=-\mathrm{x}_{1}^{3}+\mathrm{x}_{2}^{3}+2\mathrm{x}_{1}+1$ , $l_{1}=-3+\mathrm{x}_{1}-\mathrm{x}_{2}$ , $l_{2}=3+\mathrm{x}_{1}-\mathrm{x}_{2}$ and $f=-l_{2}p$ . By using [Mathematica] (see 6) it is easy to verify that $S(g)\subseteq S(f)$ . The next obvious step is:

Proposition 4.1.

There is no non-negative polynomial $t\in\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]_{2}$ such that $f(y)-t(y)g(y)\geq 0$ for all $y\in\mathbb{R}^{2}$ .

Proof: Suppose there is a non-negative polynomial $t\in\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2}]_{2}$ contradicting the statement of this proposition. In the following we fix a real number $x_{2}\in\mathbb{R}$ and consider $f(\mathrm{x}_{1},x_{2}),g(\mathrm{x}_{1},x_{2}),t(\mathrm{x}_{1},x_{2})$ as polynomials in $\mathrm{x}_{1}$ . The polynomial $f(\mathrm{x}_{1},x_{2})-t(\mathrm{x}_{1},x_{2})g(\mathrm{x}_{1},x_{2})\in\mathbb{R}[\mathrm{x}_{1}]$ has a root at $x_{1}=x_{2}-3$ . Since $f(\mathrm{x}_{1},x_{2})-t(\mathrm{x}_{1},x_{2})g(\mathrm{x}_{1},x_{2})\in\mathbb{R}[\mathrm{x}_{1}]$ is non-negative for every $x_{2}\in\mathbb{R}$ , it must be divided by $l_{2}^{2}(\mathrm{x}_{1},x_{2})\in\mathbb{R}[\mathrm{x}_{1}]$ .

The remainder of the polynomial division $f(\mathrm{x}_{1},x_{2}):l_{2}^{2}(\mathrm{x}_{1},x_{2})$ , as polynomials in $\mathbb{R}[\mathrm{x}_{1}]$ , is $r_{1}(\mathrm{x}_{1},x_{2})=9x_{2}^{3}-9\mathrm{x}_{1}x_{2}^{2}-52x_{2}^{2}-25\mathrm{x}_{1}x_{2}+97x_{2}^{2}-22\mathrm{x}_{1}-66\in\mathbb{R}[\mathrm{x}_{1}]$ . And for $g(\mathrm{x}_{1},x_{2}):l_{2}^{2}(\mathrm{x}_{1},x_{2})$ we have $r_{2}(\mathrm{x}_{1},x_{2})=6x_{2}-6\mathrm{x}_{1}-18\in\mathbb{R}[\mathrm{x}_{1}]$ . Finally, let $r_{3}(\mathrm{x}_{1},x_{2})\in\mathbb{R}[\mathrm{x}_{1}]$ denote the remainder of $t(\mathrm{x}_{1},x_{2}):l_{2}^{2}(\mathrm{x}_{1},x_{2})$ . Since $f(\mathrm{x}_{1},x_{2})-t(\mathrm{x}_{1},x_{2})g(\mathrm{x}_{1},x_{2})$ is divided by $l_{2}^{2}(\mathrm{x}_{1},x_{2})$ , we get the identity $r_{1}(\mathrm{x}_{1},x_{2})-r_{3}(\mathrm{x}_{1},x_{2})r_{2}(\mathrm{x}_{1},x_{2})=0$ . This leads to $r_{3}(\mathrm{x}_{1},x_{2})=\frac{r_{1}(\mathrm{x}_{1},x_{2})}{r_{2}(\mathrm{x}_{1},x_{2})}=\frac{1}{6}\left(22-25x_{2}+9x_{2}^{2}\right)$ . Set $\tilde{t}=a_{x_{2}}l_{2}^{2}+\frac{1}{6}\left(22-25\mathrm{x}_{2}+9\mathrm{x}_{2}^{2}\right)$ and choose $a_{x_{2}}\in\mathbb{R}$ such that the equality $\tilde{t}(\mathrm{x}_{1},x_{2})=t(\mathrm{x}_{1},x_{2})$ holds for $x_{2}\in\mathbb{R}$ . It is easy to see that the leading term of $f-\tilde{t}g$ in $\mathrm{x}_{2}$ is $\left(-\frac{1}{2}-a_{x_{2}}\right)\mathrm{x}_{2}^{4}$ and that $f(0,0)-\tilde{t}(0,0)g(0,0)=30+81a_{x_{2}}$ . For large $x_{2}\in\mathbb{R}$ we see that $a_{x_{2}}$ must satisfy $a_{x_{2}}\leq-\frac{1}{2}$ and $a_{x_{2}}\geq-\frac{30}{81}$ , which is impossible. $\boxempty$

Remark 4.2.

Under an appropriate change of coordinates the homogenization of $g$ can be written as $\overline{g}=-9\mathrm{x}_{1}^{2}+2\mathrm{x}_{2}^{2}$ . Thus the signature of $\overline{g}$ is [math]. Note, that the signature in Counterexample 4.1 was $-1$ . Furthermore, it is easy to see that neither Theorem 3.1 nor Theorem 3.6 can be applied. In fact, $g$ violates condition (b) in Theorem 3.1 and the condition in Theorem 3.6.

5 Final thoughts

Finally, we have come to the end of this article. Hence let us summarize what we have learned so far. First, we learned that the S4-conjecture is not true. Second, we learned that there are geometric reasons why the S4-conjecture cannot be true. Finally, we learned that there are arithmetic reasons why the S4-conjecture cannot work. Still there are many questions left. The signature of $g$ in 4.1 resp. 4.1 was $-1$ resp. [math]. So it is quite naturally to ask, if there is an counterexample, where $g$ has signature $1$ . There is another obvious question: Under which conditions does the S4-conjecture work? Can these conditions be expressed in geometric or algebraic terms? Results in this direction can be found in [Sch, Corollary 4.5]:

Proposition 5.1.

[Sch, Corollary 4.5]: Let $h_{1},\ldots,h_{r}\in\mathbb{R}[\mathrm{x}_{0},\ldots,\mathrm{x}_{n}]$ be homogeneous polynomials of even degree, and let

[TABLE]

Assume there is $\xi\in\mathbb{R}^{n+1}$ with $h_{i}(\xi)>0$ for $i=1,\ldots,r$ . If $p,q\in\mathbb{R}[\mathrm{x}_{0},\ldots,\mathrm{x}_{n}]$ are homogeneous of even degree and positive on $S\backslash\{0\}$ , then $pq^{m}$ lies in the preordering generated by $h_{1},\ldots,h_{r}$ , for all sufficiently large $m\geq 0$ .

In contrast to the S4-conjecture, we need a homogeneous polynomial $p$ of even degree that is positive on the set $S\backslash\{0\}$ . And even then we can only conclude that there is a natural number $m\geq 1$ such that $p^{m}\in T(h_{1},\ldots,h_{r})$ . If we could show that $p\in T(h_{1},\ldots,h_{r})$ , we would still have considerable obstacles. For example, we do not know what kind of degree bounds the various representations of $p$ in $T(h_{1},\ldots,h_{r})$ have. Nevertheless, let us consider the polynomials $g=\mathrm{x}_{1}^{2}+\mathrm{x}_{2}^{2}-4\mathrm{x}_{3}^{2}$ and $f=\mathrm{x}_{1}^{4}+\mathrm{x}_{2}^{4}-\mathrm{x}_{3}^{4}$ . Then $f$ and $g$ satisfy the following conditions:

•

There is a point $x^{\prime}\in\mathbb{R}^{3}$ such that $g(x^{\prime})>0$ .

•

The polynomial $f$ is positive on the set $S(g)\backslash\{0\}$ .

•

The polynomial $f-\mathrm{x}_{3}^{2}g$ is non-negative.

In other words, the two polynomials satisfy the S4-conjecture. It is easy to see that $V_{1}$ and $V_{2}$ are both non-singular curves. According to [BC, Proposition 11.6.2, p. 286] the set $V_{1}(\mathbb{R})\subseteq\mathbb{P}^{2}(\mathbb{R})$ decomposes into 1 or 0 ovals, and the set $V_{2}(\mathbb{R})\subseteq\mathbb{P}^{2}(\mathbb{R})$ into at most $4$ different ovals. For the definition of an oval see [BC, p. 286]. In our case $V_{1}(\mathbb{R})$ and $V_{2}(\mathbb{R})$ decompose into one oval that does not intersect the plane at infinity. Since $\mathrm{x}_{1}^{2}+\mathrm{x}_{2}^{2}$ and $\mathrm{x}_{1}^{4}+\mathrm{x}_{2}^{4}$ are positive on the set $\mathbb{R}^{2}\backslash\{0\}$ , the two sets $V_{1}(\mathbb{R})$ , $V_{2}(\mathbb{R})$ do not intersect the hyperplane at infinity. Another consequence is that we can tell something about the geometry of $S(g)$ resp. $S(f)$ . For $y\in\mathbb{R}\backslash\{0\}$ set $H_{y}=\mathbb{R}^{2}\times\{y\}$ and interpret $V_{1}$ and $V_{2}$ as affine varieties in $\mathbb{A}^{3}$ . Then we have $\partial S(g)\cap H_{y}=V_{1}(\mathbb{R})\cap H_{y}$ and $\partial S(f)\cap H_{y}=V_{2}(\mathbb{R})\cap H_{y}$ for all $y\in\mathbb{R}\backslash\{0\}$ . The two inclusions $\partial S(g)\cap H_{y}\subseteq V_{1}(\mathbb{R})\cap H_{y}$ and $\partial S(f)\cap H_{y}\subseteq V_{2}(\mathbb{R})\cap H_{y}$ are obvious. The other two inclusions $\partial S(g)\cap H_{y}\supseteq V_{1}(\mathbb{R})\cap H_{y}$ and $\partial S(f)\cap H_{y}\supseteq V_{2}(\mathbb{R})\cap H_{y}$ hold, because $V_{1}(\mathbb{R})$ and $V_{2}(\mathbb{R})$ do not have singular points in $H_{y}$ . Thus each slice $\partial S(f)\cap H_{y}$ , $\partial S(g)\cap H_{y}$ ’looks’ like a circle. If we replace $g$ by an arbitrary quadratic form $q\in\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2},\mathrm{x}_{3}]$ and $f$ by an arbitrary ternary quartic $p$ , then all geometric statements111Of course with an adjusted number of ovals made so far in this matter remain true, if the ovals of $\mathcal{V}(q)(\mathbb{R})$ and $\mathcal{V}(p)(\mathbb{R})$ do not intersect the plane at infinity. Interestingly, the ovals that do not intersect the plane at infinity have different topological properties than their counterparts that intersect the plane at infinity: It is a well known fact that the fundamental group of $\mathbb{P}^{2}(\mathbb{R})$ is exactly $\mathbb{Z}/2\mathbb{Z}$ . By interpreting an oval as a loop, it turns out that all ovals that do not intersect the hyperplane at infinity, represent the identity element of the fundamental group. If all ovals of $\mathcal{V}(q)(\mathbb{R})$ and $\mathcal{V}(p)(\mathbb{R})$ do not intersect the hyper plane at infinity, then it is obvious that the topological situation compared to $\mathcal{V}(g)(\mathbb{R})$ and $\mathcal{V}(f)(\mathbb{R})$ has not changed much.

Hence it is convenient to ask this final question:

Question 5.2.

Let $q\in\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2},\mathrm{x}_{3}]$ be a quadratic form and $p$ a ternary quartic. Set $V_{1}=\mathcal{V}(q)\subseteq\mathbb{P}^{2}$ and $V_{2}=\mathcal{V}(p)\subseteq\mathbb{P}^{2}$ . Suppose that the following conditions are satisfied:

•

There is a point $x^{\prime}\in\mathbb{R}^{3}$ such that $q(x^{\prime})>0$ .

•

The ternary quartic $p$ is positive on the set $S(q)\backslash\{0\}$ .

•

The projective varieties $V_{1}$ and $V_{2}$ are non-singular.

•

The set $V_{1}(\mathbb{R})$ is an oval and $V_{2}(\mathbb{R})$ decomposes into at least one oval.

•

All ovals of $V_{1}(\mathbb{R})$ and $V_{2}(\mathbb{R})$ do not intersect the hyperplane at infinity.

Can we find a non-negative homogeneous polynomial $t\in\mathbb{R}[\mathrm{x}_{1},\mathrm{x}_{2},\mathrm{x}_{3}]_{2}$ such that $p(y)-t(y)q(y)\geq 0$ for all $y\in\mathbb{R}^{3}$ ?

Bibliography13

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[Bar] Alexander Barvinok: A Course in Convexity, Graduate Studies in Mathematics, AMS Volume 54, ISSN 1065-7339 .
2[Sha I] Igor Shavarevich: Basic Algebraic Geometry I, Springer, Second Edition, 1994 .
3[Sha II] Igor Shavarevich: Basic Algebraic Geometry II, Springer, Second Edition, 1997 .
4[BC] Jacek Bochnak, Michel Coste, Marie-Francois Roy: Real Algebraic Geometry, Springer, 1991 .
5[PT] Imre Polik, Tamas Terlaky: A Survey of the S-lemma, SIAM journals, http://epubs.siam.org/doi/abs/10.1137/S 003614450444614 X?journal Code=siread .
6[ZS] Kuize Zhang, Lijun Zhang, Fuchun Sun: High-Order S-Lemma With Application To Stability Of A Class Of Switched Nonlinear Systems, http://arxiv.org/abs/1403.1016 .
7[Ne] Tim Netzer, Stability of Quadratic Modules, http://arxiv.org/abs/0807.4403 .
8[RS] Remmert, Schumacher, Funktionentheorie 1, Springer, 5.Auflage, 2001 .

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Contents

Chapter 0 Introduction

Chapter 1 The S-lemma

Theorem 0.1**.**

1 Preliminaries

Definition 1.1**.**

Remark 1.2**.**

Definition 1.3**.**

Definition 1.4**.**

Remark 1.5**.**

Definition 1.6**.**

Definition 1.7**.**

Definition 1.8**.**

Definition 1.9**.**

Definition 1.10**.**

Lemma 1.11**.**

Proposition 1.12**.**

Lemma 1.13**.**

Proposition 1.14**.**

Corollary 1.15**.**

Corollary 1.16**.**

Proposition 1.17**.**

2 Proof of the S-lemma

Proposition 2.1**.**

Chapter 2 Higher degree S-lemma

1 Counterexample

Example 1.1**.**

Remark 1.2**.**

Example 1.3**.**

2 Formulating a higher degree S-lemma

Conjecture 2.1**.**

Remark 2.2**.**

3 S4-conjecture in two variables

Theorem 3.1**.**

Remark 3.2**.**

Lemma 3.3**.**

Lemma 3.4**.**

Lemma 3.5**.**

4 The S4-conjecture: A counterexample

Example 4.1**.**

Remark 4.2**.**

5 Geometric analysis

Lemma 5.1**.**

Remark 5.2**.**

Lemma 5.3**.**

Remark 5.4**.**

Lemma 5.5**.**

Proposition 5.6**.**

Conjecture 5.7**.**

6 A generalization of the counterexample

Definition 6.1**.**

Remark 6.2**.**

Proposition 6.3**.**

Proposition 6.4**.**

Proposition 6.5**.**

Chapter 3 Quadratic modules and stability

1 Preliminaries

Definition 1.1**.**

Definition 1.2**.**

Definition 1.3**.**

Remark 1.4**.**

Definition 1.5**.**

Theorem 1.6**.**

2 Stability and tentacles

Definition 2.1**.**

Definition 2.2**.**

Proposition 2.3**.**

Corollary 2.4**.**

Theorem 2.5**.**

Definition 2.6**.**

Remark 2.7**.**

Proposition 2.8**.**

Theorem 2.9**.**

Theorem 0.1.

Definition 1.1.

Remark 1.2.

Definition 1.3.

Definition 1.4.

Remark 1.5.

Definition 1.6.

Definition 1.7.

Definition 1.8.

Definition 1.9.

Definition 1.10.

Lemma 1.11.

Proposition 1.12.

Lemma 1.13.

Proposition 1.14.

Corollary 1.15.

Corollary 1.16.

Proposition 1.17.

Proposition 2.1.

Example 1.1.

Remark 1.2.

Example 1.3.

Conjecture 2.1.

Remark 2.2.

Theorem 3.1.

Remark 3.2.

Lemma 3.3.

Lemma 3.4.

Lemma 3.5.

Example 4.1.

Remark 4.2.

Lemma 5.1.

Remark 5.2.

Lemma 5.3.

Remark 5.4.

Lemma 5.5.

Proposition 5.6.

Conjecture 5.7.

Definition 6.1.

Remark 6.2.

Proposition 6.3.

Proposition 6.4.

Proposition 6.5.

Definition 1.1.

Definition 1.2.

Definition 1.3.

Remark 1.4.

Definition 1.5.

Theorem 1.6.

Definition 2.1.

Definition 2.2.

Proposition 2.3.

Corollary 2.4.

Theorem 2.5.

Definition 2.6.

Remark 2.7.

Proposition 2.8.

Theorem 2.9.

Theorem 2.10.

Remark 2.11.

Remark 2.12.

Theorem 3.1.

Proposition 3.2.

Remark 3.3.

Example 3.4.

Example 3.5.

Theorem 3.6.

Proposition 3.7.

Remark 3.8.

Proposition 4.1.

Remark 4.2.

Proposition 5.1.

Question 5.2.