A new general formula for the Cauchy Index on an interval with Subresultants
Daniel Perrucci, Marie-Fran\c{c}oise Roy

TL;DR
This paper introduces a novel formula for calculating the Cauchy index of rational functions on any interval, utilizing subresultant polynomials, and simplifies computations by reducing the number of subresultants needed.
Contribution
The paper provides a new general formula for the Cauchy index that removes endpoint restrictions and optimizes the number of subresultants involved.
Findings
The formula applies to any interval without endpoint conditions.
It reduces the number of subresultants needed in certain cases.
The approach simplifies the computation of the Cauchy index.
Abstract
We present a new formula for the Cauchy index of a rational function on an interval using subresultant polynomials. There is no condition on the endpoints of the interval and the formula also involves in some cases less subresultant polynomials.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPolynomial and algebraic computation · Advanced Numerical Analysis Techniques · Numerical Methods and Algorithms
A new general formula
for the Cauchy Index on an interval with Subresultants
Daniel Perrucci*♭* Marie-Françoise Roy*♯*
Departamento de Matemática, FCEN, Universidad de Buenos Aires and IMAS UBA-CONICET,
Ciudad Universitaria, 1428 Buenos Aires, Argentina
IRMAR (UMR CNRS 6625), Université de Rennes 1,
Campus de Beaulieu, 35042 Rennes Cedex, France Partially supported by the Argentinian grantsUBACYT 20020160100039BAandPIP 11220130100527CO CONICET
Abstract
We present a new formula for the Cauchy index of a rational function on an interval using subresultant polynomials. There is no condition on the endpoints of the interval and the formula also involves in some cases less subresultant polynomials.
Keywords: Cauchy Index, Subresultant Polynomials.
AMS subject classifications: 14P99, 13P15, 12D10, 26C15
1 Introduction
Let be a real closed field and with . Already considered by Sturm and Cauchy ([8, 4]), the Cauchy index of the rational function is the integer number which counts its number of jumps from to minus its number of jumps from to . This value plays an important role in many algorithms in real algebraic geometry ([2, 3, 6]). For instance, the Tarski query of for , defined as
[TABLE]
is equal to the Cauchy index of the rational function (see, for instance, [3, Proposition 2.57]). Tarski queries are used to solve the sign determination problem, which consists in listing the signs of a list of polynomials in evaluated at the roots in of another polynomial in (see [3, Section 10.3]). In particular, the number of real roots of a polynomial coincides with the Tarski query and is equal to the Cauchy index of the rational function . We can also mention the role of the Cauchy index for complex roots counting (see [4, 5, 7]). For more information and references to the history of the Cauchy index see [5].
1.1 Cauchy index
Let with . The usual definition of the Cauchy index of is made directly on intervals whose endpoints are not roots of . In this paper we use the extended definition of the Cauchy index introduced in [5, Section 3], which is made first locally at elements in , and then on intervals without restriction.
Definition 1
Let and .
- •
If , the rational function can be written uniquely as
[TABLE]
with , , monic, and coprime and . For , define
[TABLE]
In all other cases, define
[TABLE]
- •
The Cauchy index of at is
[TABLE]
Said in other terms, when is a pole of , we have that \displaystyle{{\rm Ind}_{x}^{+}\Big{(}\frac{Q}{P}\Big{)}} is one half of the sign of to the right of , and \displaystyle{{\rm Ind}_{x}^{-}\Big{(}\frac{Q}{P}\Big{)}} is one half of the sign of to the left of . Then, the Cauchy index of at , \displaystyle{{\rm Ind}_{x}\Big{(}\frac{Q}{P}\Big{)}}, is simply the difference between them. We illustrate this notion considering the graph of the function around in each different case.
x$$x$$x$$x$${\rm{Ind}}_{x}\Big{(}\displaystyle{\frac{Q}{P}}\Big{)}=0$${\rm{Ind}}_{x}\Big{(}\displaystyle{\frac{Q}{P}}\Big{)}=1$${\rm{Ind}}_{x}\Big{(}\displaystyle{\frac{Q}{P}}\Big{)}=-1$${\rm{Ind}}_{x}\Big{(}\displaystyle{\frac{Q}{P}}\Big{)}=0
Definition 2
Let with and with . If , the Cauchy index of on is
[TABLE]
where the sum is well-defined since only roots of in contribute.
Similarly, if , the Cauchy index of on is
[TABLE]
where, again, the sum is well-defined since only roots of in contribute.
If , both the Cauchy index of on and the Cauchy index of on are defined as [math].
In the following picture we consider again the graph of the function , this time in .
a$$b$$a$$b$${\rm{Ind}}_{a}^{b}\Big{(}\displaystyle{\frac{Q}{P}}\Big{)}=1+0+1=2$${\rm{Ind}}_{a}^{b}\Big{(}\displaystyle{\frac{Q}{P}}\Big{)}=-1-1-\frac{1}{2}=-\frac{5}{2}
Note that with this extended definition of the Cauchy index, the Cauchy index of a rational function on an interval belongs to and it is not necessarily an integer number.
1.2 Sturm sequences and Cauchy index
Definition 3
Let . Define and, if ,
[TABLE]
with , where is the remainder in the euclidean division in of the first polynomial by the second polynomial.
The Sturm sequence of and is and the Sturm sequence of and [math] is , with . We denote by the degrees of .
Example 4
Let and . If and , the Sturm sequence of and is with
[TABLE]
In this case, , .
Extending the classical results by Sturm ([8]) and recent results by [5], we now explain that the Sturm sequence of and gives a formula for the general definition of the Cauchy index on an interval under no assumptions on and . To do so, it is first needed to extend the notion of sign of a rational function to degenerate cases.
Definition 5
Let . Using the same notation as in Definition 1, we define
[TABLE]
We define also \displaystyle{{\rm{sign}}\Big{(}\frac{Q}{P},x\Big{)}}=0 if .
In other words, if is a pole of , the sign of at is [math]; otherwise, it is simply the sign of the continuous extension of at . Notice that if , \displaystyle{{\rm{sign}}\Big{(}\frac{Q}{P},x\Big{)}={\rm{sign}}\Big{(}\frac{P}{Q},x\Big{)}}.
We now state the general result relating the Cauchy index and the Sturm sequence, which will be proved at the end of Section 4.
Theorem 6
Let with and , . If is the Sturm sequence of and , then
[TABLE]
Adding the condition that and are not common roots of and , from Theorem 6 a sign-variation-counting formula for the Cauchy index is obtained.
Definition 7
Let and , we define the sign variation of at by
[TABLE]
If with , we denote by the sign variation of at minus the sign variation of at ; namely,
[TABLE]
Note that for ,
[TABLE]
Moreover, if is not a common root of and , then
[TABLE]
The following result then follows clearly from Theorem 6.
Theorem 8
Let with and , . If and are not common roots of and and is the Sturm sequence of and , then
[TABLE]
Theorem 8 is a generalization of the classical Sturm theorem [8, 3], since or can be root of or (but not of both).
1.3 Subresultant polynomials
Subresultant polynomials are polynomials which are proportional to the ones in the Sturm sequence, but enjoy better properties since their coefficients belong to the ring generated by the coefficients of and . We include definitions and properties concerning subresultant polynomials. We refer the reader to [3] for proofs and details.
Let be a domain and let be its fraction field.
Definition 9
Let with and . For , the -th subresultant polynomial of and , is
[TABLE]
[TABLE]
By convention, we extend this definition with
[TABLE]
We also define
Note that in the matrix above, all the entries in the first columns are elements in , and all the entries in the last column are elements in . Doing column operations, it is easy to prove that for ,
[TABLE]
Note also that in the case , we have given two definitions for , both equal to so that there is no ambiguity.
Definition 10
Let with and .
- •
For , the -th subresultant coefficient of and , is the coefficient of in . By convention, we extend this definition with
[TABLE]
- •
For , is said to be
- –
defective* if or, equivalently, if ,*
- –
non-defective* if or, equivalently, if .*
We refer the reader to [1, Chapitre 9] for another definition of subresultant polynomials and coefficients, which differs possibly in a sign.
We illustrate Definitions 9 and 10 with the following example.
Example 11
Let and , then as in Example 4. We have
[TABLE]
Note that and are non-defective while and are defective. Finally, is defective if and only if , and is defective if and only if .
The following Structure Theorem is a key result in the theory of subresultants, stating the connection between subresultants and remainders. To state it, we need to introduce a notation.
Notation 12
For , we denote
Note that if the remainder of in the division by is [math] or and if the remainder of in the division by is or ; this implies that for
[TABLE]
Theorem 13** (Structure Theorem of Subresultants)**
Let with and . Let be the sequence of degrees of the Sturm sequence of and in decreasing order and let (note that and ).
- •
For ,
[TABLE]
and and are proportional. More precisely, for , denote
[TABLE]
(note that ), and extend this notation with and (even if is not monic). Then
[TABLE]
with
[TABLE]
This implies .
- •
For ,
[TABLE]
(where is the remainder in the euclidean division in of the first polynomial by the second polynomial) and the quotient belongs to .
- •
Both and are greatest common divisors of and in and they divide for . In addition, if then
[TABLE]
- Proof:
See [3, Chapter 8].
Note that Theorem 13 (Structure Theorem of Subresultants) gives a method for computing the subresultant polynomials using remainders which is more efficient than using their definition as determinants. However we are not concerned with subresultant polynomials computations in the current paper. We are only concerned with a formula for the Cauchy index using the subresultant polynomials.
Theorem 13 (Structure Theorem of Subresultants) can be illustrated by the following picture.
[TABLE]
As a corollary to Theorem 13, all subresultant polynomials are either [math] or proportional to polynomials in the Sturm sequence. More precisely, for , the subresultant polynomial is proportional to in the Sturm sequence.
Remark 14
In the case where all the subsresultant polynomials are non-defective, there are no pairs of proportional polynomials in the sequence of subresultant polynomials, the degrees of the polynomials in the Sturm sequence decrease one by one, and the coefficient of proportionality between and is a square (see [3, Corollary 8.37]).
Example 15** (Continuation of Example 4 and Example 11)**
Let us take as before and suppose and .
Looking at Example 11, we observe that, as expected given the Structure Theorem, the degrees of the non-defective subresultant polynomials are , , i.e. the degrees of the polynomials in the Sturm sequence given in Example 4. Moreover , while is proportional to and, also, to given in Example 4.
Using the notation from Theorem 13, we have
[TABLE]
1.4 Main results
In order to state our results we introduce the following notation.
Notation 16
Using the notation from Theorem 13, for , let
[TABLE]
( is well-defined since is odd).
We are ready now to state our main result, which is a new formula for {\rm Ind}_{a}^{b}\Big{(}\displaystyle{\frac{Q}{P}}\Big{)} using only the polynomials in the sequence of the subresultant polynomials.
Theorem 17
Let with and with and . Then
[TABLE]
As we will see in Section 2, the main advantage of the formula in Theorem 17 in comparison with previously known related formulas is that there is no assumption on the endpoints and of the interval, and, more importantly, potentially less subresultant polynomials (i.e. only te ) are involved.
If we add the condition that and are no roots of and , from Theorem 17 we obtain a sign-variation-counting formula.
Theorem 18
Let with and with and . If and are not common roots of and , then
[TABLE]
Finally, for the Cauchy index on , we obtain the following result.
Theorem 19
Let with and . If the leading coefficient of is positive or if is even, then
[TABLE]
If the leading coefficient of is negative and is odd, then
[TABLE]
Example 20** (Continuation of Examples 4, 11 and 15)**
Following Notation 16, for we have . Therefore, by Theorem 19, when we fix with and , the number of roots of in is given by
[TABLE]
The rest of the paper is organized as follows. In Section 2 we comment the differences between our results and previously known related formulas. In Section 3 we review some useful properties of Cauchy index. In Section 4, we recall the notion of -chain and their connection with Cauchy index. Finally, in Section 5 we prove Theorem 17 using -chains, and Theorems 18 and 19 as consequences of Theorem 17.
2 Comparison with previous Cauchy index
formulas using subresultants
There is a previously known formula for the Cauchy index \displaystyle{{\rm Ind}_{a}^{b}\Big{(}\frac{Q}{P}\Big{)}} by means of subresultant polynomials which is as follows (see [3, Chapter 9]).
Definition 21
Let be a finite sequence of elements in of type
[TABLE]
with and a finite sequence of elements in with , which is either empty (this is, ) or with . The modified number of sign variations in is defined inductively as follows
[TABLE]
In other words, the usual definition of the number of sign variations is modified by counting two sign variations for the groups: and . If there are no zeros in the sequence , is just the classical number of sign variations in the sequence.
Let be a sequence of polynomials in and let be an element of which is not a root of the of , which we call . Then , the modified number of sign variations of at , is the number defined as follows:
delete from those polynomials that are identically [math] to obtain the sequence of polynomials in ,
- -
define as .
Let and be elements of which are not roots of . The difference between the number of modified sign variations in at and is denoted by
[TABLE]
Denoting by the list of subresultant polynomials of and , the following result is known (see [3, Chapter 9]).
Proposition 22
Let with and with and . If and are not roots of , then
[TABLE]
Our new formula for {\rm Ind}_{a}^{b}\Big{(}\displaystyle{\frac{Q}{P}}\Big{)} given in Theorem 17 improves on the one from Proposition 22 in several aspects:
- a.i)
Theorem 17 is general, there are no restrictions on and .
- a.ii)
More importantly, there are cases when less subresultant polynomials are involved in this new formula. The Structure Theorem of Subresultants (Theorem 13 ) states that in the subresultant polynomial sequence, some polynomials appear only once and other polynomials appear exactly twice (up to scalar multiples). In addition, if a polynomial appears twice, its first appearance, , is defined as the polynomial determinant of a matrix of smaller size (in comparison with its second appearance), so that it is more suitable in computations. Our formula involves only the , i.e. the first appearance( up to scalar multiples) of each polynomial in the subresultant polynomial sequence.
In the special case when and are not common roots of and , Theorem 18 gives a sign-variation-counting formula which improves on the one from Proposition 22 since:
- b.i)
Theorem 18 imposes less restrictions on and .
- b.ii)
As in a.ii).
- b.iii)
The formula is more natural, since the sign-variation counting in Theorem 18 is local and needs only to consider the sign of two consecutive elements, contrarily to the modified number of sign variations which is very counter-intuitive.
Last but not least, the proofs of our results are also less technically involved than the proof of Proposition 22, which is cumbersome (see the proof in [3, Chapter 9]).
Note that, in the particular case where all subresultant polynomials are non-defective, both the formulas in Theorem 18 and in Proposition 22 become
[TABLE]
(see [3, Chapters 2 and 9]), but the new formula extends the previous one to the case whera and are not common roots of and .
There is also a previously known formula for the Cauchy index {\rm Ind}_{{\rm{\bf R}}}\Big{(}\displaystyle{\frac{Q}{P}}\Big{)} by means of subresultant coefficients which we introduce below (see [3, Chapter 4]).
Proposition 23
Using the notation from Theorem 13, for , let be the leading coefficient of the non-defective subresultant polynomial (which is proportional to ). Then
[TABLE]
Even in this special case, the new formula for {\rm Ind}_{a}^{b}\Big{(}\displaystyle{\frac{Q}{P}}\Big{)} given in Theorem 19 improves on the one from Proposition 23. As before, the main difference between the two formulas is that the are, in the defective cases, defined as determinants of matrices of smaller sizes than the and therefore is more suitable in computations. On the other hand, one advantage of Proposition 23 is that it can be proved directly, using only subresultant coefficients and does not use the definition of the subresultant polynomials and the Structure Theorem of subresultants (see [3, Chapter 4]).
3 Properties of Cauchy index
In this section we include some useful properties of Cauchy index.
Lemma 24
Let with , and . Then
[TABLE]
- Proof:
Follows immediately from the definition of Cauchy index.
Lemma 25
Let with , and such that
[TABLE]
Then
[TABLE]
- Proof:
For each , we first note that if
[TABLE]
with , and , then defining
[TABLE]
we have
[TABLE]
with and . This proves that {\rm Ind}_{x}^{\varepsilon}\Big{(}\displaystyle{\frac{Q}{P}}\Big{)}={\rm Ind}_{x}^{\varepsilon}\Big{(}\displaystyle{\frac{R}{P}}\Big{)} for every . The claim follows from the definition of the Cauchy index.
The following property is known as the inversion formula.
Proposition 26
Let with and . Then
[TABLE]
- Proof:
See [5, Theorem 3.9].
4 -chains and Cauchy index
The notion of -chain was introduced in [7]. Here, we need to introduce a slight variation of this notion.
Definition 27
Let and with and . A sequence of polynomials in is a special -chain if for there exist and such that
** 2. 2.
, 3. 3.
.
As in [7], note that for , taking , any sequence in is a special -chain.
Note also that Sturm sequences are always special chains.
Example 28** (Continuation of Examples 4, 11, 15 and 20)**
Taking and , then is a special -chain, with
[TABLE]
Taking now and , then is a special -chain, with
[TABLE]
We will see in Section 5 how to produce special -chains using Theorem 13 (Structure Theorem of Subresultants).
We introduce some more useful definition.
Definition 29
Let , , in and . We define , for ,
[TABLE]
and
[TABLE]
Using the ideas of the proof of [5, Theorem 3.11], we obtain the following result for special -chains. Note that no assumption on and is made.
Proposition 30
Let with , and . If in is a special -chain then
[TABLE]
- Proof:
We proceed by induction in . If , the result follows from Proposition 26 (Inversion Formula).
Suppose now that . Taking as in Definition 27, by Lemmas 24 and 25 we have
[TABLE]
We consider , and we apply the inductive hypothesis to the special -chain . For we have that . Finally, using Proposition 26 (Inversion Formula) and the inductive hypothesis,
[TABLE]
as we wanted to prove.
Corollary 31
Let with , and . If in is a special -chain and divides , then
[TABLE]
As mentioned in the Introduction, Theorem 6 can be deduced from Corollary 31.
- Proof of Theorem 6 :
Theorem 6 is a special case of Corollary 31 taking and , since the Sturm sequence is a special -chain and divides .
5 Proof of the main results
We fix the notation we will use from this point.
Notation 32
Let with and . Let be the sequence of degrees of the non-defective subresultant polynomials of and in decreasing order and let .
- •
Using the notation from Theorem 13, for , let
[TABLE]
- •
For , let
[TABLE]
and let and .
Lemma 33
* is a special -chain. In addition, divides all its elements.*
- Proof:
Recall that and . Also, by the Structure Theorem of Subresultants (Theorem 13), we have that for ,
[TABLE]
The claim follows from the definition of .
The following lemma explores the relation between the signs of the leading coefficients of the subresultants polynomials.
Lemma 34
Let with and . Following Notation 12 and 16, for ,
[TABLE]
- Proof:
For the result is clear. For , by the Structure Theorem of Subresultants (Theorem 13),
[TABLE]
We proceed then by induction on . If , then is odd and
[TABLE]
If , then is even, and ; therefore by the inductive hypothesis,
[TABLE]
[TABLE]
using equation (2).
Now we are ready to prove Theorem 17.
- Proof of Theorem 17:
By Corollary 31, since is a special -chain and divides ,
[TABLE]
So, we only need to prove that for ,
[TABLE]
Indeed, using Lemma 34,
[TABLE]
and we are done.
From Theorem 17, we can easily deduce Theorem 18 as follows.
- Proof of Theorem 18:
Theorem 13 implies that, if for some , two consecutive polynomials and in the sequence have a common root , then every polynomial in this sequence has as a root. So, suppose now that and are not common roots of and , therefore they are not common roots of and for any .
The proof is finished using the formula from Theorem 17 for the Cauchy index \displaystyle{{\rm{Ind}}_{a}^{b}\Big{(}\frac{Q}{P}\Big{)}} and the identity .
Finally, we prove Theorem 19.
- Proof or Theorem 19:
We introduce the notation
[TABLE]
Note that, if is even, then , and if is odd, then .
Choosing big enough and applying Theorem 18,
[TABLE]
From this identity the result can be easily proved, taking into account that for , , but there is an ad-hoc definition of (and not as the leading coefficient of ).
Acknowledgements: We are thankful to the anonymous referees for their helpful remarks and suggestions.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Apéry, François; Jouanolou, Jean-Pierre. Résultant et sous-résultants : Elimination - Le Cas d’une variable. Hermann, Paris , 2006.
- 2[2] Barnett, Stephen. Polynomials and linear control systems. Monographs and Textbooks in Pure and Applied Mathematics, 77. Marcel Dekker, Inc., New York, 1983.
- 3[3] Basu, Saugata; Pollack, Richard; Roy, Marie-Françoise. Algorithms in real algebraic geometry. Second edition. Algorithms and Computation in Mathematics, 10. Springer-Verlag, Berlin , 2006.
- 4[4] Cauchy, Augustin. Oeuvres complètes, Gauthier-Villars, Paris, 1882–1974.
- 5[5] Eisermann, Michael. The fundamental theorem of algebra made effective: an elementary real-algebraic proof via Sturm chains. Amer. Math. Monthly 119 (2012), no. 9, 715–752.
- 6[6] Henrici, Peter. Applied and computational complex analysis. Volume 1. Pure and Applied Mathematics. Wiley-Interscience [John Wiley & Sons], New York-London-Sydney, 1974.
- 7[7] Perrucci, Daniel; Roy, Marie-Françoise. Quantitative Fundamental Theorem of Algebra. Q.J.Math . 70 (2019), no. 3, 1009–1037.
- 8[8] Sturm, Charles. Collected Works, Edited by J.-C. Pont, Birkhäuser, Basel, 2009.
