Certification for Polynomial Systems via Square Subsystems
Timothy Duff, Nickolas Hein, and Frank Sottile

TL;DR
This paper introduces methods for certifying solutions to overdetermined polynomial systems by leveraging square subsystems and advanced algebraic techniques, enabling reliable solution verification and completeness checks.
Contribution
It proposes novel certification approaches for overdetermined polynomial systems using square subsystems and algebraic tools like liaison and Newton-Okounkov bodies.
Findings
Effective certification of solutions to overdetermined systems.
Methods to reject nonsolutions and verify completeness.
Use of algebraic geometry tools for solution certification.
Abstract
We consider numerical certification of approximate solutions to a system of polynomial equations with more equations than unknowns by first certifying solutions to a square subsystem. We give several approaches that certifiably select which are solutions to the original overdetermined system. These approaches each use different additional information for this certification, such as liaison, Newton-Okounkov bodies, or intersection theory. They may be used to certify individual solutions, reject nonsolutions, or certify that we have found all solutions.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Certification for Polynomial Systems
via Square Subsystems
Timothy Duff
School of Mathematics
Georgia Institute of Technology
686 Cherry Street
Atlanta, GA 30332-0160
USA
[email protected] http://people.math.gatech.edu/ tduff3/ ,
Nickolas Hein
Department of Mathematics and Computer Science
Benedictine College
1020 N. 2nd St
Atchison, KS 66002
USA
[email protected] https://www.benedictine.edu/faculty-staff/hein-nickolas and
Frank Sottile
Frank Sottile
Department of Mathematics
Texas A&M University
College Station
Texas 77843
USA
[email protected] www.math.tamu.edu/~sottile
Abstract.
We consider numerical certification of approximate solutions to a system of polynomial equations with more equations than unknowns by first certifying solutions to a square subsystem. We give several approaches that certifiably select which are solutions to the original overdetermined system. These approaches each use different additional information for this certification, such as liaison, Newton-Okounkov bodies, or intersection theory. They may be used to certify individual solutions, reject nonsolutions, or certify that we have found all solutions.
Key words and phrases:
certified solutions, alpha theory, polynomial system, numerical algebraic geometry, Newton-Okounkov bodies, Schubert calculus
2010 Mathematics Subject Classification:
65G20, 65H10
Work of Sottile supported in part by the National Science Foundation under grant DMS-1501370
Work of Duff supported in part by the National Science Foundation under grant DMS-1719968
Duff and Sottile supported by the ICERM
1. Introduction
Given polynomials with , an approximate solution to the system is an estimate of some point where the polynomials all vanish ( is a solution to ), such that the approximation error can be refined efficiently as a function of the input size and desired precision. Numerical certification seeks criteria and algorithms that guarantee that a computed estimate of a solution to is an approximate solution in this sense.
Many existing certification methods [21, 35] are for square systems, where . These exploit that the isolated, nonsingular solutions to the system are exactly the fixed points of the Newton operator given (where defined) by
[TABLE]
where is the Jacobian matrix of the system evaluated at . A Newton-based certificate establishes that the sequence of Newton iterates converges to a solution to . Examples include both Smale’s -test [34, 35] (typically performed in rational arithmetic) and Krawczyk’s method [21] (based on interval arithmetic).
Once such a certificate is in hand, we say that is an approximate solution to with associated solution . Further refinements bound the distance to the associated solution , decide if two approximate solutions are associated to the same solution, and, in the case of real systems, decide if the associated solution is real [12].
Certification in the overdetermined case, where , poses challenges not encountered in the square case. A detailed study of the “least-squares” Newton operator,
[TABLE]
was undertaken by Dedieu and Shub [5]. Here, “” indicates the Moore-Penrose pseudoinverse. Among the contributions in their work is a generalization of Smale’s -theorem, Theorem 3 in [5], giving sufficient conditions for the convergence of to a fixed point. Since there may be many fixed points of which are not solutions to this criterion is generally not sufficient to certify that is an approximate solution to In fact, the fixed points of are precisely the solutions to the square system defined by
[TABLE]
Clearly the solutions to include the solutions to : we thus say that is a square subsystem of Our point of view in this paper is that certification of approximate solutions may be possible if we are given a square subsystem of —typically not the system in Equation 3—together with some global information about the excess solutions.
Alternate approaches to certification in the overdetermined case have been considered in previous work. In [1] a hybrid symbolic-numeric approach is used when the polynomials in have rational coefficients. This requires computing an exact rational univariate representation [32] and using that to certify approximate solutions. Alternatively, one may attempt to lift the approximation to an approximate solution to a square system in more variables. This is the approach taken for Schubert problems in [11, 13].
Though reduction to the case of a square subsystem is a natural idea, a suitable square subsystem and the requisite global information are not easily obtained in general. Rather than prescribing a single approach, we follow the pattern of this reduction through a series of algorithms and examples. We highlight how abstract tools such as liaison theory and Newton-Okounkov bodies may be brought to bear on certification, and illustrate certification for problems of interest in the Schubert calculus and computer vision.
Remark 1.1**.**
It is well-understood that perturbing an overdetermined system, eg. by adding generic constants, will generally produce an inconsistent system. Nevertheless, there are many families of overdetermined polynomial systems such that a generic member of the family has finitely many isolated solutions. In other words, a family of overdetermined systems need not be over-constrained. In general, a family of polynomial systems specified by some parameters may be understood via the incidence variety
[TABLE]
The family is well-constrained if the projection onto is dominant and The examples in Sections 5.2 and 5.3 fit naturally into the category of well-constrained families of overdetermined systems, where it is reasonable to seek certificates even for generic
The algorithms in our paper address the problems below.
Problem 1. How may we certify that a point is an approximate solution to ?
Problem 2. Suppose it is known that has solutions. How may we certify that a set of points consists of approximate solutions to ?
In Section 2, we recall various notions of “approximate solution” that have been considered in the literature; our Definitions 2.1 and 2.5 encompass these various notions and apply just as well to the overdetermined case. We also explain a simple exclusion criterion that indirectly certifies particular approximate solutions to a subsystem of as non-solutions to . In Section 3, we explain how global information about the excess solutions to may be used to certify approximate solution to in the sense of Section 2, thus solving Problems 1 and 2. We also give an alternative approach for Problem 2 that incorporates global information about and explain how the global information about can be calculated in terms of Khovanskii bases and their associated Newton-Okounkov bodies in Section 3.2. Section 4 discusses one further approach to Problem 1 that is based on liaison theory. In Section 5, we give three examples illustrating our algorithms. One involves a finite Khovanskii basis, another is from the Schubert calculus, and a third is from computer vision.
2. Approximate solutions
Throughout, we will fix positive integers . All polynomials will lie in the ring . We will write for a system of polynomials. The system is square when .
Definition 2.1**.**
A -approximate solution to a (possibly overdetermined) polynomial system is a triple , where , , and is a map defined on some such that
There exists such that , and
- 2)
all iterates are defined and the sequence converges to as .
Here indicates the usual Hermitian norm on . We will refer to as an approximate solution when the procedure and constant are understood. We call the point in (1) the solution to associated to . We sometimes refer to as a refinement operator. This is typically some incarnation of Newton’s method.
Remark 2.2**.**
In our examples, the system and the approximate solution are defined over the rationals or the Gaussian rationals . Since numerical solvers typically output floating point results, care must be taken to control rounding errors when computing certificates. One option for certification is to perform all subsequent operations in rational arithmetic. Interval and ball arithmetic give yet another approach (discussed in Subsection 2.2). A “certificate” obtained without controlling rounding errors may still be of practical value. Following [12], we call this a soft certificate.
Remark 2.3**.**
In practice, the map in Definition 2.1 should restrict to a computable function . Our algorithms assume an oracle for and we generally take a naive approach to questions of computability and complexity. However, we do not rely on special features of nonstandard models of computation such as the Blum-Shub-Smale machine [3].
Let be a square sytem and be an approximate solution to with associated solution . Our main concern is to certify that when is a square subsytem of —a seemingly difficult task a priori. It is however relatively simple to certify that is not a solution to a single polynomial , provided that is sufficiently small. For , let be the th symmetric power of . This has a norm dual to the standard unitarily invariant norm on homogeneous polynomials, and which satisfies , for . The -th derivative of at is a linear map with operator norm,
[TABLE]
Proposition 2.4**.**
Suppose that is an approximate solution to a square polynomial system with associated solution . For any polynomial , if
[TABLE]
then .
Proof.
By Taylor expansion, it follows that for any . ∎
Let us write for the difference in the inequality (5), which we will call a Taylor residual. Note the implicit dependence on in Definition 2.1. If is a polynomial system, then we define its Taylor residual to be maximum of the Taylor residuals , for . For this test of nonvanishing using Taylor residuals to be practical, we need to estimate the operator norms of the higher derivatives. One possible bounding strategy, as explained in [34, §I-3] and [12, §1.1], uses the first derivative alone. Another option, less suitable for polynomials of high degree, is to bound with the entry-wise or norms of these tensors.
A consequence of Definition 2.1 is that each iterate is an approximate solution, as . We wish to quantify this rate of convergence. The triangle inequality gives a test for when approximate solutions and have distinct associated solutions, namely if
[TABLE]
It is useful to have some additional criterion when two approximate solutions have the same associated solutions, that is, we wish to certify uniqueness of the associated solution in a sufficiently small region. This motivates our next definition.
Definition 2.5**.**
An effective approximate solution to a system consists of a weakly decreasing rate function with , and an integer such that
is a -approximate solution to with associated solution ,
- 2)
for all , and
- 3)
For some iterate , is the unique solution in the ball .
We say the rate of convergence for the effective approximate solution has order .
The rate of convergence is quadratic when . This implies that each application of roughly doubles the number of significant digits in . We generalize the method for certifying distinct solutions in [12, §I-2].
Proposition 2.6**.**
Given a set of effective approximate solutions to a system , we may compute a set of refined approximate solutions with distinct associated solutions comprising all solutions associated to the set .
Proof.
We need only replace each with its refinement . After refinement, the solutions associated to and are distinct if and only if inequality (6) holds. ∎
Proposition 2.4 may fail to certify that an extraneous solution is not a solution to . However, if is an effective approximate solution for then this test will succeed after sufficiently many refinements.
Corollary 2.7**.**
Let be a system of polynomals, and suppose is an effective approximate solution such that the associated solution . There is a such that the Taylor residuals are positive for all .
Proof.
Since we may argue that for some by Taylor expansion as in Proposition 2.4. Since is weakly decreasing, it follows from 2) in Definition 2.5 that the Taylor residuals remain positive for all ∎
Certificates for square systems are generally based on Newton’s method. We now observe that Definitions 2.1 and 2.5 encapsulate several existing certification paradigms for square systems.
2.1. Smale’s -theory
The central quantities of Smale’s -theory are defined as follows. With as above and a point where is invertible,
[TABLE]
Note that is the length of a Newton step at . The following proposition gives a criterion for approximate solutions in the sense of Definition 2.1.
Proposition 2.8** ([3, p. 160]).**
Let be a square polynomial system and . If
[TABLE]
then is -approximate solution to and the Newton iterates converge quadratically.
Criteria for quadratically convergent effective approximate solutions in the sense of Definition 2.5 can also be given in terms of . The analysis amounts to showing that is a contraction mapping in a suitable neighborhood of . This is given by the “robust” -theorem (Theorem 6 and Remark 9 of [3, Ch. 8]).
Proposition 2.9**.**
Let be a square polynomial system and an approximate solution to with associated solution and suppose that . If satisfies
[TABLE]
then is an approximate solution to with associated solution .
It follows that, for , we have that is an effective approximate solution in the sense of Definition 2.5.
2.2. Other approaches
The classical analysis of Newton’s method is due to Kantorovich [14]. Several variations of Kantorovich’s theorem exist, typically assuming some local Lipchitz condition on the Jacobian and boundedness conditions on . Certificates based on Kantorovich’s theorem thus rely on a priori bounds in a region containing Explicit bounds on the rate of convergence in terms of the Lipchitz and bounding constants are given in various works [40, 9, 7]. We refer to [24] for a survey of variants and an explanation of the relationship between Kantorovich’s theorem and -theory.
Approximate solutions may also be understood within the general program of interval and ball arithmetics. Both paradigms rely on defining arithmetic operations on intervals or balls and are definable in either exact or floating point arithmetic. In general, operations on intervals represent enclosures. In exact interval arithmetic, we define the sum by . For floating point arithmetic, we may either accept a soft certificate or control rounding errors when defining arithmetic operations so as to obtain a rigorous certificate. We refer to [22, 29, 41] for a more comprehensive treatment of these notions. A variety of interval/ball-valued Newton iterations have been studied. A popular variant is the Krawcyzk Method—see [29, Chapter 6] for an introduction, [28] for quadratic convergence, and [4] for extensions to complex analytic functions. Once a Newton-like iteration is in place, we get criteria for approximate solutions in the sense of definitions 2.1 and 2.5 by taking to be the midpoint/center of the enclosing interval/ball.
3. Certification via nonsolutions
In this section we consider certification in the setting where we have an overdetermined system given by a full set of approximate solutions to some square subsystem and prior knowledge of an integer such that
[TABLE]
From this information, the Newton operator can be used to give an approximate solution to in the sense of Definition 2.1. We make this precise in Section 3.1; Algorithm 1 provides one possible solution to Problem 1 from the introduction. For Problem 2, we give an essentially different approach (Algorithm 2) that assumes knowledge of the number of solutions to .
Knowledge of may come from rigorous mathematical proof (eg. the examples in 5.2) or by some form of certified computation. If all points in are isolated, then is simply the degree of the saturated ideal . Thus, if we can compute Gröbner bases for the ideals generated by both polynomial systems, then we have which is an admissible input for Algorithms 1 and 2, which are given in this section.
Aside from addressing Problems 1 and 2, we explain another approach to computing in the special case where is obtained by “squaring up”, or randomization [42]. This means we have a suitably generic matrix such that
[TABLE]
For such the number can sometimes be computed from Khovanskii bases (a generalization of SAGBI bases) for the polynomial algebra in variables given by . We give an overview of this theory in Section 3.2. An attractive feature of Khovanskii bases is that we may, in principle, work with the algebra itself rather than some presentation . There is, however, a significant trade-off, which is that finite Khovanskii bases need not exist. Nevertheless, we feel that computation of Khovanskii bases deserves to be more thoroughly explored. Certification is a particular application which may benefit from more efficient and robust computational tools for Khovanskii bases.
3.1. Certification algorithms
We now formulate our first algorithm for solving Problem 1. This is the content of Theorem 3.2, based on Definition 2.1. Note that Algorithms 1 and 2 assume pairwise distinct approximate solutions and explicit separating balls, respectively. By Proposition 2.6, it is enough to require that the are effective approximate zeros and apply these algorithms after refinement.
Algorithm 1** (Certifying individual solutions).**
Input:
- * — a polynomial system*
- * — a square subsystem of *
- * satisfying (8)*
- * — pairwise distinct approximate solutions to *
Output: a set of approximate solutions to
* Initialize *
* ***for *** *** if *** *** then *** *
* ***if *** *** then *** ***, else *** *
* return *
Remark 3.1**.**
A priori, we only need to know that —if the inequality is strict, we necessarily return an empty set.
Theorem 3.2**.**
Suppose that are valid input for Algorithm 1. Then its output consists of approximate solutions to .
Proof.
If is empty there is nothing to prove. Otherwise, there are distinct solutions to associated to points of —by Proposition 2.4, these are not solutions to . Since the solutions associated to points of are disjoint from those associated to points of , by assumption and (8) they associate to solutions to . ∎
We now give a second algorithm using -theory to certify solutions to an overdetermined system to solve Problem 2. Suppose that we have an overdetermined system that is known to have solutions whose square subsystems are known to have solutions. While we could apply Algorithm 1 to certify approximate solutions to , we propose an alternative method to solve this problem.
Algorithm 2** (Certifying a set of solutions).**
Input:
- * — integers*
- * — a polynomial system with solutions*
- * — two square subsystems of *
- * — a set of distinct approximate solutions to *
- * — disjoint balls separating elements of .*
- * — a set of distinct approximate solutions to *
Output: , a set of approximate solutions to
1:* Initialize *
2:* r\leftarrow\displaystyle\min_{1\leq i<j\leq d}\,\Big{(}\|\hat{\zeta}_{i}-\hat{\zeta}_{j}\|-(\rho_{i}+\rho_{i})\,\Big{)} *
3:* for do
4:* ***repeat *** *** until *** *
5:* *
6:* ***for *** *** if *** *** then *** *
7:* end *for
8:* ***if *** \big{(}\#T==e\,\big{)}, then ***return *** ***, else return *** FAIL *
Note that the intersection of balls in line 6 is non-empty if and only if
[TABLE]
so this condition may be decided in rational arithmetic if a hard certificate is desired.
Theorem 3.3**.**
Let be a system of polynomials having solutions whose general square subsystems have solutions. Then Algorithm 2 either returns FAIL or it returns a set of approximate solutions to whose associated solutions are all the solutions to .
As with Algorithm 1, while the hypotheses appear restrictive, they are natural from an intersection-theoretic perspective, and are satisfied by a large class of systems of equations. We explain one such family coming from Schubert calculus in Section 5.2.
Proof.
Since the balls are pairwise disjoint, the quantity is positive. Thus the refinement of each approximate solution on line 4 terminates. Having refined each note that can intersect at most one ball from . Now, if are the solutions to , then we must have that some is associated to each for some indices . Thus, if has elements, then the only solutions to associated to are also solutions to . ∎
Remark 3.4**.**
If is a general square subsystem of , then it will have solutions and the only common solutions to and to are solutions to . In this case, if Algorithm 2 returns FAIL, then , so that some pair of balls in Step 6 meet, but their intersection does not contain a common solution to and to . In this case, we may then further refine the solutions in , and the corresponding balls until no such extraneous pair of balls meet.
3.2. Newton-Okounkov bodies and Khovanskii bases
Perhaps the main difficulty in applying Algorithm 1 is obtaining the correct number beforehand. As noted in the beginning of this section, Gröbner bases give a general recipe for calculating this number. Here, we sketch a less well-developed approach in the case of a square subsystem defined as in (9). In this case, is given by a birationally-invariant intersection index over We summarize the basic tenets of this theory as developed in [17, 18].
Definition 3.5**.**
([18, Def. 4.5]) Let be an -dimensional irreducible variety over with singular locus . For an -tuple of finite-dimensional complex subspaces of the function field , let , and define
[TABLE]
the set of smooth points where every function in each subspace is regular, and
[TABLE]
the set of basepoints of . For generic , all solutions to the system on are nonsingular and their number is independent of the choice of . The common number is the birationally invariant intersection index .
These claims are proven in [17, Sections 4 & 5]. For our purposes, and where is the linear space spanned by the polynomials in our system . Write for this self-intersection index, note that , while . Thus (8) holds for general square subsystems of , taking .
Let be a surjective valuation where is some fixed total order on . For example, could restrict to the exponent of the leading monomial in a term order on . We attach to the following data:
- •
—a graded subalgebra of
- •
, a sub-monoid of associated to the pair where is the -span of -fold products from . This is the initial algebra of with respect to the extended valuation defined by where is the levelwise order defined by
[TABLE]
- •
—the index of as a subgroup of .
- •
—the Euclidean closure of all -linear combinations from
- •
—the Newton-Okounkov body.
The linear space induces a rational Kodaira map
[TABLE]
with the section ring the projective coordinate ring of the image.
Proposition 3.6** ([18, Thm. 4.9]).**
Let be a finite-dimensional subspace of . Then
[TABLE]
Here, denotes the -dimensional Euclidean volume in the slice .
In our setting, where and the Kodaira map is . Thus, if need be, may be computed symbolically. The main difficulty in applying Proposition 3.6 is that it may be hard to determine the Newton-Okounkov body, as the monoid need not be finitely generated. This leads us to the notion of a finite Khovanskii basis [16].
Definition 3.7**.**
A Khovanskii basis for is a set of generators for the algebra whose values generate the monoid . If is a global monomial order on taking lead monomials defines a valuation , where is the reverse of . A Khovanskii basis with respect this valuation is commonly known as a SAGBI basis [15, 31].
When the monoid is finitely generated, there is a finite Khovanskii basis for . When this occurs, we may compute the Khovanskii basis via a binomial-lifting/subduction algorithm such as described in [31] or [39, Ch. 11].
Example 3.8**.**
We consider an “illustrative example” of an overdetermined system from [1]:
[TABLE]
The square subsystem defiened by has two singular solutions, and is the Jacobian determinant of this subsystem. Let be the graded reverse lexicographic ordering with and We observe that the initial terms of under the induced order are given by and The lattice points corresponding to these monomials comprise the first level and lie in the linear subspace of defined by We see that the inner approximation to the Newton-Okounkov body given by the first level has -dimensional volume However, there exists with :
[TABLE]
giving
[TABLE]
This element of was obtained by the previously mentioned binomial-lifting/subduction algorithm—carrying this out further, we can verify that this new element together with the original generators give a finite Khovanskii basis for It follows that
[TABLE]
and We also have that and Thus, we have giving a total root count of after squaring up
Remark 3.9**.**
We note that the total root count for a system obtained by randomizing in the previous example is equal to the normalized volume of the common Newton polytope of , which is four. This is equal to the “expected” polyhedral root count [23, 2]. However, this information does not give us as needed for Algorithms 1 and 2.
4. Certification via liaison pruning
Suppose that we have an overdetermined system with a square subsystem , so that . Suppose further that we have a square system with . Given this, we may certify all approximate solutions to and then certify the subset of those that are approximate solutions to , so that the solutions in which remain are certifiably approximate solutions to . This solves Problem 1. When this occurs, we say that is in liaison with the complete intersection . The basic scheme for certification via liaison is Algorithm 3. We also give a generalized version, Algorithm 4, which we later apply to the Schubert calculus in Section 5.2.
Let us begin with some definitions. A system of polynomials is a complete intersection if the variety they define has dimension , equivalently if it has codimension . A square system is a zero-dimensional complete intersection.
More generally, varieties of codimension are in liaison if there are polynomials such that . This relation has been deeply studied (see [20] and the references therein). Of particular interest is when one of the varieties, say , is itself a (different) complete intersection, so that is in liaison with a complete intersection. (This is a special case of the licci equivalence relation.)
Example 4.1**.**
(The twisted cubic.) The closure of the set is the rational normal curve . It is defined by three quadrics, , and is thus not a complete intersection. In the affine patch defined by , if we use the difference of the first two generators and the last generator, then , where is the line .
[TABLE]
Let be a variety of codimension that is in liaison with a complete intersection . There are polynomials , , and such that
[TABLE]
We generalize the notion of a square system of polynomials. A square system on consists of polynomials that are sufficiently general in that is a finite set and the intersection is transverse. Then
[TABLE]
Thus the square system on is the set-theoretic difference of two square systems of polynomials, and
[TABLE]
For example, let be the rational normal curve of Example 4.1 in , which has codimension 2, so that is a square system on . Manipulating the polynomials in leads to the solutions
[TABLE]
If generate the ideal of , then is the overdetermined system
[TABLE]
Thus an algorithm to certify points on solves Problem 1 for . As we may certifty solutions and nonsolutions to systems (10) and (11), this discussion leads to the following certification algorithm, when a variety is in liaison with a complete intersection . This uses the test of Proposition 2.4, the Taylor residual (5), and Smale’s -theory for the system (11).
Algorithm 3** (Certifying approximate solutions to a square system on a variety ).**
Input:
- **
- * — a square polynomial system such that , ** *with both and of codimension
- * — polynomials such that *
- * — pairwise distinct approximate solutions to with refinement*
* ** operator *
Output: with , where consists of approximate solutions to and consists of approximate solutions to .
1:* Set , a square system on . *
2:* Initialize , *
3:* for do
4:* *
5:* *** if *** *** then *** *
6:* *** else if *** *** then *** *
7:* *** else *** and return to 5. *
8:* *** end if *** *
9:* end *for
Remark 4.2**.**
As in all subsequent algorithms, we assume distinct approximate solutions to with refinement operator as part of the input. We could have just as easily assumed effective approximate solutions. The test in line 5 could be replaced by testing that is an approximate solution to the square system by some criterion other than -theory—for simplicity, we do not assume this criterion is part of the input.
Proof of correctness.
As , it is an approximate solution to the square system with an associated nonsingular solution . Since is nonsingular, , as is singular along . Thus if and only if . Let be the sequence of iterates using starting at . This converges to .
If , then , and the sequence will eventually lie in the basin of quadratic convergence for Newton iterations and converges to [math]. As is bounded, converges to [math]. Thus the condition in line 5 will eventually hold and will be placed in .
If , then . By Corollary 2.7, the Taylor residuals are positive for large enough. Thus the condition in line 6 eventually holds, and will be placed into . ∎
We describe a more involved application of this idea. Write for the codimension, , of a variety . Suppose that are in general position and , then Bertini’s Theorem [19] implies that
[TABLE]
is a transverse intersection consisting of finitely many points. When , so that each is a hypersurface, then (12) is equivalent to the square polynomial system
[TABLE]
As a variety need not be a complete intersection, a square system of varieties (12) with does not necessarily have a formulation as a square system of polynomials. However, the points of (12) are the solutions to an overdetermined system of polynomials given by the generators of the ideals of each of .
Suppose now that form a square system of varieties (12), each is in liaison with a complete intersection , and these are all in sufficiently general position. Then there are square systems and of polynomials such that if is defined by for each , then
[TABLE]
are complete intersections for each . Thus
[TABLE]
We give a more general version of Algorithm 3 that will certify solutions to the square system (12) of varieties, given solutions (14) to the square system .
Algorithm 4** (Certifying solutions to a square system of varieties).**
Input:
- **
- * and — square polynomial systems such that*
* ** for each , (13) are complete intersections.*
- * — pairwise distinct approximate solutions to *
Output: consisting of approximate solutions to .
1:* for do
2:* Set . *
3:* Initialize *
4:* for do
5:* *
6:* *** if *** *** then *** discard *
7:* *** else if *** *** then *** *
8:* *** else *** and return to 6. *
9:* *** end if *** *
10:* end *for
11:* *
12:* end *for
Proof of correctness.
By algorithm 3, in each iteration of the outer loop, the algorithm constructs the set of elements of the input that do not lie in . As , we see that the algorithm performs as claimed. ∎
5. Examples
We give three further examples that illustrate our certification via square subsystems. All computations were carried out using the computer algebra system Macaulay2 [10]. For each example, we found complex floating-point solutions to square subsystems via homotopy continuation, as implemented in the package NumericalAlgebraicGeometry [26]. Tests from -theory were supplied by the package NumericalCertification [25].
5.1. Plane quartics through four points
Consider the overdetermined system where the the are given as follows:
[TABLE]
These give a basis for the space of quartics passing through the four points:
[TABLE]
As an illustration of Algorithm 1 and the techniques based on Khovanskii bases described in Section 3.2, we show how to certify that numerical approximations of these points represent true solutions to .
Letting we consider the algebra Letting be the graded-reverse lex order with the algebra has a finite Khovanskii basis with respect to the -valuation associated to . It is given by where
[TABLE]
The Newton-Okounkov body, depicted below, has normalized volume . The integer points correspond to . The fractional vertices corresponding to and demonstrate that these elements are essential in forming the Khovanskii basis.
Using the procedure of [33], we may express and as homogeneous polynomials in the algebra generators :
[TABLE]
[TABLE]
The Khovanskii basis was computed using the Macaulay2 package SubalgebraBases, based on the work in [38]. We checked this computation against our own top-level implementation of the binomial-lifting / subduction algorithm.
For certification, we squared up with a random matrix, , and found complex approximate solutions to using homotopy continuation. Each solution was softly certified distinct via -theory. Computing values as in Algorithm 1, we softly certified of these as nonsolutions to , hence associating the four remaining solutions to . Observe that by Proposition 3.6. Also, we have by Bézout’s theorem. This implies that and hence
5.2. Example from Schubert calculus
We describe a family of examples from Schubert calculus to which Algorithms 1, 2, and 4 all apply. For more on the Grassmannian and Schubert calculus, see [8]. Let be an integer and set . Consider the geometric problem of the 2-planes in that meet general codimension planes nontrivially. The number of such 2-planes is the Kostka number , the first few values of which are shown below.
[TABLE]
This may be computed recursively. Let be the coefficient of the Schur function in the product . Then . For the recursion, set and , when . Then, for , we set and for , .
We express this geometric probem in local coordinates. Write for the identity matrix and let be a matrix of indeterminates, and set , which has rows and columns. For any choice of , the column span of , also written , is a 2-plane in that does not meet the coordinate plane , and parametrizes the set of such -planes. For , let be a general -matrix whose column span (also written ) is a general -plane. Then if and only if the matrix has rank at most . This condition is given by the maximal minors of , each of which is the determinant of the square -matrix obtained by deleting a row from . This gives a system of quadratic equations in variables which define the solutions to our geometric problem.
Any polynomial that is a linear combination of the has the form , where the entries of are the coefficients of in that linear combination. This justifies the following scheme to obtain a square subsystem of . For each and , let be an -plane that is general given that it contains . We obtain the matrix of by appending a general column vector to the matrix of . Let be the determinant of the matrix —this vanishes when . We claim that the susbsystem of is square.
For this, let us investigate the corresponding geometric loci in the Grassmannian . Write for the set of all 2-planes which meet nontrivially, and for those that meet nontrivially. Let be the hyperplane containing both and , and let be the set of all 2-planes that are contained in . Since and it was shown in [37] that
[TABLE]
is a (generically) transverse intersection.
It is natural to analyze this geometric problem in the context of liaison theory discussed in Section 4: specifically, we can use Algorithm 4. On the other hand, the algorithms from section 3 work just as well. For each approach, we explain the details needed in order to certify. We note that the main bottleneck, solving is well within the capabilities of modern homotopy continuation software, say, for in the single digits [27].
Algorithm 4. In the local coordinates , we have that , so that (15) is a complete intersection and is in liaison with , which we show is a complete intersection. Let be the linear form (a row vector) whose kernel is . Then if and only if , so that . If and are the two rows of , then , showing that it is a complete intersection.
Our geometric problem of the 2-planes that meet each of is equivalent to the intersection
[TABLE]
which is a square system of varieties (12). As each is in liaison with a complete intersection, Algorithm 4 applies and may be used to certify the solutions to our geometric problem. Its input is the set , which consists of the points in the intersection
[TABLE]
While each pair is not in general position, this intersection is generically transverse, and the different pairs are in general position, so the intersection (16) is transverse. Consequently, the number of points in the intersection (16) is the expected number, which is the Catalan number .
Algorithm 1. For this algorithm, the number of excess solutions is . It starts with the set of points in the intersection (16). We run Algorithm 1, and if it finds that , so that we have rejected all nonsolutions, then those that remain are certified solutions to our geometric problem . Otherwise, we may refine the approximate solutions in so that the Newton steps become small enough to reject nonsolutions.
This algorithm is particularly easy in this case as the Taylor residual (5) of a linear function is , where the derivative of is a vector.
Algorithm 2. Here, we simply observe that is the number of solutions to the square system and is the number of solutions to . These data together with a full set of approximate solutions to are all that is needed for certification.
5.3. Essential matrix estimation
A fundamental object of study in geometric computer vision is the essential variety
[TABLE]
This is an irreducible variety of dimension and degree Elements of are called essential matrices. The ten polynomials defining are known as the Demazure cubics [6]. They minimally generate the homogeneous ideal of . It is possible to recover an essential matrix given five generic point correspondence constraints of the form
[TABLE]
where Although the overdetermined family given by equations (17) and (18) is fairly simple, it is notable for its apperance in applications. State of the art algorithms for solving these equations run on the order of microseconds [30] and have been successfully employed in large-scale 3D reconstruction pipelines [36]. This motivates the problem of developing certification techniques for this problem with comparable efficiency. For concreteness, we consider an instance of this problem in which the data are given by
[TABLE]
Truncating to decimal places, we may regard each of the and as rational vectors and seek a certificate. A candidate approximate solution is given to 6 places by
[TABLE]
To certify we consider the square system given by equations (18), the chart and the first three Demazure cubics: namely
[TABLE]
The number of solutions to is bounded a priori by and we can easily certify that this bound is attained. It follows that we may apply the methods described in Section 3. However, there is an even simpler procedure based on the exclusion criteria of Proposition 2.4 and 2.7, as well as the following result obtained by symbolic computation.
Proposition 5.1**.**
Let be the subvariety of defined by equations (18) and (19). Consider the polynomials
- **
- **
- **
and define projective varieties as the Zariski closures of the indicated quasiprojective varieties,
- **
- **
- **
We have that
Proof.
This follows by symbolically computing ideal quotients: letting by the ideal generated by (18) and (19) and for we may establish that equals the ideal defining ; for instance, by showing they have the same reduced Gröbner basis for a given term order. This is easily accomplished by Macaulay2. ∎
For generic data we expect the exclusion criteria of Proposition 2.4 and 2.7 for and to be satisfied for candidate solutions to the overdetermined problem: we may easily verify this in rational arithmetic for our given . Moreover, we estimate that thus giving that is an approximate solution using the refinement operator
We remark that -theory does not furnish a certificate if we use the Newton fixed point system given by equation 3 in the introduction. For this system, whose defining equations are of higher degree, we estimate that We also found, using Gröbner bases over a finite field, that the number of excess solutions to this fixed point system is for a total of critical points overall for the least-squares Newton operator. By constrast, Proposition 5.1 furnishes a certificate that requires no excess solutions whatsoever. Thus, even for this toy example, we see how the flexibility of square subsystems may enhance the prospects of obtaining rigorous mathematical proofs from the output of numerical computations. Although certification for overdetermined systems remains a challenge in general, similar techniques may be worth considering for problems of a larger scale.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Tulay Ayyildiz Akoglu, Jonathan D. Hauenstein, and Agnes Szanto. Certifying solutions to overdetermined and singular polynomial systems over ℚ ℚ {\mathbb{Q}} . J. Symbolic Comput. , 84:147–171, 2018.
- 2[2] David N. Bernshtein. The number of roots of a system of equations. Functional Analysis and its applications , 9(3):183–185, 1975.
- 3[3] Lenore Blum, Felipe Cucker, Michael Shub, and Stephen Smale. Complexity and real computation . Springer-Verlag, New York, 1998.
- 4[4] Michael Burr, Kisun Lee, and Anton Leykin. Effective certification of approximate solutions to systems of equations involving analytic functions. In Proceedings of the 2019 ACM on International Symposium on Symbolic and Algebraic Computation , pages 267–274. ACM, 2019.
- 5[5] J.-P. Dedieu and M. Shub. Newton’s method for overdetermined systems of equations. Math. Comp. , 69(231):1099–1115, 2000.
- 6[6] Michel Demazure. Sur deux problemes de reconstruction. Technical Report 882, INRIA, 1988.
- 7[7] Peter Deuflhard. Newton methods for nonlinear problems: affine invariance and adaptive algorithms , volume 35. Springer Science & Business Media, 2011.
- 8[8] William Fulton. Young tableaux . London Mathematical Society Students Texts, 35. Cambridge University Press, Cambridge, 1997.
