An alternative quadratic formula
Norbert Hungerb\"uhler (ETH Z\"urich)

TL;DR
This paper reviews existing quadratic formulas and introduces a new formula for solving quadratic equations, aiming to provide an alternative method for finding roots.
Contribution
It presents a novel quadratic root formula that differs from classical solutions, expanding the toolkit for solving quadratic equations.
Findings
New quadratic root formula demonstrated
Comparison with classical formulas shows advantages
Potential for simplified calculations in certain cases
Abstract
The classical quadratic formula and some of its lesser known variants for solving the quadratic equation are reviewed. Then, a new formula for the roots of a quadratic polynomial is presented.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
An alternative quadratic formula
Norbert Hungerbühler
1 A short history of the quadratic formula
The commonly used formula to solve a quadratic equation
[TABLE]
is the quadratic formula
[TABLE]
In German speaking countries this formula is called Midnight Formula or Moonshine Formula**Mitternachtsformel or Mondscheinformel* since a student is supposed to know it by heart even when torn from sleep at midnight. From a didactical point of view, this paradigm is certainly disputable since knowing something by heart is the enemy of understanding the subject. This is why Hermann Weyl used to urge his students to “un-learn” this formula.
The formula appeared for the first time in Descartes’ Discours de la méthode [4, p. 303] where he coined one of the most enduring conventions of mathematics, namely the use of as constants and as variables (see [2]).
The prehistory of the quadratic formula goes back to Babylonian mathematics: One of the oldest traces is the cuneiform tablet BM 13901, stored in the British Museum, which states around 80 mathematical problems. It dates back to Hammurabi’s reign in the beginning of the 18th century BC. The first problem asks to find the side length of a square (see [10] for a detailed discussion):
The surface plus the length of the square is .
In modern terms this corresponds to the equation . The later Greek mathematics would have never accepted this mixing of dimensions. However, the proposed Babylonian solution may be translated as follows:
Take 1, the unity. Brake it up in and . Square to get . Add to and obtain . The root of this is . Subtract of it the which you squared before. You get which is the side of the square.
These numerical steps are expressed in the formula
[TABLE]
which indeed corresponds to the positive solution of the quadratic equation (see also (5)).
In Greek mathematics we find geometric methods to solve quadratic equations in Euclid’s works: In [5] and [16] the Theorems 5 & 6 in Book II and the Theorems 28 & 29 in Book VI of the Elements, as well as the Theorems 84 & 85 in the Data are interpreted as tools to solve quadratic equations (see [9] for a critical analysis of this view). However, it is conceivable that Euclid would have rather used the Theorems 35 and 36 in Book III, since they allow a much more direct link between the quadratic equation and its geometric solution: Suppose we want to solve the equation , where , are given as lengths of segments. Then, the solutions satisfy and . Hence the following application of the intersecting chords theorem yields the solution on the left:
B$$C$$w$$A$$v$$Z$$c_{1}$$E$$c_{2}$$D$$x_{1}$$x_{2}
C$$B$$v$$A$$v$$Z$$c_{1}$$c_{2}$$E$$D$$x_{1}$$x_{2}$$r
Choose to be a sufficiently large circle through and and a concentric circle through . Then cuts a chord of length in segments of lengths . See also Exercise 4.
In particular, if we choose and as diameter of , we have and . Hence we can read off the solutions of and get the familiar expression .
The famous Bakhshali manuscript, the oldest passed down manuscript in Indian mathematics, also contains examples and solutions of quadratic equations with numerical coefficients. This birch bark manuscript is dated between the 9th and the 12th century CE but is assumed to have origins going back to the 3rd or 4th century CE. In Chinese mathematics, the Zhang Qiujian suanjing, a manual from the 5th century CE, not only discusses quadratic equations, it also contains a consistent concept of negative numbers. See [1] for more information.
An important milestone in the history of mathematics is Al-Khwarizmi’s algebra book Al-kitāb al-mukhtaṣar fī ḥisāb al-ğabr wa’l-muqābala which is, in part, devoted to an extensive discussion and classification of quadratic equations. Unlike in earlier work, Al-Khwarizmi, who lived between 780 and 850 CE, introduced the concept of equation not only as a series of case by case problems to be solved, but as an infinite class of problems with a common method of solution. In particular, he identified the cases where the quadratic equation has two, one or no solution. He also demonstrated different methods of solution by means of the idea of equal area. It is possible, that he was inspired by the translation of Euclid’s Elements by al-Hajjaj ibn Matar who, like Al-Khwarizmi, worked in the House of Wisdom in Baghdad (see [14, Chapter 11]). Although most of the modern notation was not yet available at that time and al-Khwarizmi still formulated his problems and solutions in text form, his work marks the dawn of elementary algebra.
So Descartes was, by far, not the first to solve quadratic equations. However, instead of giving a verbal description of the algorithm which yields the solution from the numerical values of the coefficients of a given single equation, he wrote down the solution as a condensed formula involving the coefficients of the equation as variables. This progress can hardly be overestimated: The formula allows to analyze the solutions as functions of these coefficients, which in turn may depend on other variables. Moreover, one may make use of the formula as an element in more complex computations. Finally, a clear formal justification of the formula becomes possible, not only a case by case verification of the numerical values at hand.
2 Friends of the quadratic formula
Quadratic equations can not only be considered over but also over a general field. The way to compute the solutions of (2) from the coefficients involves the four basic arithmetic operations and square roots.
Roots.
In a general field , is the set of squares. A function \sqrt{{\color[rgb]{1,1,1}\definecolor[named]{pgfstrokecolor}{rgb}{1,1,1}\pgfsys@color@gray@stroke{1}\pgfsys@color@gray@fill{1}p}}:Q\to F with the property is called a square root. The polynomial has at most two zeros. For the two zeros are , and these two values coincide if or if has characteristic 2. If is algebraically closed all elements are squares. By virtue of the Frobenius automorphism the same is true for every finite field of characteristic 2. And in a finite field of odd characteristic there are as many nonzero squares as nonsquares. In that case, products of two squares or of two nonsquares are squares, a product of a square and a nonsquare is a nonsquare.
The axiom of choice, or more precisely the axiom of choice for families of pair sets, ensures the existence of a square root for a general field. Actually, is equivalent to the fact, that in each field there exists a root on its set of squares (see [7, Proposition 4.3]). However, there are cases where a root may be defined due to additional structures: If is an ordered field, e.g. , one may always choose the larger of two possible values for the root of a square. In the case one can use its structure of a Riemann surface to define a root by identifying an analytical principal branch of the root function.
It is worth noticing that the solution formula (2) works with any choice of a root function and in any field of characteristic . Let us have a brief look at the special case of a field of characteristic .
Quadratic equations in a field of characteristic 2.
We consider (1) in a field of characteristic 2. If , then (1) is equivalent to . And if , or equivalently , the unique solution in is . If , there is no solution in .
If , we rewrite (1) in the form
[TABLE]
which, for , takes the form
[TABLE]
If we consider as a vector space over its prime field , then the Artin-Schreier map , is linear. Its kernel consists of the elements for which , i.e., . Hence, (3) has a solution iff is an element of the image of the Artin-Schreier map, and then is the second solution. In terms of the original variable , we get: (1) with has no solution if does not lie in the image of the Artin-Schreier map, and it has the two solutions and if is a preimage of under the Artin-Schreier map. For a detailed discussion, see [3].
Characteristic .
In a general field of characteristic , the number of solutions of (1) is governed by the discriminant : For coefficients , equation (1) has no solution in if the discriminant is a nonsquare. If is a square, (1) has one solution if otherwise two solutions.
But let us now concentrate to the case and discuss alternative ways to represent the solutions of (1).
When starting from the normalized form
[TABLE]
the quadratic formula takes the form
[TABLE]
Lesser-known is the formula
[TABLE]
for the solutions of (1). This variant follows from (2) by observing, that . The formula (6) yields only one solution if , but otherwise it gives a correct answer even in the degenerate case (i.e. a linear equation), provided . For this reason (6) is used in Muller’s iterative method [12] for numerically finding roots of a function by approximating with parabolas interpolating the graph in three points.
For real coefficients, trigonometric and hyperbolic methods for solving quadratic equations are also circulating: In contrast to (2), (5) and (6) these formulas are numerically stable (see [13], [6], and the discussion in Section 3). To explain the idea we assume, without loss of generality, that in (1). Moreover, we sort out the cases (with the solutions ) and (with the solutions ). Otherwise, we distinguish the cases and :
Case 1. If we bring equation (1) by the scaling to the normalized form
[TABLE]
Using the double-angle formula
[TABLE]
we see that is a solution of (7) if is chosen such that . According to Viëta’s Theorem, the product of the solutions of (7) is , i.e., . Summarizing, we get as solutions of (1)
[TABLE]
Case 2. If we first have to sort out the case with double root . Otherwise, we bring equation (1) by the scaling †††Alternatively, one may use to arrive again at the form like in the first case. to the normalized form
[TABLE]
Recall the double-angle formula for the hyperbolic tangent
[TABLE]
The range of the function is . Since , we have and we can choose such that , and actually iff . Then, by (9) it follows that is a solution of (8). Again by Viëta’s Theorem, the product of the solutions of (8) is , i.e., . Putting all together, we find as solutions of (1) in the considered case
[TABLE]
It may look strange at first sight to employ trigonometric or hyperbolic functions to solve a quadratic equation. However, recall, that the use of trigonometric functions becomes inevitable for the cubic and quartic equation in the casus irreducibilis. And for the general quintic equation, even trigonometric functions are no longer strong enough: Hermite used the theory of elliptic modular functions and the Jacobi theta function to solve the quintic (see [8]). Later, Klein related the symmetries of the icosahedron with Galois theory to show systematically, why elliptic modular functions appear naturally in the solution of the quintic. He also devised a solution using hypergeometric functions (see [11]).
3 Meet a new friend
Here, we want to propose an alternative quadratic formula which stands out by its symmetry.
Proposition**.**
Let the quadratic equation be given by the homogeneous normal form
[TABLE]
*Then its solutions can be expressed by *
[TABLE]
For real coefficients , (10) has
- (a)
a double root (namely ) iff ,
- (b)
two positive distinct real roots iff ,
- (c)
two real roots of opposite sign iff .
Remark. It is easy to convert the standard form (1) to the homogeneous form (10), however it requires a root to determine . But for dimensional reasons, problems originating from physics or geometry often present themselves already in the form (10). Actually, Descartes’ first example at the very hour of birth of the quadratic formula, was geometric and of the homogeneous form (10) with coefficients : The following facsimile shows the corresponding excerpt of page 303 in [4].
Proof of the Proposition. The classical formula (2) gives initially
[TABLE]
This establishes (a)–(c) in the case of real coefficients . In general, if , we use that for any root function on the product is either plus or minus and get by (12)
[TABLE]
with either the plus or the minus sign. The result for is the same expression with the opposite sign.
Alternatively, the correctness of (11) can readily be checked, without transforming the product , by plugging it into (10).
A benefit of the new formula (11) compared to the classical version (2) is its numerical stability. To illustrate this property, we compare the classical formula with the new formula when applied to the model problem in [6]:
[TABLE]
For simplicity we take even. Then the solution obtained by (2) rounded to significant decimals, are
[TABLE]
However, if we repeat the calculation rounded to significant decimals, we are victims of catastrophic cancellation and get
[TABLE]
If we use (11) instead, the situation is considerably better, as it still yields satisfactory solutions if the calculations are carried out rounded to only significant decimals:
[TABLE]
The following example taken from a physics textbook [15, Section 14.3, Problem 8] shows how nicely (11) may turn out compared to the classical formulas (2) or (5):
Exercise. A stone is dropped from rest into a well. seconds later, the sound of the splash is heard. Ignoring air resistance, how far has the stone fallen before hitting the water?
Solution. Let denote the depth of the well, the velocity of sound, and the acceleration of fall (we neglect air resistance). The time is the sum of the time for the stone to fall and the travel time of the sound. Squaring and rearranging terms leads to the quadratic equation
[TABLE]
The alternative formula (11) yields the particularly compact expression
[TABLE]
Compare this to the rather messy expression you get from the classical formula (5)! Let us briefly discuss the result: A falling stone reaches after time the velocity of sound. During that time , it travelled the distance which is precisely the first factor in (14). The second factor, the squared bracket, is a dimensionless number. The Taylor expansion in powers of of (14) is
[TABLE]
Here, the first term is the answer to the exercise in the case of infinite propagation of sound, the second term gives the first order correction (in ) which yields reasonably good results in practice.
Observe, that the second solution does not solve the given physical problem. However, the reader is invited to figure out what physical interpretation the second solution of (13) has.
Exercises
Solve Descartes’ problem by the alternative formula (11). Interpret the result geometrically. 2. 2.
Let be a field of characteristic . Prove: The solutions of the quadratic equation are squares in if and only if is a square, and are squares. 3. 3.
Let be positive such that , . Show that with
[TABLE]
exist iff . Find and in this case. 4. 4.
Solve the quadratic equation with the Intersecting Secants Theorem (Theorem 36 in Book III of Euclid’s Elements). 5. 5.
A company sells olives in large cardboard boxes folded from squares of sidelength :
a$$a$$x
The selling price for olives is Dollar per unit volume. The price for one box is therefore . The cost for the cardboard is Dollar per unit area. The cost per box is therefore . Optimize the profit per box. Use the alternative formula (11).
Acknowledgement. The author would like to thank Hans Peter Dreyer for pointing out to him the nice exercise of the falling stone, and Jacques Gélinas for the remark about the numerical stability.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Ari Ben-Menahem. Historical Encyclopedia of Natural and Mathematical Sciences , volume 1. Springer, 2009.
- 2[2] Bill Casselman. x 𝑥 x , y 𝑦 y , and z 𝑧 z . Notices Amer. Math. Soc. , 2015.
- 3[3] Jørgen Cherly, Luis Gallardo, Leonid Vaserstein, and Ethel Wheland. Solving quadratic equations over polynomial rings of characteristic two. Publ. Mat. , 42(1):131–142, 1998.
- 4[4] Descartes, René. Discours de la Méthode . Imprimerie de Ian Maire, 1637.
- 5[5] Walter C. Eells. Greek Methods of Solving Quadratic Equations. Amer. Math. Monthly , 18(1):3–14, 1911.
- 6[6] George E. Forsythe. Pitfalls in Computation, or why a Math Book isn’t Enough. Amer. Math. Monthly , 77(9):931–956, 1970.
- 7[7] Lorenz Halbeisen, Norbert Hungerbühler, Nir Lazarovich, Waltraud Lederle, Marc Lischka, and Salome Schumacher. Forms of choice in ring theory. Results Math. , 74 (2019), no. 1, Art. 14, 33 pp.
- 8[8] Charles Hermite. Sulla risoluzione delle equazioni del quinto grado. Annali di math. pura ed appl. , 1:256–259, 1858.
