A new approach to hypergeometric transformation formulas
Noriyuki Otsubo

TL;DR
This paper introduces a novel, unified method for proving transformation formulas of Gauss hypergeometric functions using Jacobi's canonical form, also extending the approach to q-hypergeometric functions.
Contribution
The paper presents a new, simplified approach to deriving hypergeometric transformation formulas, leveraging Jacobi's canonical form, and explores analogous methods for q-hypergeometric functions.
Findings
Unified proof technique for hypergeometric transformations
Extension of methods to q-hypergeometric functions
Simplification of existing proof processes
Abstract
We give a new method to prove in a uniform and easy way various transformation formulas for Gauss hypergeometric functions. The key is Jacobi's canonical form of the hypergeometric differential equation. Analogy for -hypergeometric functions is also studied.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Mathematical Identities · Mathematical functions and polynomials · Nonlinear Waves and Solitons
A new approach to hypergeometric transformation formulas
Noriyuki Otsubo
Department of Mathematics and Informatics, Chiba University, Inage, Chiba, 263-8522 Japan
Abstract.
We give a new method to prove in a uniform and easy way various transformation formulas for Gauss hypergeometric functions. The key is Jacobi’s canonical form of the hypergeometric differential equation. Analogy for -hypergeometric functions is also studied.
Key words and phrases:
Hypergeometric functions, basic hypergeometric functions, transformation formulas.
2010 Mathematics Subject Classification:
33C05, 33D15
1. Introduction
Recall that a Gauss hypergeometric function is defined by the power series
[TABLE]
which converges on the open unit disk. Here the parameters , , are complex numbers and . We know many transformation formulas among such functions since Euler, Pfaff and Gauss, and some of them are quite new. Such formulas have various aspects and to find or prove them, different techniques have been used. For example, elliptic functions and computation using mathematical software played important roles.
In this paper, we give a new method to prove such formulas in a uniform and easy way. Recall that the hypergeometric function satisfies a linear ordinary differential equation of order two, and hence is characterized by this equation together with the initial values. The key of our method is the following canonical form of the hypergeometric differential equation
[TABLE]
where (Theorem 2.2). As the author learned after writing the first manuscript of the present paper, this form was known by Jacobi [15] (see also [22, §24]). Though it seems to have been scarcely used, at least unless , it has several advantages to the standard ones (see (2.1), (2.2)). It clarifies the symmetry under (then ), and behaves nicely under the change of variables we are to consider. Above all, it enables us to compute the differential equation for from that for in a straight-forward way.
Among the transformation formulas to be proved in this paper, of particular interest are the following ones, which have strong connection to number theory.
Theorem 1.1**.**
On a neighborhood of ,
[TABLE]
The quadratic formula (1.1) with two free parameters and is very classical, due to Gauss [10, p. 225, formula 101]. The special case where reduces to the Landen transformation formula
[TABLE]
for the elliptic integral of the first kind
[TABLE]
From this follows the formula
[TABLE]
where denotes the arithmetic-geometric mean of and ().
The cubic formula (1.2) for was first found by Ramanujan [23, second notebook, p. 258] in his study of elliptic functions to alternative bases. It was rediscovered and proved by Borwein-Borwein [4, p. 694] in their study of a cubic analogue of the arithmetic-geometric mean. The general case of (1.2) is due to Berndt-Bhargava-Garvan [3, Theorem 2.3].
The quadratic formula (1.3) for was also found by Ramanujan [23, p. 260]. This is also related with a generalized arithmetic-geometric mean, and a proof was given implicitly by Borwein-Borwein [4, Theorem 2.6] and explicitly by Berndt-Bhargava-Garvan [3, Theorem 9.4]. The general case of (1.3) was found recently by Matsumoto-Ohara [21, Corollary 3] (see Section 3.4).
One may expect that our method is useful not only for proving formulas, but also for finding new ones. In fact, the author found (1.3) independently before knowing [21]. For other transformation formulas which were found more recently, see for example Vidūnas [24].
This paper is constructed as follows. In Section 2, we derive the canonical form of the hypergeometric equation and explain our general method for transformation formulas. In Section 3, we give a short proof of Theorem 1.1, and discuss transformation formulas for multivariable hypergeometric functions of Appell and Lauricella. In Sections 4–6, we give proofs of other quadratic, cubic and quartic formulas and discuss their relations with the previous formulas. In the last Section 7, we study the analogy for -hypergeometric functions (basic hypergeometric functions) . We give a canonical form of the -hypergeometric difference equation (Theorem 7.5), and use it to give a new proof of Heine’s transformation formula (Theorem 7.6).
2. Generalities
2.1. Hypergeometric differential equation
Let us write differential operators as
[TABLE]
One sees easily from that satisfies the differential equation
[TABLE]
Since , (2.1) is equivalent to
[TABLE]
Further, it can be written as
[TABLE]
where we define by
[TABLE]
Then it is obvious that is another solution ( assumed).
Remark 2.1*.*
One cannot expect such a symmetry for the differential equation satisfied by a generalized hypergeometric function in general for . This can be seen from the asymmetry of the Riemann scheme.
The key observation of this paper is the following.
Theorem 2.2**.**
Put where . Then (resp. ) is the unique solution of the differential equation
[TABLE]
such that
[TABLE]
Proof.
For any function regarded as a multiplication operator, the identity of operators
[TABLE]
holds. Therefore,
[TABLE]
Hence follows the equivalence of (2.3) and (2.4). The initial values are immediate from the definition and the uniqueness is evident. ∎
Remark 2.3*.*
In fact, the differential equation is regular singular at , and (resp. ) is the unique holomorphic solution with (resp. ).
2.2. Transformation
We consider differential operators of the form
[TABLE]
If is a solution of the differential equation , we say for brevity that is a solution of , and that is a differential operator for . This type of differential equation is stable under a change of variables.
Lemma 2.4**.**
Let be a solution of and be a non-constant holomorphic function. Then is a solution of
[TABLE]
Proof.
Immediate from . ∎
Consider the differential operator as in (2.4)
[TABLE]
The following examples of are important. First, for a positive integer , let
[TABLE]
Then becomes by Lemma 2.4
[TABLE]
Secondly, for a positive integer , let
[TABLE]
Note that the map is an involution since
[TABLE]
We have
[TABLE]
so becomes by Lemma 2.4
[TABLE]
where we put
[TABLE]
Finally, consider
[TABLE]
This is the composition of the two substitutions as above. Then,
[TABLE]
In general, the resulting differential operator is not so simple. For , and , however, we have respectively
[TABLE]
Hence the term is a product of powers of , , and . This explains why, in each formula of Theorem 1.1, the differential equation for the right-hand side is of a manageable form.
2.3. Comparison
The formulas we prove are of the form
[TABLE]
where is a solution of a differential operator of order of the form
[TABLE]
Suppose that , and are holomorphic at , and
[TABLE]
Then the equality holds on a neighborhood of if and only if is also a solution of .
Lemma 2.5**.**
The identity of operators holds if and only if
[TABLE]
Proof.
Using , we have an equality of differential operators
[TABLE]
Hence , and the lemma follows. ∎
If the condition (2.8) holds, then
[TABLE]
hence is a solution of near . Conversely, if one seeks a transformation formula between and , one is led to find a function satisfying (2.8).
2.4. Linear Transformations
As easy examples of our method, let us prove the following formulas, respectively due to Euler and Pfaff. Later, we prove a -analogue of the former in a similar manner (see Theorem 7.6).
Theorem 2.6**.**
On a neighborhood of ,
[TABLE]
Proof.
(2.9). Put , and . Using the notations of Section 2.3, we have by Theorem 2.2
[TABLE]
The first equality of (2.8) is obvious and for the second,
[TABLE]
Since and , we have .
(2.10). Put , and . Then, by Theorem 2.2 and Lemma 2.4,
[TABLE]
and is the same as (2.9). The first equality of (2.8) is obvious. For the second, since , we have using the logarithmic derivatives
[TABLE]
Hence (2.8) follows. The comparison of the initial values is easy, and (2.10) follows. ∎
3. Proof of Theorem 1.1
Here we give direct proofs of the formulas (1.1), (1.2) and (1.3). Alternative proofs are given respectively in Section 4, Section 5 and Section 4.
3.1. Proof of (1.1)
Let be the right-hand side and be the in the left-hand side. Here, with the notations of Section 2.2, , , and
[TABLE]
By Theorem 2.2 and Lemma 2.4, we have with the notations of Section 2.3
[TABLE]
On the other hand, we have by (2.5)
[TABLE]
Letting , we have . Since , we have
[TABLE]
Hence the condition (2.8) is verified. Since and , we obtain . ∎
Remark 3.1*.*
The original proof of Gauss also compares the differential equations. In Erdélyi et. al. [8, p. 111, (5)], another proof is suggested but not explicitly.
3.2. Proof of (1.2)
Let be the right-hand side and be the in the left-hand side. Here, , , and
[TABLE]
By Theorem 2.2 and Lemma 2.4, we have
[TABLE]
Letting , we have
[TABLE]
One easily verifies the condition (2.7) and the coincidence of the initial values. Hence we obtain . ∎
Remark 3.2*.*
See Section 3.4 for another proof. When , other proofs are given by Borwein-Borwein-Garvan [5, Corollary 2.4], Chan [6, Sections 5 and 6], Cooper [7, Theorem 5.3] and Maier [20, Corollary 6.2].
3.3. Proof of (1.3)
Let be the right-hand side and be the in the left-hand side. Here, , , and
[TABLE]
By Theorem 2.2 and Lemma 2.4, we have
[TABLE]
Letting , we have
[TABLE]
Then, the rest is just as above. ∎
Remark 3.3*.*
See Section 3.4 for another proof. When , other proofs are given by Cooper [7, Theorem 5.3] and Maier [20, Corollary 6.2].
An investigation of our proofs suggests that, contrary to (1.1), the formulas (1.2) and (1.3) have no generalization to a formula with two free parameters.
3.4. Multivariable hypergeometric functions
The formula (1.2) (resp. (1.3)) is obtained as a specialization of a transformation formula for a hypergeometric function of two (resp. three) variables. Recall Lauricella’s hypergeometric function of variables [19]
[TABLE]
It converges on the open unit polydisk . When , this is the Gauss function and when , this is Appell’s function [1].
For and , we have the following transformation formulas.
[TABLE]
where
[TABLE]
[TABLE]
where
[TABLE]
The formula (3.1) is due Koike-Shiga [17, Proposition 2.5] for and Matsumoto-Ohara [21, Theorem 1] in general. The formula (3.2) is due Kato-Mastumoto [16, Proposition 1] for and Matsumoto-Ohara [21, Theorem 3] in general.
Then we obtain (1.2) (resp. (1.3)) from (3.1) (resp. (3.2)) by letting (resp. ), using the multinomial formula
[TABLE]
Note .
All the proofs in [16], [17] and [21] use mathematical software to compare the two systems of partial differential equations. One might be able to give simpler proofs by extending the method of this paper. Put
[TABLE]
Then, is a solution of the system of linear partial differential equations (see [13, Chapter 3, 9.1])
[TABLE]
In fact, this system is of rank . Using Theorem 2.2, we easily obtain the following.
Theorem 3.4**.**
Put
[TABLE]
Then, is the unique solution of
[TABLE]
such that
[TABLE]
4. Quadratic Formulas
The formulas (1.1) and (1.3) are two different combinations of the following formulas, the former due to Kummer [18, p. 78, Formula 44], and the latter due to Ramanujan (cf. [2, p. 50, Chap. 11, Entry 4]). These are in fact equivalent as is clear from the proof below.
Theorem 4.1**.**
On a neighborhood of ,
[TABLE]
Proof.
(4.1). Let be the right-hand side and be the in the left-hand side. By (2.6) with , the differential operator for is
[TABLE]
where . On the other hand, by (2.5) with , the differential operator for is
[TABLE]
Letting so that (with the notations of Section 2.3), we have
[TABLE]
and the condition (2.8) holds. Comparing the initial values, we obtain (4.1).
(4.2). If we replace with , the formula becomes
[TABLE]
By Theorem 2.2, the both sides satisfy the same differential equation as those of (4.1). Comparing the initial values at , we obtain the equality above, hence (4.2). ∎
Now, let us deduce (1.1) and (1.3) from Theorem 4.1. Rewrite (4.1) and (4.2) as
[TABLE]
where
[TABLE]
First, if we let , and , then
[TABLE]
Letting and equating the left-hand sides of (4.3) and (4.4), we obtain
[TABLE]
Since , it becomes (1.1) (in variable ) after a suitable change of parameters.
Secondly, if we let , then
[TABLE]
Letting and equating the right-hand sides of (4.3) and (4.4), we obtain
[TABLE]
Since , it becomes (1.3) (in variable ) after a suitable change of parameters.
Let us give short proofs of two other important formulas. First, the following is due to Gauss [10, p. 226, Formula 102].
Theorem 4.2**.**
On a neighborhood of ,
[TABLE]
Proof.
Letting , we have , . By Theorem 2.2 and Lemma 2.4, the differential operator for the right-hand side is
[TABLE]
which coincides with the differential operator for the left-hand side. Comparing the initial values, we obtain (4.5). ∎
The following is due to Kummer [18, Formula 53].
Theorem 4.3**.**
On a neighborhood of ,
[TABLE]
Proof.
Let as in the proof of (1.1). By Theorem 2.2 and Lemma 2.4, the differential operator for the right-hand side is
[TABLE]
Similarly, the differential operator for the in the left-hand side is
[TABLE]
Letting , one verifies (2.8). Comparing the initial values, we obtain (4.6). ∎
Remark 4.4*.*
In fact, (4.5) and (4.6) are equivalent to each other; apply Pfaff’s formula (2.10) to the left-hand side of (4.5) and then replace with , to obtain (4.6).
5. Cubic Formulas
A cubic analogue of Theorem 4.1 is the following formulas due to Goursat [11, p. 140, (127) and (126)]. Here we give short proofs and see that two different combinations of (5.1) and (5.2) give the formulas (1.2) and (5.3).
Theorem 5.1**.**
On a neighborhood of ,
[TABLE]
On a neighborhood of ,
[TABLE]
Proof.
If we let z=64x\bigl{(}\frac{1-x}{1+8x}\bigr{)}^{3}, then
[TABLE]
where we put
[TABLE]
By Theorem 2.2 and Lemma 2.4, the differential operator for the right-hand sides of (5.1) and (5.2) is
[TABLE]
Letting , we have
[TABLE]
and then
[TABLE]
Therefore, by Theorem 2.2 and (2.8), is nothing but the differential operator for (resp. for ) in the left-hand side of (5.1) (resp. (5.2)). Comparing the initial values at (resp. ), we obtain (5.1) (resp. (5.2)). ∎
As was found by Chan [6, Section 6], (1.2) is deduced from Theorem 5.1 as follows. If we let , and , then
[TABLE]
Equating the right-hand sides of (5.1) and (5.2) and letting , we obtain (1.2) (in variable ).
On the other hand, by equating the left-hand sides of (5.1) and (5.2), which is only possible for , we obtain
[TABLE]
on a neighborhood of . In particular, the both sides satisfy the same differential equation, which remains true near . Replacing with and comparing the initial values at , we obtain the following.
Corollary 5.2**.**
On a neighborhood of (resp. ),
[TABLE]
where (resp. ).
The author does not know if it is equivalent to a known formula.
6. Quartic formulas
Here we treat two quartic formulas. First, the following is an iteration of (1.1).
Corollary 6.1**.**
On a neighborhood of ,
[TABLE]
Proof.
As in the proof of (1.1) given in Section 4, let , and
[TABLE]
so that . Solving , we have , . Then by (1.1), we have
[TABLE]
Since , we obtain (6.1) (in variable ). ∎
The following formula of Matsumoto-Ohara [21, Corollary 2] is a specialization of a transformation formula (loc. cit. Theorem 2) for Appell’s function , whose proof is similar to (3.1) and (3.2). We give a direct proof.
Theorem 6.2**.**
On a neighborhood of ,
[TABLE]
Proof.
By Theorem 2.2 and (2.7), the differential operator for the right-hand side is given by
[TABLE]
On the other hand, by Theorem 2.2 and Lemma 2.4 applied to , the differential operator for the in the left-hand side is given by
[TABLE]
Letting , one easily verifies (2.8). Comparing the initial values, we obtain (6.2). ∎
Remark 6.3*.*
The author learned from Hiroyuki Ochiai that (6.2) is also obtained as a combination of (1.1) and (4.6) as follows. Use the same notations as in the proof of Corollary 6.1. By (1.1) with ,
[TABLE]
By (4.6) with ,
[TABLE]
Then (6.2) (in variable ) follows similarly as (6.1).
7. -analogues
We give a new canonical form of the difference equation for a -hypergeometric series which generalizes (2.4), and apply it to give a proof of Heine’s transformation formula.
7.1. Preliminaries
For the moment, let , , and be indeterminates. Recall the -Pochhammer symbol
[TABLE]
Here, for a ring , denotes its unit group. The -hypergeometric series is defined by
[TABLE]
This is a power series in with coefficients in .
Recall the -number
[TABLE]
Note that . We write symbolically and define the number
[TABLE]
Define a difference operator (-derivation) by
[TABLE]
Following Jackson [14], define the shift operator by
[TABLE]
and the difference operator by
[TABLE]
In particular, we have by definition
[TABLE]
Then we have
[TABLE]
Hence is the -analogue of the differential operator , where .
Any function (or a power series) defines a multiplication operator. To avoid possible confusion in writing operators, we write
[TABLE]
As an operator, means the composition of and .
Lemma 7.1**.**
For any , we have an identity of operators
[TABLE]
Proof.
Since
[TABLE]
for any , the lemma follows. ∎
7.2. -hypergeometric difference equation
Recall the differential equation (2.1) for . Similarly, since
[TABLE]
the series satisfies the difference equation
[TABLE]
where we used symbols , and . From this, we derive difference equations analogous to (2.2) and (2.3).
Proposition 7.2**.**
Put . Then is a solution of the difference equation
[TABLE]
or equivalently
[TABLE]
Proof.
In (7.1), substitute
[TABLE]
By Lemma 7.1, we have (operators), hence
[TABLE]
Then, using , we obtain (7.2), hence (7.3). ∎
Now, let with . Let , , , , , and assume . Then the power series defines an analytic function on . Note that, in the limit as , we have , , hence . Since and , the equation (7.1) (resp. (7.2), (7.3)) specializes to (2.1) (resp. (2.2), (2.3)).
We give a -analogue of (2.4). The function satisfies
[TABLE]
A -analogue of the function is the following.
Definition 7.3**.**
For , define a function by
[TABLE]
where . It converges on .
Recall that . Similarly, we have the -binomial theorem
[TABLE]
(see for example [9, (1.3.2)]). Moreover, one sees easily the following.
Lemma 7.4**.**
We have
[TABLE]
Our canonical form of the difference equation for is the following.
Theorem 7.5**.**
Put where . Then is the unique solution of the difference equation
[TABLE]
such that
[TABLE]
Proof.
By Lemma 7.1, (7.4) and Lemma 7.4, we have identities of operators
[TABLE]
Then we obtain (7.5) from (7.3), using
[TABLE]
The initial condition follows by
[TABLE]
and the uniqueness is evident. ∎
7.3. Transformation
The following transformation formula due to Heine [12, p. 325, XVIII] is a -analogue of Euler’s formula (2.9). We give an analogous proof using Theorem 7.5.
Theorem 7.6**.**
We have
[TABLE]
Before the proof, we introduce another notation.
Definition 7.7**.**
For , define an operator by
[TABLE]
Then, one easily shows the following.
Lemma 7.8**.**
There are identities of operators
[TABLE]
Proof of Theorem 7.6.
Put and . By Theorem 7.5, and are solutions of the difference operators, respectively,
[TABLE]
The right-hand side of (7.6) is a solution of . We show the identity of operators
[TABLE]
Then it follows that the left-hand side of (7.6) is also a solution of . Comparing the initial values, which is easy, we obtain the theorem.
We compute the left-hand side of (7.7). For the first term, using Lemmas 7.1, 7.4, 7.8 and (7.4), we have
[TABLE]
For the second term, we have similarly
[TABLE]
Finally for the last term, we have
[TABLE]
Then, using
[TABLE]
we obtain (7.7), hence the theorem. ∎
Acknowledgement
The author would like to thank Ryojun Ito, Hiroyuki Ochiai, Nobuki Takayama and Raimundas Vidūnas for helpful discussions. This work is supported by JSPS Grant-in-Aid for Scientific Research: 18K03234.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] P. Appell and J. Kampé de Fériet, Fonctions hypergéométriques et hypersphériques , Gauthier Villars, Paris, 1926.
- 2[2] B. C. Berndt, Ramanujan’s Notebooks, Part II , Springer-Verlag, New York, 1989.
- 3[3] B. C. Berndt, S. Bhargava and F. G. Garvan, Ramanujan’s theories of elliptic functions to alternative bases, Trans. Amer. Math. Soc. 347 , No. 11 (1995), 4163–4244.
- 4[4] J. M. Borwein and P. B. Borwein, A cubic counterpart of Jacobi’s identity and the AGM, Trans. Amer. Math. Soc. 323 (1991), No. 2, 691–701.
- 5[5] J. M. Borwein, P. B. Borwein and F. G. Garvan, Some cubic modular identities of Ramanujan, Trans. Amer. Math. Soc. 343 (1994), No. 1, 35–47.
- 6[6] H. H. Chan, On Ramanujan’s cubic transformation formula for F 1 2 ( 1 3 , 2 3 ; 1 ; z ) subscript subscript 𝐹 1 2 1 3 2 3 1 𝑧 {}_{2}F_{1}(\frac{1}{3},\frac{2}{3};1;z) , Math. Proc. Cambridge Philos. Soc. 124 (1998), No. 2, 193–204.
- 7[7] S. Cooper, Inversion formulas for elliptic functions, Proc. London Math. Soc. (3) 99 (2009), 461–483.
- 8[8] A. Erdélyi et al. ed., Higher transcendental functions, Vol. 1 , Mc Grow-Hill, New York, 1953.
