Superintegrable classical Zernike system
George S. Pogosyan, Kurt Bernardo Wolf, Alexander Yakhno

TL;DR
This paper analyzes the classical Zernike system, revealing its superintegrability through higher-order invariants and separation of variables in various coordinate systems, linking wavefront aberration classification to advanced Hamiltonian dynamics.
Contribution
It demonstrates that the classical Zernike system is superintegrable, with explicit invariants and separability properties, connecting wavefront aberration modeling to integrable Hamiltonian systems.
Findings
Trajectories are closed ellipses due to higher-order invariants.
The system's Hamilton-Jacobi action separates in multiple coordinate systems.
The Zernike system belongs to the class of superintegrable systems.
Abstract
We consider the differential equation that Zernike proposed to classify aberrations of wavefronts in a circular pupil, as if it were a classical Hamiltonian with a non-standard potential. The trajectories turn out to be closed ellipses. We show that this is due to the existence of higher-order invariants that close into a cubic Higgs algebra. The Zernike classical system thus belongs to the class of superintegrable systems. Its Hamilton-Jacobi action separates in three vertical projections of polar coordinates of a sphere, polar and equidistant coordinates on half-hyperboloids, and also in elliptic coordinates on the sphere.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Superintegrable classical Zernike system
George S. Pogosyan,111Departamento de Matemáticas, Centro Universitario de Ciencias Exactas e Ingenierías, Universidad de Guadalajara, México; Yerevan State University, Yerevan, Armenia; and Joint Institute for Nuclear Research, Dubna, Russian Federation. Kurt Bernardo Wolf222Instituto de Ciencias Físicas, Universiad Nacional Autónoma de México, Cuernavaca. and Alexander Yakhno333Departamento de Matemáticas, Centro Universitario de Ciencias Exactas e Ingenierías, Universidad de Guadalajara, México.
Keywords: Zernike system, Superintegrable Higgs algebra, Classical nonstandard Hamiltonian
Abstract
We consider the differential equation that Zernike proposed to classify aberrations of wavefronts in a circular pupil, as if it were a classical Hamiltonian with a non-standard potential. The trajectories turn out to be closed ellipses. We show that this is due to the existence of higher-order invariants that close into a cubic Higgs algebra. The Zernike classical system thus belongs to the class of superintegrable systems. Its Hamilton-Jacobi action separates in three vertical projections of polar coordinates of a sphere, polar and equidistant coordinates on half-hyperboloids, and also in elliptic coordinates on the sphere.
1 Introduction: the Zernike operator
In Reference [23, p. 700], Frits Zernike proposed a two-dimensional differential equation whose polynomial solutions provide an orthogonal basis for functions in a Hilbert space over the unit disk , which —importantly— have a constant absolute value on the boundary circle: . This Zernike basis is thus distinct from the well-known bases of Bessel functions over the disk whose values (or logarithmic derivatives) vanish on a boundary circle. The differential operator and eigenvalue equation of Zernike are
[TABLE]
The requirement that this operator be self-adjoint under the inner product , i.e., , constrains the coefficients to have the values [23]. In this paper however, we let and take arbitrary real values, to be later constrained to those regions that lead to the closed orbits that we consider to be the main feature of interest of the Zernike system.
For in (1), the polar factored solutions , , correspond to the eigenvalues ; when normalized to , the radial functions are the Zernike polynomials [23]. These can be related to the Jacobi polynomials whose interval of orthogonality is . It was remarked in Ref. [2] that the reasons for postulating Eq. (1) were rather arbitrary, so its authors used the Gram-Schmidt method to find the same polynomial solutions from first principles. Zernike polynomials have wide applications in the correction of optical aberrations by describing wavefronts at circular pupils (see for example Ref. [3]); they also display a host of enticing mathematical properties [13, 9, 18, 20, 22, 8] that are characteristic of algebraic structures.
When , reduces to a linear combination of generators of the real symplectic algebra 4 under Poisson brackets or commutators [21, Sect. 11.4]; when also , then (1) becomes simply the Laplace equation with plane wave solutions , or, adapted to polar coordinates , multipole solutions with Bessel functions, where the radial wavenumber may or may not be quantized according to whether the boundary conditions are set at a finite or infinite radius. On the other hand, when but , the Zernike equation (1) reduces to the kinetic part of a nonlinear oscillator Hamiltonian [4]. We shall keep their generic values and particularize when convenient.
We found that it is of interest to examine the classical counterpart of the Zernike system, which in ‘wave’ (or quantum mechanical) form is (1). The process of de-quantization of this equation consists in replacing
[TABLE]
The operator (1) thus yields a classical Hamiltonian function which depends on two coordinates and two momenta. In Cartesian and polar coordinates, it is
[TABLE]
and its value is the energy . The appearance of in this Hamiltonian seems indeed anomalous, yet our calculations will show that at the end we have a purely real classical system whose trajectories can be found explicitly.
The Hamilton-Jacobi method is particularly apt to solve this system, where we shall preferentially use the polar coordinates and their momenta in (5). Since is independent of time and the angular coordinate is cyclic, the action function (also called Hamilton’s principal function) that satisfies the Hamilton-Jacobi equation can be separated in the form
[TABLE]
The space derivatives of this function yield the polar momenta and as
[TABLE]
In Sect. 2 we shall use the derivatives of (6) with respect to the radius and the angle , to find the geometric trajectories , which are closed ellipses. Then in Sect. 3 the dynamical trajectories will be found differentiating the action with respect to the energy. The symmetries behind the closure of the orbits will be elucidated in Sect. 4, where Eq. (1) is separated in three spherical, six hyperbolic, and elliptic coordinates, and shown to lead to constants of motion. In Sect. 5 we show that the operators which characterize these constants close into a cubic superintegrable algebra, and offer some additional comments.
2 Geometric trajectories
The derivative of the action function (6) with respect to the radius is the radial momentum,
[TABLE]
Replacing in (5) yields a quadratic algebraic equation for the derivative of , namely
[TABLE]
whose two solutions are
[TABLE]
From here we find through the indefinite integral
[TABLE]
We can now find the trajectories that relate and by differentiating (6) with respect to ,
[TABLE]
where is a constant of the motion given by the initial conditions. The derivative of in (11) with respect to , is then
[TABLE]
where in the last equality we have substituted with , and we define
[TABLE]
We note that the imaginary summand in (11) is absent from this equation and thus from the system. The double sign in (13) corresponds to the angular momentum of a trajectory traversed in opposite directions.
One finds the indefinite integral solved in [6, Eqs. 2.266], with various expressions involving inverse trigonometric and hyperbolic functions, or logarithms, depending on the signs of the constants; in our case (16) and for , the integral is
[TABLE]
Thus, joining Eqs. (12), (16), and (17), we obtain
[TABLE]
and this leads to in the form
[TABLE]
We can invert the dependence to by solving for the square radius and setting for convenience ,
[TABLE]
This is the parametric equation for ellipses, provided that
[TABLE]
These conditions restrict the range of energies and angular momenta where the trajectories are real and closed. As shown in Fig. 1 (left) for the generic Zernike range , , the first condition excludes the energy interval between the two parabolas, ; the second inequality is (for ) a lower bound (equal to for the Zernike case); lastly, the third condition excludes the interior of the parabola that has its apex at the origin, and which eliminates the region that was left allowed by the previous two conditions.
In Fig. 1 (right) we show the allowed regions for the generic Zernike range , . The two parabolas stemming from the first inequality in (24), under reflect the -axis; the second inequality in (24) is now the upper bound ; and the third inequality allows elliptic orbits in the remaining interior of the parabola, namely for . Finally, when , the ‘forbidden’ region between the two parabolas due to the first condition in (24) becomes , while the second two conditions are satisfied by , so that closed elliptical trajectories occur for all .
Since we took , the -axis is at and the -axis at . The semi-major and semi-minor axes of the ellipse are, respectively,
[TABLE]
The area of this ellipse is given by times the product of the two semi-axes,
[TABLE]
3 Dynamical trajectories and orbits
We return now to the integral expression for in (11), differentiating the action in (6) now with respect to the energy ,
[TABLE]
where is the initial time constant. Instead of (13)–(15), we now have
[TABLE]
where as before we have set , and are again given by (16). The indefinite integral can be found in [6, Eqs. 2.261]; it is
[TABLE]
The conditions for this integral to be proper, and also lead to (24), while the solutions corresponding to (19) are now
[TABLE]
with and given by (19).
From here we can extract the dependence of the square radius of the trajectory on time as (23) did for the angle. We choose such that is the semi-major axis in (25), i.e., , so , and write
[TABLE]
This is a periodic function of time, with period , or
[TABLE]
In the generalized Zernike range , the radicand is positive; when , the second inequality in (24) prevents the orbits from being closed for . Although orbits in the Zernike range are ellipses, they differ from the isochronous orbits of the classical harmonic oscillator, whose period does not depend on their energy [5].
As a function of time, the trajectories \Big{(}x(t),\,y(t)\Big{)} can be found from the previous expressions, (23) and (33), as
[TABLE]
and are shown in Fig. 2 for the Zernike case , but are valid for the range .
The trajectories are circular when , i.e., or . This is the case of the upper right and lower left trajectories in Fig. 2. For it occurs on the two parabolas that bound the region excluded by the first condition in (24) and respect the other two inequalities. The radius of those circles can be found from (23), as . At the upper boundary one has , so in the Zernike region this means , which in turn entails that , or , which yields the radius of the circle as ; in the case this is the boundary of the unit circle of Zernike’s differential equation [23]. On the other hand, at the lower boundary in the same Zernike range , , and one has , which for exceeds the unit radius allotted by Zernike’s requirement. We conclude that the elliptic trajectories in the lower ‘allowed’ region of Fig. 1 (left) cannot correspond with solutions of the Zernike differential equation (1). Only those in the upper region do. On the other extreme of the region, the trajectories become lines when , namely for ever larger and also when approaches the lower boundary .
Regarding the region in Fig. 1 (right), the excentricity in (23) is on the parabola . The radii of those circles can be found as we did above, yielding . The trajectory is a unit circle when , i.e., . This value falls on a single point of the parabolic boundary of the allowed region in Fig. 1 (right). On the upper boundary of that region, , the excentricty is and the trajectores are lines. Finally, when and the allowed region is , on its boundary we have circles of radii .
4 Separation of variables and symmetries
The classical Zernike Hamiltonian (4) in Cartesian coordinates can be subject to the Hamilton-Jacobi method of solution with the action partial derivatives and , and yields the Hamiltonian (4) written as
[TABLE]
This equation is separable on the -plane, but the boundary condition imposed by Zernike [23] on the solutions, namely that their absolute value at the boundary be constant, can only be separated in polar coordinates, as we did in Sect. 2. Although the classical Zernike system appears to belong to the class of Bertrand systems [1] in which all bounded orbits are closed, it does not qualify as such because the linear and quadratic terms replace the two-dimensional central force potentials of the Coulomb or isotropic oscillator systems. We surmise that this feature is a specific consequence of the superintegrability of the Zernike system. It is therefore of interest to find any additional separable systems of orthogonal coordinates and, associated with these, the extra symmetry operators that will clearly demonstrate the classical Zernike Hamiltonian to be superintegrable. We remind the reader that in an -dimensional space with constant curvature (real or complex), a maximally superintegrable system allows, in addition to the Hamiltonian , another functionally independent constants of motion, , , … , , , that are in involution with , namely for [12].
4.1 Coordinate systems on sphere and hyperboloid
Equation (1) is linear and of second order,
[TABLE]
According to the standard classification, this equation is of elliptic type when , of parabolic type when , and of hyperbolic type when . The original Zernike case is in the range , where the region of ellipticity is the interior of the circle . On the other hand, when , the equation (1) is of elliptic type over the whole - plane .
To be within the Zernike case we consider first the range , and map the open disk on the hemisphere , , embedded in a Euclidean space with three Cartesian coordinates , through the orthogonal (or ‘vertical’) projection
[TABLE]
In these coordinates the Hamiltonian equation (37) can be separated into three mutually orthogonal spherical systems of coordinates [15],
[TABLE]
and in the elliptical system of coordinates [15, 10, 11] to be seen below.
Still within the case, we can consider the outside of the circle at radii , where the equation (38) is hyperbolic. There one can map the trajectories of the - plane on trajectories on the one-sheeted half-hyperboloid . Coordinates that permit separation of variables for (38) replace trigonometric functions by hyperbolic functions thus:
[TABLE]
On the other hand when , the region of ellipticity being the whole plane , allows one to map this plane on the upper sheet of the two-sheeted hyperboloid using ‘modified’ coordinate systems:
[TABLE]
The hyperboloidal coordinates in (43)–(48) have been defined in Ref. [16].
4.2 Separation in spherical systems I, H′I and HI
In the spherical coordinates () of System I in (40) for , the Hamilton-Jacobi expression in (37) acquires the form
[TABLE]
This equation is integrable with the help of the first-order integral of motion
[TABLE]
that is independent of and separates the action function as , leading to the equation
[TABLE]
Using the same approach of Sect. 3 for the Zernike case, one finds the trajectory to be
[TABLE]
where and are given in (23), and which lies within the hemisphere of radius , as seen in Fig. 3. The trajectories reach the rim only when .
Still in the case, the pseudo-spherical coordinates () of System H′I in (43) allow separation of the action function as , so the Hamiltonian (37) leads to the equation
[TABLE]
Then the trajectories, instead of (52), are given by
[TABLE]
with and given in (23). These are closed orbits in the region . In Figure 4 we show such trajectories on the one-sheeted half-hyperboloid.
Turning now to the case for the pseudo-spherical system (46), the separation of variables yields
[TABLE]
so that the trajectory is found as
[TABLE]
lying on one sheet of a two-sheeted hyperboloid , and where again and are given in (23). The orbits on this manifold are elliptic and are shown in Fig. 5
4.3 Separation in coordinate systems II and HII
The second system of spherical coordinates in (41) leads to the Hamiltonian (37) in the form
[TABLE]
When , separation of variables applies on the action function and leads to the pair of equations
[TABLE]
where is a separation constant. Rewriting (59) in Cartesian coordinates, we obtain
[TABLE]
The integration in yields a second integral of motion that depends on the parameters ,
[TABLE]
In the case , the action function admits separation of variables in the hyperbolic equidistant system HII in (44), and yields the two equations
[TABLE]
which lead to the same integral of motion in (61).
4.4 Separation in the coordinate system III
The third spherical system of coordinates in (42) leads to the Hamilton-Jacobi form (37) written as
[TABLE]
In the case , for , the separation of variables in the action function, leads to
[TABLE]
From (66) we find a third constant of motion that depends on ,
[TABLE]
and which under the phase space -rotation coincides with in (61). Finally, we note that when , the separations of variables (46)–(48) on the hyperboloid yield the same integrals of motion , and given above.
We note that, unlike the three orthogonal coordinate systems on the sphere, on hyperboloids there are nine orthogonal coordinate systems where the Laplace and the Helmholtz equations yield to separation of variables [14].
4.5 Separation of variables in the elliptic system
The Hamilton-Jacobi equation (37) also yields to separation in elliptic coordinates on the sphere in trigonometric form [15, 10, 11],
[TABLE]
where the constants and are related to the interfocal distance of the ellipses on the upper unit hemisphere, so that . When and thus , the action function separates as , and leads again to two equations,
[TABLE]
where is a separation constant, and . Eliminating from these equations one obtains
[TABLE]
Returning to Cartesian coordinates,
[TABLE]
we can express the constant as
[TABLE]
Thus the elliptic separation constant is not functionally independent but depends on the constants and in (50) and (61).
5 Algebraic structure and conclusions
We have found three functionally independent integrals of motion, in (50), in (61), and in (67) with no singularities on the full parameter space. To probe their algebraic structure let us define
[TABLE]
The function is -angular momentum and its Poisson operator generates rotations of phase space, while the function does depend on . These functions Poisson-commute with the Zernike Hamiltonian function in (4), which can be written as
[TABLE]
but do not commute with each other. This shows that the generalized classical -Hamiltonian of Zernike, in (5), is superintegrable on each of the domains examined above, in particular on the -disk , for , that contains the Zernike original case .
To identify the symmetry of the generalized Zernike -Hamiltonians, we introduce a new integral of motion through the Poisson bracket of (75) and (76),
[TABLE]
which also Poisson-commutes with , and is functionally independent of and , although it can be seen that and are connected to each other by a rotation of in the – phase space planes. The algebraic structure of three functions is thus found to be
[TABLE]
They form therefore a cubic Higgs algebra [7] that Poisson-commutes with the generalized Zernike Hamiltonian, .
When so , the Zernike Hamiltonian becomes a simpler quadratic function,
[TABLE]
The Poisson operators of all quadratic functions of these four phase space coordinates close under commutation into the real symplectic Lie algebra sp(,R).
The Hamiltonian (81) belongs to the elliptic orbit of harmonic oscillators [21, Chap. 12], as can be seen under the complex linear canonical transformation
[TABLE]
This maps (81) on a regular harmonic oscillator,
[TABLE]
and the three constants of the motion, in (75), (76) and (78), on
[TABLE]
whose Poisson brackets close into a scaled u() Lie algebra,
[TABLE]
In the paraxial geometric or wave optical interpretation, the central generates isotropic fractional Fourier transforms [19], while generates anisotropic ones, generates rotations, and generates gyrations [17] that transform Hermite-Gauss into Laguerre-Gauss beams. Together their Poisson operators form the Fourier algebra [19], which is the maximal compact subalgebra in sp(,R). If were a pure imaginary number, (83) would be the repulsive oscillator Hamiltonian and (84)–(86) its commuting ‘Fourier’ algebra ; a similar treatment of the classical system with Hamiltonian (4) would yield hyperbolic orbits. For a free system with an inhomogeneous iso(2) ‘Fourier’ algebra would appear.
The original Zernike system in (1) [23] was proposed to develop a set orthogonal and complete set of two-variable orthogonal polynomials , , , which present the same -pattern as the two-dimensional quantum harmonic oscillator states. There has been some effort in replicating the raising and lowering techniques of the oscillator scheme on the Zernike system [22, 18] without achieving a proper Lie algebra. Because here we have a two-parameter system , we could surmise that superintegrable systems can be obtained as a new kind of algebra deformation, from (83)–(87) to (75)–(80), consisting in the addition of the square of an element of a Lie algebra to the generator designed to be the original quadratic Hamiltonian. Imposing boundary conditions such as those proposed by Zernike will need the quantum treatment of this construction.
Acknowledgements
We acknowledge the interest and early discussions with Prof. Natig M. Atakishiyev (Instituto de Matemáticas, unam); we thank Guillermo Krötzsch (icf-unam) for indispensable help with the figures. G.S.P. and A.Y. thank the support of project pro-sni-2016 (Universidad de Guadalajara). K.B.W. thanks Cristina Salto-Alegre (Posgrado en Ciencias Físicas, icf-unam) for her interest and interaction on the subject, and acknowledges the support of unam-dgapa Project Óptica Matemática papiit-IN101115.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] J. Bertrand, Théorème relatif au mouvement d’un point attiré vers un centre fixe, C. R. Acad. Sci. 77 , 849–853 (1873).
- 2[2] A. B. Bhatia and E. Wolf, On the circle polynomials of Zernike and related orthogonal sets, Math. Proc. Cambridge Phil. Soc. 50 , 40–48 (1954).
- 3[3] M. Born and E. Wolf, Principles of Optics: Electromagnetic Theory of Propagation, Interference and Diffraction of Light 7th ed. (Cambridge University Press, 1999). p. 986.
- 4[4] J. F. Cariñena, M. F. Rañada and M. Santander, Two important examples of nonlinear oscillators, ar Xiv:math-ph/0505028.
- 5[5] J. F. Cariñena, A. M. Perelomov and M. F. Rañada, Isochronous classical systems and quantum systems with equally spaced spectra, J. Phys.: Conf. Ser. 87 , 012007, 4 p. (2007).
- 6[6] I. S. Gradshteyn and I. M. Ryzhik, Table of Integrals, Series, and Products (6th Ed., Academic Press, 2000).
- 7[7] P. W. Higgs, Dynamical symmetries in a spherical geometry, J. Phys. A 12 , 309–323 (1979).
- 8[8] M. E. H. Ismail and R. Zhang, Classes of bivariate orthogonal polynomials, SIGMA 12 , 021 (2016), ar Xiv:1502.07256 v 3 [math.CA].
