Lie groupoids in information geometry
Katarzyna Grabowska, Janusz Grabowski, Marek Ku\'s, Giuseppe Marmo

TL;DR
This paper introduces a novel framework using Lie groupoids and Lie algebroids to unify and generalize contrast functions in information geometry, leading to new geometric insights and examples.
Contribution
It establishes Lie groupoids and Lie algebroids as the proper setting for contrast functions, connecting them to metrics and connections in a unified geometric framework.
Findings
Contrast functions induce pseudo-Riemannian metrics on Lie algebroids.
The framework includes classical two-point functions and their geometric structures.
An example of the Fubini-Study metric on Hilbert space is derived.
Abstract
We demonstrate that the proper general setting for contrast (potential) functions in statistical and information geometry is the one provided by Lie groupoids and Lie algebroids. The contrast functions are defined on Lie groupoids and give rise to two-forms and three-forms on the corresponding Lie algebroid. If the two-form is non-degenerate, it defines a `pseudo-Riemannian' metric on the Lie algebroid and a family of Lie algebroid torsion-free connections, including the Levi-Civita connection of the metric. In this framework, the two-point functions are just functions on the pair groupoid with the `standard' metric and affine connection on the Lie algebroid . We study also reductions of such systems and infinite-dimensional examples. In particular, we find a contrast function defining the Fubini-Study metric on the Hilbert projective space.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Lie groupoids in information geometry
Katarzyna Grabowska111email:[email protected]
Faculty of Physics, University of Warsaw
Janusz Grabowski222email: [email protected]
Institute of Mathematics, Polish Academy of Sciences
Marek Kuś333email: [email protected]
Center for Theoretical Physics, Polish Academy of Sciences
Giuseppe Marmo444email: [email protected]
Dipartimento di Fisica “Ettore Pancini”, Università “Federico II” di Napoli
and Istituto Nazionale di Fisica Nucleare, Sezione di Napoli
Abstract
We demonstrate that the proper general setting for contrast (potential) functions in statistical and information geometry is the one provided by Lie groupoids and Lie algebroids. The contrast functions are defined on Lie groupoids and give rise to two-forms and three-forms on the corresponding Lie algebroid. If the two-form is non-degenerate, it defines a ‘pseudo-Riemannian’ metric on the Lie algebroid and a family of Lie algebroid torsion-free connections, including the Levi-Civita connection of the metric. In this framework, the two-point functions are just functions on the pair groupoid with the ‘standard’ metric and affine connection on the Lie algebroid . We study also reductions of such systems and infinite-dimensional examples. In particular, we find a contrast function defining the Fubini-Study metric on the Hilbert projective space.
1 Introduction
Information geometry studies statistical-probabilistic models equipped with a differential structure. It started with the pioneering work of Rao [23] and was brought to a mature level with works of Amari [5, 3, 2] and many others [9, 10, 11].
The main consideration is that when dealing with probability distributions, two aspects play a relevant role. The first one is a notion of “distinguishability” or “distance” measuring the relative difference between two probability distributions, the second one is connected with the possibility to compose distributions by means of convex combinations, i.e., treating them as elements of a convex set of an affine space. Information geometry aims at treating these aspects from the point of view of differential geometry, where the notion of distance will be associated with a metric by means of geodesic distances, while the composition will be associated with a connection. In general the connection will not be the Levi-Civita one and, in many instances, it turns out to have curvature different form zero; therefore one looks for alternative connections, which are (possibly) flat and would allow for a notion of a “convex” composition. This is equivalent to the existence of an affine coordinate system.
Our aim in this note is to convince the reader that the proper general setting for contrast (potential) functions in statistical and information geometry is the one provided by Lie groupoids and Lie algebroids. The contrast functions are defined on Lie groupoids (in the standard case the pair groupoid ) and vanish on the submanifold of units (the diagonal in the standard case). The corresponding statistical manifolds are given by a two-form and a three-form on the corresponding Lie algebroid. If the two-form is non-degenerate, it defines a ‘pseudo-Riemannian’ metric on the Lie algebroid and a family of Lie algebroid torsion-free connections, including the Levi-Civita connection of the metric. The three-tensor controls the deviation for the Levi-Civita connection. In this framework, the two-point functions are just functions on the pair groupoid with the ‘standard’ metric and affine connection on the Lie algebroid . Naturally understood reductions can lead from the ‘two-point function case’ to contrast function on more complicated manifolds (see Example 8.1).
This generalization is not only completely natural, but offers a wide field of new applications and examples. Our approach is coordinate-free, so that the corresponding differential calculus can be done in the framework of Banach manifolds. In particular, we study a contrast function defining the Fubini-Study metric on the Hilbert projective space in an infinite dimension.
2 Information geometry
As it was written above, the main underlying idea of information geometry is to give and analyze a geometric structure of the set of probability distributions pertinent to the problem in question [5, 3, 13]. To this end one introduces a statistical manifold, i.e., a triple , where is a differential manifold parameterizing a family of probability distribution, is a metric tensor on , and is a third order skewness tensor on characterizing its flatness.
Let us first start with the classical context. Let denote the space of probability distributions on a measure space with a measure . The statistical manifold gives a parameterization of (a submanifold of) by an injective map . In a coordinate system the Fisher-Rao metric (the tensors ) and the skewness tensor take the form
[TABLE]
[TABLE]
For a given tensors and we define a family of torsionless connections by its Christoffel symbols
[TABLE]
where are the Christofell symbols for the metric tensor (the Levi-Civita connection).
For all vector fields on we have a duality property
[TABLE]
A torsionless connection is self-dual if , what implies and, consequently, identifies the Levi-Civita connection as the only torsionless one that is self-dual.
An alternative characterization of the geometric properties of the statistical structure on can be given by introducing two torsionless connections and in terms of which . If both and are flat the statistical manifold is called dually flat [5, 3, 7]. It is to note that the space of pure states of a finite-level quantum system treated as a statistical manifold does not admit a dually flat structure [7, 8].
The above outlined geometrical structure of a statistical manifold can be generalized in the following different way [5, 3]. We introduce a two-point potential function which is usually a “directed” distance quantifying the relative distinguishability of two probability distributions [13] and thus often goes under the name of contrast function or divergence. For all the potential function is non-negative, , and vanishes exactly on the diagonal, i.e., , if and only if . Let, as above, be a coordinate system on the first manifold and on the second. If is at least the condition imposed on imply [21],
[TABLE]
The metric and the torsion tensors are then given as
[TABLE]
and
[TABLE]
The condition (5) says that if we immerse diagonally, , then , and, taking into account the non-negativity of , the imposed conditions mean thus that it has a local minimum on the diagonal.
If additional requirements are imposed on , they will provide the metric with additional properties. Thanks to the geometrical formulation of quantum mechanics [6], it is possible to use this description also in the quantum setting.
3 Lie groupoids
To make the paper relatively self-contained we decided to summarize in two next sessions basic informations about Lie groupoids and Lie algebroids. In our presentation we used, only slightly adapted, lecture notes by Meinrenken [22], but one can also use the book of Mackenzie [19] as a source of concepts, examples and references.
The structure of a Lie groupoid, , involves a manifold of arrows, a submanifold of units (objects), and two surjective submersions , called source and target, such that .
One thinks of as of an arrow from its source to its target , with embedded as trivial arrows. The arrows and can be composed, , provided :
[TABLE]
such that , . The composition is associative, i.e., , whenever
[TABLE]
Elements act as units , , and there is an inverse map , such that , , and and are units, and , respectively.
Note that the groupoid structure of is completely determined by the graph of the composition,
[TABLE]
Whenever we write , we implicitly assume that and are composable.
A morphism of Lie groupoids, is a smooth map, such that .
Example 3.1**.**
A Lie group is a Lie groupoid with a unique unit.
Example 3.2**.**
For any manifold one can construct the pair groupoid , with , and
[TABLE]
The units are given by the diagonal embedding .
Example 3.3**.**
Let be a -principal bundle. The Atiyah groupoid is the groupoid structure on , where the class is taken with respect to the equivalence relation . We have , , and , provided . Of course, .
4 Lie algebroids
Infinitesimal parts of Lie groupoids are Lie algebroids. A Lie algebroid over is a vector bundle together with a Lie bracket on the space of its sections, such that there exists a vector bundle map covering the identity on , called the anchor map, satisfying the Leibniz rule,
[TABLE]
for all , .
Example 4.1**.**
A Lie algebroid over a point is the same as a finite-dimensional real Lie algebra.
Example 4.2**.**
The tangent bundle with its usual bracket of vector fields is a Lie algebroid wit .
Example 4.3**.**
For every principal -bundle, the bundle is a Lie algebroid called the Atiyah algebroid of . The bracket on is induced from the identification of elements of with -invariant vector fields on , while the anchor map is induced from .
4.1 The Lie algebroid of a Lie groupoid
We now define the Lie algebroid of a Lie groupoid . As a vector bundle, we take to be the normal bundle of in . To define the anchor map, note that and coincide on , so the difference vanishes on and hence descend to a map .
A vector field on is called left-invariant if it is tangent to target fibers and (note the left multiplication). Similarly is right-invariant if it is tangent to source fibers and satisfies . By construction, the spaces and of left (resp., right) invariant fields on form Lie subalgebras. Since , are complements to in , each of this bundles may be identified with the normal bundle.
For we denote by and the unique left-invariant and right-invariant vector fields, such that and .
Proposition 4.4**.**
For all we have
[TABLE]
where is the inclusion of units. Furthermore
[TABLE]
Here, means that the vector fields and are -related. Moreover
[TABLE]
where and .
Proposition 4.5**.**
There exists a unique Lie bracket on , such that
[TABLE]
This is a Lie algebroid bracket with the anchor . Moreover,
[TABLE]
Example 4.6**.**
If is a Lie group, then is the Lie algebra of .
Example 4.7**.**
If is a pair groupoid, then .
Example 4.8**.**
If is an Atiyah groupoid associated with a -principal bundle , then is the Atiyah algebroid of .
4.2 Lie algebroid connections
Let be a Lie algebroid. An -connection on a real vector bundle is a bilinear map
[TABLE]
with the properties: , .
For the -connection is the standard affine connection. On the other hand, every -connection determines an -connection by setting .
The curvature of an -connection is tensor field defined by
[TABLE]
Thus, if and only if the map preserves the brackets. In this case the connection is called flat or a representation of .
In the case , the torsion of an -connection is the tensor field given by
[TABLE]
A pseudo-Riemannian metric on is a tensor field , where is the symmetric tensor product , such that defines isomorphism of vector bundles.
Proposition 4.9**.**
For every pseudo-Riemannian metric on there is a unique torsion-free -connection on that is metric, i.e. . We call it the Levi-Civita connection of .
The proof is the same as in the standard case:
[TABLE]
The dual connection of an -connection on a pseudo-Riemannian Lie algebroid is the connection defined by
[TABLE]
Hence, the Levi-Civita connection is a self-dual, torsion-free connection.
5 Contrast functions on Lie groupoids and dualistic structures
Our aim in this section is to show that the right framework for contrast functions, as they were defined for stochastic models above, is the theory of the Lie groupoids and Lie algebroids. In this sense, the two point contrast function is viewed as a one-point function of the pair groupoid . The metric induced by in the standard case is the (pseudo-Riemannian) metric on the Lie algebroid . In the standard case it reduces to a metric on or, as we use to say in such a case, on . Also the pair of dual connections defined by the two-point contrast function can be generated as the pair of dual -connections with . In this sense, the expected and observed -geometries of a statistical model introduced by Chentsov and Amari (cf. [1]) are particular instances of geometries derived from contrast functions on Lie groupoids. Consequently, these statistical geometries may be studied within this unified framework.
We start with the following Lemma,
Lemma 5.1**.**
Let be a submanifold in and let vanishes on together with all its derivatives up to an order , i.e. the -th jet of vanishes on , . Then, for vector fields on , the derivative
[TABLE]
defines a symmetric -tensor on that depends only on the class in the normal bundle . In other words, .
Proof.
Observe first that is symmetric. For it suffices to show that it does not change under the transposition of neighboring arguments. We have
[TABLE]
since the -th jet of vanishes on . Further, is a tensor, since
[TABLE]
Finally, if , then , so that since vanishes on . ∎
Let now be a Lie groupoid and be a smooth function vanishing on . We say that is a contrast function if . A direct consequence of the Lemma 5.1 is the following
Proposition 5.2**.**
Any contrast function defines on the Lie algebroid a symmetric 2-form by , where and are any vector fields on representing at points of . In particular,
[TABLE]
Of course, if , then is non-negatively defined.
A symmetric 2-form on we will call a pseudometric. We call the contrast function regular if has constant rank as a morphism of vector bundles , and metric if is a pseudo-Riemannian (non-degenerate) metric on , i.e. is an isomorphism. If is metric, then is a Riemannian metric on .
Example 5.3**.**
Define by
[TABLE]
Let . We have
[TABLE]
Remark 5.4**.**
Metric contrast functions on are called yokes in [9] (see also [10, 11] for the idea of generating tensors from yokes).
Proposition 5.5**.**
Let be a contrast function on . Then, is also a contrast function and .
Proof.
For being sections of we have (cf. (11))
[TABLE]
so is a contrast function. Moreover,
[TABLE]
∎
For a metric contrast function on we define on the Lie()-connections and by
[TABLE]
where .
Theorem 5.6**.**
* and are dual, torsion-free -connections on , such that is the Levi-Civita connection with respect to . In particular, if , the connection is the Levi-Civita connection for .*
Proof.
Let us first prove that and are -connections. Linearity with respect to is clear. Since, for we have
[TABLE]
the equations (15) properly define and as sections of . Similarly,
[TABLE]
Finally,
[TABLE]
so that . Now, we have
[TABLE]
But,
[TABLE]
Finally,
[TABLE]
so that
[TABLE]
that shows that . Note that and are indeed torsion-free,
[TABLE]
∎
Corollary 5.7**.**
The three tensor defined by
[TABLE]
is totally symmetric, . We have
[TABLE]
and
[TABLE]
In particular, if .
Proof.
We can extend coordinates on to coordinates on such that . Then,
[TABLE]
so that has vanishing second jets at points of . Hence, according to Lemma 5.1, does not depend on , , and the order of them, if they represent fixed vectors in . Thus
[TABLE]
and
[TABLE]
where are coefficients of . ∎
Example 5.8**.**
Let be a function on the unitary group given by
[TABLE]
Similarly as in Example 5.3 we get for , g^{F}(X,Y)=-2\mbox{\mathrm{tr}}(XY). Here, since . Hence, will be the Levi-Civita connection for g^{F}(X,Y)=-2\mbox{\mathrm{tr}}(XY). We have . Consequently,
[TABLE]
Similarly, Y^{L}Z^{R}F(U)=-\mbox{\mathrm{tr}}\left(ZUY+Y^{\dagger}U^{\dagger}Z^{\dagger}\right) and
[TABLE]
Hence, .
We will call such a -structure on a Lie algebroid , with and being dual -connections with respect to the metric , a dualistic structure.
6 Statistical vector bundles
If the contrast function is not metric, the Levi-Civita connection does not have a clear sense, but both symmetric tensors and are still properly defined. However, they do not depend on the Lie algebroid structure on as in the case of connections.
Since there is a tubular neighhbourhood of in which is diffeomorphic with a neighbourhood of in and such that acts as the multiplication by , we can always view contrast functions as defined on a neighbourhood of the zero-section in . In the classical situation, we replace two-point function with a function on the tangent bundle. Now, we define symmetric tensors and by
[TABLE]
Here, are any vector fields on whose values at points of coincide with the values of vertical lifts of , respectively. Actually, we can take just the vertical lifts:
[TABLE]
where . For instance, in affine coordinates on one has for .
Thus, we can see contrast functions as defined on , so that we have the associated triple consisting of the vector bundle , and symmetric covariant 2- and 3-tensors and , respectively.
Such a triple we will call a statistical vector bundle. This is with the analogy to the terminology of Lauritzen in [4] (cf. also [13]), where statistical vector bundles for and being metric are called statistical manifolds. If is metric, we will speak about metric statistical vector bundles, and if is additionally positively defined – about Riemannian statistical vector bundles.
Proposition 6.1**.**
Any statistical vector bundle structure on comes from a globally defined contrast function on . In particular, if then any torsion-free dualistic structure on comes from a global contrast function on .
Proof.
Take a statistical vector bundle . Let be a locally finite covering of by coordinate systems and , where is the projection. In affine coordinates on , the tensors take the forms
[TABLE]
Let , , on a neighborhood of M, and let us put now , where and
[TABLE]
Here, on , , and . For being sections of we clearly have
[TABLE]
Similarly, .
In the case of a torsion free dualistic structure on a Lie groupoid , we carry over the whole structure on using a tubular neighborhood of in . ∎
The above proposition is a generalization of the result by Matumoto [21] telling that any statistical manifold has a two-point contrast function.
7 Higher contrast functions
Of course, according to Lemma 5.1, starting with functions with higher order jets vanishing on a submanifold (higher contrast functions), we get higher order symmetric tensors on the normal vector bundle .
A smooth function on a Lie groupoid (or a Lie algebroid , which gives analogous results) we call a contrast function of degree if the th jet of vanishes on , . Then we can define symmetric tensors
[TABLE]
by
[TABLE]
or, on the Lie algebroid,
[TABLE]
The existence of easily follows from the fact that is a contrast function of degree if is of degree . Such a triple we will call a statistical vector bundle of degree . As before, any such structure is generated by some contrast function of degree . In the case we can speak about statistical manifold of degree .
Note that the tensor could be understood as ‘higher metric’ if it is non-degenerate in the sense that
[TABLE]
In other words a ‘higher metric’ would be the symmetric analogue of a multisymplectic form.
Starting with a contrast function of degree on a Lie groupoid with the Lie algebroid , and with the analogy to (cf. (15)), we can define a -linear map
[TABLE]
by
[TABLE]
It is easy to see that, for ,
[TABLE]
and
[TABLE]
so that is a multi-differential operator of the first order. This looks like a definition of a sort of ‘higher connection’ which is a local invariant (concomitant) of . Moreover,
[TABLE]
depends on only. This is a local invariant of the ‘higher metric’ and can be viewed like a definition of a ‘higher Levi-Civita connection’. The closer study of the concomitant we postpone, however, to a separate paper.
8 Reductions: contrast functions on groupoids
To describe an example of a reduction of a contrast function, consider a principal action of a Lie group on a Lie groupoid . Such a structure is called in [12] a -groupoid if acts on by groupoid isomorphisms. The concept of a -groupoid is essentially of double nature: a -groupoid is a -principal bundle object in the category of Lie groupoids. Similarly, -algebroids are Lie algebroids with a principal action of by Lie algebroid isomorphisms. It is easy to see that the Lie algebroid of a -groupoid is canonically a -algebroid.
Let us recall that, for being a pair of Lie groupoids, a Lie groupoid morphism is a pair of maps such that the following diagram is commutative
[TABLE]
subject to the further condition that respects the (partial) multiplication; if are composable, then . It then follows that for we have and . Like in the classical Lie Theory, morphisms of Lie groupoids induce morphisms of the corresponding Lie algebroids (see [19]).
For a groupoid ,
The action of on commutes with the source and target maps, thus projects onto a -action on the manifold . Moreover, as an immersed submanifold of is invariant with respect to the -action, and the projected and restricted actions coincide. 2. 2.
As the action of on is principal, it is also principal on the immersed submanifold , so inherits a structure of a principal -bundle. It is important to note that is G-invariant. In particular, the quotient manifold exists. 3. 3.
The reduced manifold is a Lie groupoid , with the set of units , defined by the following structure:
[TABLE]
where is the canonical projection. In fact, the above constructions imply, tautologically, that is a morphism of Lie groupoids with the above structures. The fundamental fact in the Lie theory of groupoids says that any morphism of Lie groupoids induces a morphism of the corresponding Lie algebroids. It is derived from the map . In our case,
[TABLE]
is covering the map . It is easy to see that . This Lie algebroid morphism defines in turn the pul-backs of symmetric forms
[TABLE]
Note that a morphisms of vector bundles do not induce, in general, maps of the corresponding sections. This makes the definition of a Lie algebroid morphism non-trivial.
Example 8.1**.**
Let be a -principal bundle with the right action
[TABLE]
Then, the pair Lie groupoid is a -groupoid with respect to the action and we have the corresponding morphism of Lie groupoids
[TABLE]
The Lie groupoid is called the Atiyah groupoid of the principal bundle The corresponding Lie algebroid
[TABLE]
is the Atiyah algebroid with the Atiyah exact sequence of Lie algebroid morphisms
[TABLE]
where is the (surjective in this case) anchor map and is its kernel, a bundle of Lie algebras isomorphic to . The map identifies sections of with -invariant -forms on .
The structure of a -groupoid is described in [12]. For simplicity, we present the result for trivial -structures (all -groupoids are locally trivial).
Theorem 8.2**.**
For any -groupoid structure on the trivial -bundle there is a Lie groupoid structure on with the source and target maps and a groupoid morphism such that the source map , the target map and the partial multiplication in read
[TABLE]
Let us see what is the structure of .
First, according to the decomposition , we can view sections as -dependent sections of with the identification of sections of as -independent sections of , i.e. . Since the -independent sections generate the module over , the bracket in is completely determined by the bracket of the -independent sections and the anchor map. Hence, the left (resp., right) invariant vector fields on are spanned by (resp., ), where .
Now, using the decomposition , by straightforward calculations we obtain:
Proposition 8.3**.**
For , , we have in the groupoid :
[TABLE]
Here, of course, for , the symbol denotes the right translation of the vector by .
Note that the above proposition implies easily that , so -independent sections of commute as they representatives in , and the knowledge of the anchor completely determines the Lie bracket in .
Proposition 8.4**.**
Assume that is a contrast function on a groupoid and is the corresponding groupoid morphism. If is -invariant contrast function, then induces a contrast function on and
[TABLE]
In other words,
[TABLE]
Moreover, if is metric, then is metric and
[TABLE]
Proof.
According to Proposition 8.3, and differ from and (viewed as vector fields on ) by vector fields tangent to orbits of . As is -invariant and
[TABLE]
the first jets of along and along are trivial, and
[TABLE]
so (25) follows. Similarly,
[TABLE]
whence (26).
Example 8.5**.**
Let be the -principal bundle of oriented orthonormal frames on the sphere canonically embedded in as the unit sphere. In other words, is the orhonormal frame bundle of with the canonical Riemannian metric. We will view as the set of pairs , where and is an isometry respecting the orientations. Note that elements can be identified with orthonormal frames in in the obvious way. This implies that is simultaneously a homogeneous space of the group acting freely on (transitive -principal bundle). The group acts as a subgroup of with respect to the embedding ,
[TABLE]
Consider the pair groupoid . To every pair we can associate a matrix which is the unique matrix in which maps the oriented orthonormal frame of onto . This defines the map and, in turn, a diffeomorphism
[TABLE]
which identifies the diagonal in with . The normal bundle of in , thus , can be therefore identified with . With this identification, element corresponds to the vector which is tangent to the curve at . Hence, is the fundamental vector field of the -action on , corresponding to .
Define now the two-point function
[TABLE]
This function is a contrast function: it vanishes on the diagonal and
[TABLE]
We obtain the metric on the normal bundle similarly like in Example 5.3,
[TABLE]
The function is invariant with respect to the inversion , so . Like in Example 5.8 we obtain the Levi-Civita connection in the form
[TABLE]
where the bracket is that in the Lie algebra .
Let us observe now (cf. Example 8.1) that is a -groupoid with respect to the obvious action
[TABLE]
This action, in the identification , looks like
[TABLE]
hence acts on via
[TABLE]
so the vector field on is -invariant if . Note the canonical decomposition
[TABLE]
where is the orthogonal complement of in with respect to the trace scalar product. Note that this decomposition is -independent.
The Lie algebroid is . Its sections are identified with -invariant vector fields on . It is clear that is -invariant,
[TABLE]
so it defines a metric contrast function on the Atiyah groupoid
[TABLE]
We can simplify the picture choosing one point of , say to identify with and with right-invariant vector fields on . This time, however, the left-invariant vector fields represent -invariant vector fields on , so sections of . Moreover, the bracket in the Lie algebroid on sections agrees with the Lie bracket in ,
[TABLE]
Actually, the invariance of is with respect to the -action of the whole , so the metric (27) induces a Riemannian metric on which is simultaneously left- and right-invariant. There is a canonical mapping , obtained from the submersion
[TABLE]
which induces the anchor map . The left invariant vector fields on generate now the module as a module over -the -invariant functions on . The anchor is the corresponding fundamental vector field of the canonical action of on . Thus projects under to a -invariant vector field on . The kernel of this projection is generated by , where , so the anchor map identifies with .
Tu sum up: We can identify with
[TABLE]
so that the reduced contrast function is
[TABLE]
The Lie algebroid is identified with the normal bundle, i.e. . Any constant section represents a -invariant vector field on (left-invariant vector field on ) and projects to a -invariant vector field on . The reduced contrast function (28) induces on a metric by
[TABLE]
Moreover, and the Lie algebroid Levi-Civita connection for which satisfies
[TABLE]
The connection is clearly torsionless. The full form of involves of course the anchor map. In particular, for , , we have
[TABLE]
since the anchors are trivial. Due to invariance, the metric and the connection project to the sphere . Using the base
[TABLE]
in corresponding to vectors , we see that
[TABLE]
that shows that the metric induced by on the sphere is the standard Riemannian metric. Hence, and
[TABLE]
∎
9 Infinite dimensions
Our coordinate-free approach to stochastic manifolds has an additional advantage: it can be applied practically without changes in infinite-dimensional, say Banach manifold, frameworks. The differential calculus on Banach manifolds, in particular Banach-Lie groupoids, produces forms as elements of for some Banach spaces . This time, however, the non-degeneracy is a more delicate problem. This is due to the fact that Banach manifolds are generally not reflexive, the more not self-dual. In a weaker version, for non-degeneracy of one can assume that the map is an immersion, in a strong one, that it is an isomorphism. The latter require of course that is self-dual, .
The best infinite-dimensional framework is therefore that of (real or complex) Hilbert spaces. Here is a nice example.
Example 9.1**.**
For a Hilbert space , on , where , consider the two-point function
[TABLE]
It is easy to see that is a non-negative contrast function. Indeed, calculating the derivative with respect to , we get
[TABLE]
so that . Now,
[TABLE]
The 2-form is degenerated, but if we reduce by the action of (the contrast function is invariant with respect to -action
[TABLE]
on ), we obtain a Riemannian metric on the Hilbert projective space . This metric reads
[TABLE]
i.e. it is proportional to the Fubini-Study metric on .
Example 9.2**.**
The groupoid of rank-one operators.
We consider , the unit sphere in , as a -principal bundle over the complex projective space . Using normalized vectors
[TABLE]
we construct transition probability amplitudes as elements of . Equivalence classes are projected onto:
[TABLE]
hence, the Atiyah groupoid projects onto the complex projective space represented by rank-one operators.
10 Conclusions and outlook
In previous sections we have argued that a groupoid approach to differential geometry of information theory is a more natural setting to deal with sub-manifolds of classical probability distributions. We have also considered the reduction problem of contrast functions which will be very useful in the quantum setting, where relative entropies will be invariant under the action of the unitary group.
As a matter of fact, a coordinate free approach to deal with the differential calculus required to derive metric and dual connections out of potential or contrast functions was introduced previously [14, 16, 18, 20], however there the introduction was by “ad hoc” methods, here it is intrinsic with the notion of Lie groupoid and its associated Lie algebroid. Moreover the notion of groupoid enters also naturally within the Schwinger approach to quantum mechanics [15, 16, 17]. As the example provided in Section 9 shows, it is possible to write a contrast function in quantum mechanics. The contrast function used there arises from an Atiyah groupoid, indeed it is possible to consider , the unit sphere in the Hilbert space as a -principle bundle over the complex projective space, then the groupoid has a space of “objects” (units) provided by rank-one projectors which represent the pure states, the “arrows” are transition probability amplitudes.
Thus in this quantum setting, we replace probabilities with probability amplitudes and transition probabilities with transition probability amplitudes. This replacement is crucial to be able to describe quantum interference phenomena as argued by Born in his Nobel acceptance speech. We are already familiar with the interpretation of “wave functions” as probability amplitudes. This shift from probabilities to their “complex square root” allows to introduce also in quantum mechanics the language of groupoids to deal with contrast functions.
In a forthcoming paper we shall elaborate on the groupoid setting both in the Hilbert space approach and the C*-algebra approach to quantum mechanics to deal with contrast functions considered as generalized relative entropies.
Acknowledgments
J. Grabowski acknowledges research founded by the Polish National Science Centre grant HARMONIA under the contract number 2016/22/M/ST1/00542. M. Kuś acknowledges financial support of the the Polish National Science Centre grant 2017/27/B/ST2/02959. G. Marmo acknowledges financial support from the Spanish Ministry of Economy and Competitiveness, through the Severo Ochoa Programme for Centres of Excellence in RD(SEV-2015/0554), and would like to thank the support provided by the Santander/UC3M Excellence Chair Programme 2018/2019.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] S.-i. Amari. Differential geometric methods in statistics. Lecture Notes in Statist. , 28, Springer, Heidelberg, 1985.
- 2[2] S.-i. Amari. Differential-geometrical methods in statistics . Springer Science & Business Media, Berlin/Heidelberg, 2012.
- 3[3] S.-i. Amari. Information geometry and its applications . Springer Japan, Berlin/Heidelberg, 2016.
- 4[4] S.-i. Amari et al. Differential Geometry in Statistical Inference (IMS Lecture Notes–Monograph Series, Volume 10) Institute of Mathematical Statistics, Hayward, California, 1987.
- 5[5] S.-i.i Amari and H. Nagaoka. Methods of information geometry . American Mathematical Soc., Providence, R. I., 2007.
- 6[6] A. Ashtekar and T. A. Schilling. Geometrical formulation of quantum mechanics. In On Einstein’s Path , pages 23–65. Springer, New York, NY, 1999.
- 7[7] N. Ay and W. Tuschmann. Dually flat manifolds and global information geometry. Open Syst. Inf. Dyn. , 9 , 195–200, 2002.
- 8[8] N. Ay and W. Tuschmann. Duality versus dual flatness in quantum information geometry. Journal of Mathematical Physics , 44 , 1512–1518, 2003.
