Some proofs of the Poincar\'e-Birkhoff-Witt theorem and related matters

Gyula Lakos

arXiv:1812.04896·math.RA·July 24, 2024

Some proofs of the Poincar\'e-Birkhoff-Witt theorem and related matters

Gyula Lakos

PDF

Open Access

TL;DR

This paper explores the foundational PBW theorem for free Lie algebras, demonstrating its close connection to Magnus-Witt theorems through various proofs and expositions.

Contribution

It provides new insights into the relationship between the PBW theorem and Magnus-Witt theorems, offering multiple proof strategies.

Findings

01

PBW theorem closely follows from Magnus-Witt theorems

02

Multiple proof methods elucidate the theorem's foundations

03

Enhanced understanding of free Lie algebra structures

Abstract

This expository paper focuses on free Lie $K$ -algebras and the basic PBW theorem. We argue in various ways that the basic PBW theorem is a quite close consequence of the Magnus-Witt theorems concerning free Lie algebras.

Equations263

m_{\leq} : ⨂_{\leq} g \to U g

m_{\leq} : ⨂_{\leq} g \to U g

m_{Σ} : ⨂_{Σ} g \to U g

m_{Σ} : ⨂_{Σ} g \to U g

μ_{n} (X_{1}, \dots, X_{n}) =

μ_{n} (X_{1}, \dots, X_{n}) =

= I_{1} \dot{\cup} \dots \dot{\cup} I_{s} = {2, \dots, n} I_{k} = {i_{k, 1}, \dots, i_{k, p_{k}}} \neq = \emptyset i_{k, 1} < \dots < i_{k, p_{k}} \sum β_{s} \cdot [μ_{p_{1}} (X_{i_{1, 1}}, \dots, X_{i_{1, p_{1}}}), \dots, μ_{p_{s}} (X_{i_{s, 1}}, \dots, X_{i_{s, p_{s}}}), X_{1}]_{L},

s = 0 \sum \infty β_{s} x^{s} = β (x) = \frac{x}{e ^{x} - 1} .

s = 0 \sum \infty β_{s} x^{s} = β (x) = \frac{x}{e ^{x} - 1} .

μ_{n} (X_{1}, \dots, X_{n}) =

μ_{n} (X_{1}, \dots, X_{n}) =

= J_{1} \dot{\cup} \dots \dot{\cup} J_{r} = {1, \dots, n - 1} J_{l} = {j_{l, 1}, \dots, j_{l, q_{l}}} \neq = \emptyset j_{l, 1} < \dots < j_{l, q_{l}} \sum \tilde{β}_{r} \cdot [μ_{q_{1}} (X_{j_{1, 1}}, \dots, X_{j_{1, q_{1}}}), \dots μ_{q_{r}} (X_{j_{r, 1}}, \dots, X_{j_{r, q_{r}}}), X_{n}]_{L},

r = 0 \sum \infty \tilde{β}_{r} x^{r} = β (- x) = \frac{- x}{e ^{- x} - 1} .

r = 0 \sum \infty \tilde{β}_{r} x^{r} = β (- x) = \frac{- x}{e ^{- x} - 1} .

μ_{1} (X_{1}) = X_{1} and

μ_{1} (X_{1}) = X_{1} and

μ_{n} (X_{1}, \dots, X_{n}) =

[[μ_{p_{1}} (X_{i_{1, 1}}, \dots, X_{i_{1, p_{1}}}), \dots, μ_{p_{s}} (X_{i_{s, 1}}, \dots, X_{i_{s, p_{s}}}), X_{1}]_{L},

[μ_{q_{1}} (X_{j_{1, 1}}, \dots, X_{j_{1, q_{1}}}), \dots, μ_{q_{r}} (X_{j_{r, 1}}, \dots, X_{j_{r, q_{r}}}), X_{n}]_{L}]

s = 0 \sum \infty r = 0 \sum \infty α_{s, r} x^{s} y^{r} = α (x, y)

s = 0 \sum \infty r = 0 \sum \infty α_{s, r} x^{s} y^{r} = α (x, y)

= - \frac{β ( x + y ) - β ( x )}{y} β (- y)

= \frac{- x e ^{y} - e ^{- x} y + x + y}{( e ^{- y} - e ^{x} ) ( e ^{y} - 1 ) ( e ^{- x} - 1 )} .

μ_{n} (X_{1}, \dots, X_{n})_{(R)} = r \sum (X_{1}, \dots, X_{n - 1} loc. incr.) \sum \tilde{β}_{r} \cdot [r many μ μ (...), \dots, μ (...), X_{n}]_{L},

μ_{n} (X_{1}, \dots, X_{n})_{(R)} = r \sum (X_{1}, \dots, X_{n - 1} loc. incr.) \sum \tilde{β}_{r} \cdot [r many μ μ (...), \dots, μ (...), X_{n}]_{L},

= r, p, s \sum (X_{2}, \dots, X_{n - 1} loc. incr.) \sum \tilde{β}_{r} β_{s} \cdot [p many μ μ (...), \dots, μ (...), [s many μ μ (...), \dots, μ (...), X_{1}]_{L}, r - 1 - p many μ μ (...), \dots, μ (...), X_{n}]_{L} .

= r, p, s \sum (X_{2}, \dots, X_{n - 1} loc. incr.) \sum \tilde{β}_{r} β_{s} \cdot [p many μ μ (...), \dots, μ (...), [s many μ μ (...), \dots, μ (...), X_{1}]_{L}, r - 1 - p many μ μ (...), \dots, μ (...), X_{n}]_{L} .

= \overset{s}{ˉ}, \overset{r}{ˉ} \sum (X_{2}, \dots, X_{n - 1} loc. incr.) \sum \tilde{α}_{\overset{s}{ˉ}, \overset{r}{ˉ}} \cdot [[\overset{s}{ˉ} many μ μ (...), \dots, μ (...), X_{1}]_{L}, [\overset{r}{ˉ} many μ μ (...), \dots, μ (...), X_{n}]_{L}] .

= \overset{s}{ˉ}, \overset{r}{ˉ} \sum (X_{2}, \dots, X_{n - 1} loc. incr.) \sum \tilde{α}_{\overset{s}{ˉ}, \overset{r}{ˉ}} \cdot [[\overset{s}{ˉ} many μ μ (...), \dots, μ (...), X_{1}]_{L}, [\overset{r}{ˉ} many μ μ (...), \dots, μ (...), X_{n}]_{L}] .

\overset{s}{ˉ} + \overset{r}{ˉ} = k \sum \tilde{α}_{\overset{s}{ˉ}, \overset{r}{ˉ}} x^{\overset{s}{ˉ}} y^{\overset{r}{ˉ}} = r - 1, s \geq 0, (r - 1) + s = k \sum \tilde{β}_{r} (y^{r - 1} + \dots + (x + y)^{p} y^{r - 1 - p} + \dots (x + y)^{r - 1}) β_{s} x^{s} = r - 1, s \geq 0, (r - 1) + s = k \sum \frac{β ~ _{r} ( x + y ) ^{r} - β ~ _{r} y ^{r}}{x} β_{s} x^{s} = (\frac{β ( - x - y ) - β ( - y )}{x} β (x))_{k -homogeneous part in x, y} .

\overset{s}{ˉ} + \overset{r}{ˉ} = k \sum \tilde{α}_{\overset{s}{ˉ}, \overset{r}{ˉ}} x^{\overset{s}{ˉ}} y^{\overset{r}{ˉ}} = r - 1, s \geq 0, (r - 1) + s = k \sum \tilde{β}_{r} (y^{r - 1} + \dots + (x + y)^{p} y^{r - 1 - p} + \dots (x + y)^{r - 1}) β_{s} x^{s} = r - 1, s \geq 0, (r - 1) + s = k \sum \frac{β ~ _{r} ( x + y ) ^{r} - β ~ _{r} y ^{r}}{x} β_{s} x^{s} = (\frac{β ( - x - y ) - β ( - y )}{x} β (x))_{k -homogeneous part in x, y} .

\begin{array}[]{c|cccccc}\beta_{s}&s=0&1&2&3&4&5\\ \hline\cr\\ &1&-\dfrac{1}{2}&\dfrac{1}{12}&0&-\dfrac{1}{720}&0\,.\\ \end{array}

\begin{array}[]{c|cccccc}\beta_{s}&s=0&1&2&3&4&5\\ \hline\cr\\ &1&-\dfrac{1}{2}&\dfrac{1}{12}&0&-\dfrac{1}{720}&0\,.\\ \end{array}

\begin{array}[]{c|ccccc}\alpha_{s,r}&r=0&1&2&3&4\\ \hline\cr\\ s=0&\dfrac{1}{2}&\dfrac{1}{6}&0&-{\dfrac{1}{180}}&0\\ \\ 1&-\dfrac{1}{6}&-\dfrac{1}{12}&-{\dfrac{1}{120}}&{\dfrac{1}{360}}&{\dfrac{1}{2016}}\\ \\ 2&0&{\dfrac{1}{120}}&{\dfrac{1}{240}}&{\dfrac{1}{5040}}&-{\dfrac{1}{4032}}\\ \\ 3&{\dfrac{1}{180}}&{\dfrac{1}{360}}&-{\dfrac{1}{5040}}&-{\dfrac{1}{3024}}&-{\dfrac{1}{60480}}\\ \\ 4&0&-{\dfrac{1}{2016}}&-{\dfrac{1}{4032}}&{\dfrac{1}{60480}}&{\dfrac{1}{34560}}\,.\end{array}

\begin{array}[]{c|ccccc}\alpha_{s,r}&r=0&1&2&3&4\\ \hline\cr\\ s=0&\dfrac{1}{2}&\dfrac{1}{6}&0&-{\dfrac{1}{180}}&0\\ \\ 1&-\dfrac{1}{6}&-\dfrac{1}{12}&-{\dfrac{1}{120}}&{\dfrac{1}{360}}&{\dfrac{1}{2016}}\\ \\ 2&0&{\dfrac{1}{120}}&{\dfrac{1}{240}}&{\dfrac{1}{5040}}&-{\dfrac{1}{4032}}\\ \\ 3&{\dfrac{1}{180}}&{\dfrac{1}{360}}&-{\dfrac{1}{5040}}&-{\dfrac{1}{3024}}&-{\dfrac{1}{60480}}\\ \\ 4&0&-{\dfrac{1}{2016}}&-{\dfrac{1}{4032}}&{\dfrac{1}{60480}}&{\dfrac{1}{34560}}\,.\end{array}

μ_{n} (\dots_{1}, X_{k - 1}, X_{k}, \dots_{2}) - μ_{n} (\dots_{1}, X_{k}, X_{k - 1}, \dots_{2}) = μ_{n - 1} (\dots_{1}, [X_{k - 1}, X_{k}], \dots_{2})

μ_{n} (\dots_{1}, X_{k - 1}, X_{k}, \dots_{2}) - μ_{n} (\dots_{1}, X_{k}, X_{k - 1}, \dots_{2}) = μ_{n - 1} (\dots_{1}, [X_{k - 1}, X_{k}], \dots_{2})

μ_{Σ} (x_{1} \otimes \dots \otimes x_{n}) = I_{1} \dot{\cup} \dots \dot{\cup} I_{s} = {1, \dots, n} I_{k} = {i_{k, 1}, \dots, i_{k, p_{k}}} \neq = \emptyset i_{k, 1} < \dots < i_{k, p_{k}} \sum \frac{1}{s !} μ_{p_{1}} (x_{i_{1, 1}}, \dots, x_{i_{1, p_{1}}}) \otimes \dots \otimes μ_{p_{s}} (x_{i_{s, 1}}, \dots, x_{i_{s, p_{s}}})

μ_{Σ} (x_{1} \otimes \dots \otimes x_{n}) = I_{1} \dot{\cup} \dots \dot{\cup} I_{s} = {1, \dots, n} I_{k} = {i_{k, 1}, \dots, i_{k, p_{k}}} \neq = \emptyset i_{k, 1} < \dots < i_{k, p_{k}} \sum \frac{1}{s !} μ_{p_{1}} (x_{i_{1, 1}}, \dots, x_{i_{1, p_{1}}}) \otimes \dots \otimes μ_{p_{s}} (x_{i_{s, 1}}, \dots, x_{i_{s, p_{s}}})

μ_{Σ} (\dots_{1} \otimes X_{k - 1} \otimes X_{k} \otimes \dots_{2} - \dots_{1} \otimes X_{k - 1} \otimes X_{k} \otimes \dots_{2} - \dots_{1} \otimes [X_{k - 1}, X_{k}] \otimes \dots_{2}) = 0,

μ_{Σ} (\dots_{1} \otimes X_{k - 1} \otimes X_{k} \otimes \dots_{2} - \dots_{1} \otimes X_{k - 1} \otimes X_{k} \otimes \dots_{2} - \dots_{1} \otimes [X_{k - 1}, X_{k}] \otimes \dots_{2}) = 0,

σ \in Σ_{n} \sum P (X_{σ (1)}, \dots, X_{σ (n)}) = 0.

σ \in Σ_{n} \sum P (X_{σ (1)}, \dots, X_{σ (n)}) = 0.

μ_{Σ} (m_{Σ} (\frac{1}{n !} σ \in Σ_{n} \sum g_{σ (1)} \otimes \dots \otimes g_{σ (n)})) = μ_{Σ} (\frac{1}{n !} σ \in Σ_{n} \sum g_{σ (1)} \otimes \dots \otimes g_{σ (n)}) = \frac{1}{n !} σ \in Σ_{n} \sum μ_{1} (g_{σ (1)}) \otimes_{Σ} \dots \otimes_{Σ} μ_{1} (g_{σ (n)}) = \frac{1}{n !} σ \in Σ_{n} \sum g_{σ (1)} \otimes \dots \otimes g_{σ (n)} .

μ_{Σ} (m_{Σ} (\frac{1}{n !} σ \in Σ_{n} \sum g_{σ (1)} \otimes \dots \otimes g_{σ (n)})) = μ_{Σ} (\frac{1}{n !} σ \in Σ_{n} \sum g_{σ (1)} \otimes \dots \otimes g_{σ (n)}) = \frac{1}{n !} σ \in Σ_{n} \sum μ_{1} (g_{σ (1)}) \otimes_{Σ} \dots \otimes_{Σ} μ_{1} (g_{σ (n)}) = \frac{1}{n !} σ \in Σ_{n} \sum g_{σ (1)} \otimes \dots \otimes g_{σ (n)} .

from I_{s} i_{s, 1}, \dots, i_{s, p_{s}}, \dots, from I_{k} i_{k, 1}, \dots, i_{k, p_{k}}, \dots, from I_{1} i_{1, 1}, \dots, i_{1, p_{1}} .

from I_{s} i_{s, 1}, \dots, i_{s, p_{s}}, \dots, from I_{k} i_{k, 1}, \dots, i_{k, p_{k}}, \dots, from I_{1} i_{1, 1}, \dots, i_{1, p_{1}} .

I i is a Lie-permutation of {1, \dots, n} \sum a_{I i} [X_{i_{1, 1}}, \dots, X_{i_{1, p_{1}}}]_{L} \cdot_{Σ} \dots \cdot_{Σ} [X_{i_{s, 1}}, \dots, X_{i_{s, p_{s}}}]_{L}

I i is a Lie-permutation of {1, \dots, n} \sum a_{I i} [X_{i_{1, 1}}, \dots, X_{i_{1, p_{1}}}]_{L} \cdot_{Σ} \dots \cdot_{Σ} [X_{i_{s, 1}}, \dots, X_{i_{s, p_{s}}}]_{L}

X_{1} \dots X_{n} = I i is a Lie-permutation of {1, \dots, n} \sum b_{I i} [X_{i_{1, 1}}, \dots, X_{i_{1, p_{1}}}]_{L} \cdot_{Σ} \dots \cdot_{Σ} [X_{i_{s, 1}}, \dots, X_{i_{s, p_{s}}}]_{L};

X_{1} \dots X_{n} = I i is a Lie-permutation of {1, \dots, n} \sum b_{I i} [X_{i_{1, 1}}, \dots, X_{i_{1, p_{1}}}]_{L} \cdot_{Σ} \dots \cdot_{Σ} [X_{i_{s, 1}}, \dots, X_{i_{s, p_{s}}}]_{L};

μ_{n} (X_{1}, \dots, X_{n}) = I i is a Lie-permutation of {1, \dots, n}, of one single block \sum b_{I i} [X_{i_{1, 1}}, \dots, X_{i_{1, p_{1}}}]_{L},

μ_{n} (X_{1}, \dots, X_{n}) = I i is a Lie-permutation of {1, \dots, n}, of one single block \sum b_{I i} [X_{i_{1, 1}}, \dots, X_{i_{1, p_{1}}}]_{L},

μ_{n} (\dots_{1}, X_{k - 1}, X_{k}, \dots_{2}) - μ_{n} (\dots_{1}, X_{k}, X_{k - 1}, \dots_{2}) = μ_{n - 1} (\dots_{1}, [X_{k - 1}, X_{k}], \dots_{2})

μ_{n} (\dots_{1}, X_{k - 1}, X_{k}, \dots_{2}) - μ_{n} (\dots_{1}, X_{k}, X_{k - 1}, \dots_{2}) = μ_{n - 1} (\dots_{1}, [X_{k - 1}, X_{k}], \dots_{2})

χ \in Σ_{n - 1} \sum c_{χ}^{LHS} [X_{χ_{1}}, \dots, X_{χ_{n - 1}}, X_{n}]_{L} and χ \in Σ_{n - 1} \sum c_{χ}^{RHS} [X_{χ_{1}}, \dots, X_{χ_{n - 1}}, X_{n}]_{L}

χ \in Σ_{n - 1} \sum c_{χ}^{LHS} [X_{χ_{1}}, \dots, X_{χ_{n - 1}}, X_{n}]_{L} and χ \in Σ_{n - 1} \sum c_{χ}^{RHS} [X_{χ_{1}}, \dots, X_{χ_{n - 1}}, X_{n}]_{L}

n terms \dots_{1} X_{k - 1} X_{k} \dots_{2} - n terms \dots_{1} X_{k} X_{k - 1} \dots_{2} = n - 1 terms \dots_{1} [X_{k - 1}, X_{k}] \dots_{2}

n terms \dots_{1} X_{k - 1} X_{k} \dots_{2} - n terms \dots_{1} X_{k} X_{k - 1} \dots_{2} = n - 1 terms \dots_{1} [X_{k - 1}, X_{k}] \dots_{2}

χ \in Σ_{n - 1} \sum c_{χ}^{LHS} [X_{χ_{1}}, \dots, X_{χ_{n - 1}}, X_{n}]_{L} = χ \in Σ_{n - 1} \sum c_{χ}^{RHS} [X_{χ_{1}}, \dots, X_{χ_{n - 1}}, X_{n}]_{L},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Topics in Algebra · Algebraic structures and combinatorial models · Homotopy and Cohomology in Algebraic Topology

Full text

Some proofs of the Poincaré–Birkhoff–Witt theorem and related matters

Gyula Lakos

[email protected]

Department of Geometry, Institute of Mathematics, Eötvös University, Pázmány Péter s. 1/C, Budapest, H–1117, Hungary

Abstract.

The first part of this article is concerned with proving the symmetric PBW theorem using Magnus commutators. Extensions of the Dynkin–Specht–Wever lemma and a general theorem of Nouazé–Revoy type are also obtained. The second part focuses on free Lie $K$ -algebras and the basic PBW theorem. Appendices are provided in order make the discussion self-contained, and also putting it into context.

Key words and phrases:

Poincaré–Birkhoff–Witt theorem, Magnus commutators, free Lie algebras

2010 Mathematics Subject Classification:

Primary: 17B35. Secondary: 17B01, 16S30.

The author would like to thank the support of Balázs Csikós.

1. Introduction

The objective of this paper is to give several alternative proofs of the (global versions of the) Poincaré–Birkhoff–Witt theorem. We also consider some consequences of these arguments.

The local form. Assume that $K$ is a unital commutative ring, and $\mathfrak{g}$ is a $K$ -module with a compatible Lie-ring structure; i. e. $\mathfrak{g}$ is a Lie $K$ -algebra (also called: Lie ring over $K$ ). The universal enveloping algebra $\mathcal{U}\mathfrak{g}$ is the free $K$ -algebra $\mathrm{F}_{K}[\mathfrak{g}]\simeq\bigotimes\mathfrak{g}\equiv\bigoplus_{n=0}^{\infty}{\textstyle\bigotimes}^{n}\mathfrak{g}$ factorized by the ideal $J\mathfrak{g}$ generated by the elements $X\otimes Y-Y\otimes X-\boldsymbol{[}X,Y\boldsymbol{]}$ , the tensor products are taken over $K$ . Let $\boldsymbol{m}:\bigotimes\mathfrak{g}\rightarrow\mathcal{U}\mathfrak{g}$ denote this canonical homomorphism. The enveloping algebra is naturally filtered by $\mathcal{U}^{n}\mathfrak{g}=\boldsymbol{m}\left({\textstyle\bigotimes}^{\leq n}\mathfrak{g}\right)$ , and the construction implies the existence of natural (surjective) maps $\boldsymbol{m}^{(n)}:{\textstyle\bigodot}^{n}\mathfrak{g}\rightarrow\mathcal{U}^{n}\mathfrak{g}/\mathcal{U}^{n-1}\mathfrak{g}$ . The (local form of the) Poincaré–Birkhoff–Witt theorem, whenever it holds, states that the maps $\boldsymbol{m}^{(n)}$ are isomorphisms. This theorem is known to hold in the following cases:

(i) $K$ is a field, or, more generally, $\mathfrak{g}$ is a free $K$ -module, or, more generally, $\mathfrak{g}$ is a direct sum of cyclic $K$ -modules (Poincaré [25]: $K$ is a field, $\mathbb{Q}\subset K$ ; Birkhoff [2], Witt [39]: $K$ is a field, but their methods work more generally; cf. also Bourbaki [4], Ton-That, Tran [36]);

(i’) $K$ is a principal ideal domain (Lazard [19]) or just a Dedekind ring (Cartier [6]); also see Higgins [16] for further results in this direction;

(ii) $\mathbb{Q}\subset K$ (Cohn [9]);

(ii’) $\frac{1}{2}\in K$ but $[\mathfrak{g},[\mathfrak{g},\mathfrak{g}]]=0$ (Nouazé, Revoy [24]);

(cf. Grivel [14] for a review), but there are counterexamples (Širšov [31], Cartier [6], Cohn [9]). The most general approach is of Higgins [16], cf. Revoy [27].

The global form. In practice, mostly cases (i) are (ii) are considered but typically formulated in global form:

(i) If $\mathfrak{g}$ is a sum of cyclic $K$ -modules, then we can choose a basis $\{g_{\alpha}\,:\,\alpha\in A\}$ , and an ordering $\leq$ of $A$ . Then let $\bigotimes_{\leq}\mathfrak{g}$ be the submodule of $\bigotimes\mathfrak{g}$ spanned by $g_{\alpha_{1}}\otimes\ldots\otimes g_{\alpha_{n}}$ with $\alpha_{1}\leq\ldots\leq\alpha_{n}$ . Then the “basic” version of the PBW theorem states that

[TABLE]

(a restriction of $\boldsymbol{m}$ ) is an isomorphism of $K$ -modules. (In fact, what we really need, in general, is a choice function $\mathfrak{c}$ , which transforms any finite multiset $\boldsymbol{\alpha}$ from $A$ into an ordered word $\alpha_{1},\ldots,\alpha_{n}$ . Then the statement is that is that the corresponding map $\boldsymbol{m}_{\mathfrak{c}}:\textstyle{\bigotimes_{\mathfrak{c}}\mathfrak{g}}\rightarrow\mathcal{U}\mathfrak{g}$ is an isomorphism. For the sake of simplicity, we will use the ordered version.) This is equivalent to the local version. Indeed, it is easy to show that $\boldsymbol{m}_{\leq}$ is surjective: Whenever we have an expression in $\mathcal{U}\mathfrak{g}$ (as an image of $\boldsymbol{m}$ ), we can rearrange the formally top nonarranged degree term into an image of $\boldsymbol{m}_{\leq}$ at the cost of generating formally lower order terms. Then we repeat this in formally lower orders. We call this as the “basic rearrangement procedure”. Then the isomorphism (i. e. injectivity) is a consequence of the local PBW theorem, and in fact, equivalent to it.

(ii) If $\mathbb{Q}\subset K$ , then one can consider the submodule $\bigotimes_{\Sigma}\mathfrak{g}$ of $\bigotimes\mathfrak{g}$ . This submodule can be interpreted either as the submodule of elements invariant under permutations in the order of tensor product or as the span of the elements $a_{1}\otimes_{\Sigma}\ldots\otimes_{\Sigma}a_{n}=\frac{1}{n!}\sum_{\sigma\in\Sigma_{n}}g_{\sigma(1)}\otimes\ldots\otimes g_{\sigma(n)}$ . Then the “symmetric” version of the PBW theorem states that

[TABLE]

(a restriction of $\boldsymbol{m}$ ) is an isomorphism of $K$ -modules. This is, again, equivalent to the local version. We can prove the surjectivity of $\boldsymbol{m}_{\Sigma}$ , using the “symmetric rearrangement procedure” in $\mathcal{U}\mathfrak{g}$ , i. e. symmetrizing in the formally top nonarranged degree term at the cost of generating lower order terms, and repeating the process in formally lower orders. Then, again, isomorphism (injectivity) is a consequence of the local PBW theorem, and, in fact, equivalent to it.

In practice, one typically starts with the global cases (i), (ii), and then proceeds further to the general local ones. For sake of reference, in Appendix A, we include the Witt–Lazard version of the proof of the global cases of the PBW theorem. (We formulate it to cover not only the original case (i) but (ii).)

The free case. One of the few cases where $\mathcal{U}\mathfrak{g}$ is easy to describe is when $\mathfrak{g}$ is the free Lie $K$ -algebra $\mathrm{F}_{K}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ . Then $\mathcal{U}\mathrm{F}_{K}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ is naturally isomorphic to the noncommutative polynomial algebra $\mathrm{F}_{K}[X_{\lambda}:\lambda\in\Lambda]$ . This holds for purely universal algebraic reasons. Indeed, $\mathrm{F}_{K}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ evaluates by commutators, which defines a map $\iota:\mathrm{F}_{K}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]\rightarrow\mathrm{F}_{K}[X_{\lambda}:\lambda\in\Lambda]$ , which, by the universality of the enveloping algebra, gives rise to a map $\mathcal{U}\iota:\mathcal{U}\mathrm{F}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]\rightarrow\mathrm{F}_{K}[X_{\lambda}:\lambda\in\Lambda]$ . Here the class $[X_{\lambda}]_{\mathcal{U}}$ maps to $X_{\lambda}$ . Comparing this to the universality of $\mathrm{F}_{K}[X_{\lambda}:\lambda\in\Lambda]$ , we see that $\mathcal{U}\iota$ is an isomorphism. What is not that obvious is that $\iota:\mathrm{F}_{K}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]\rightarrow\mathrm{F}_{K}[X_{\lambda}:\lambda\in\Lambda]\simeq\mathcal{U}\mathrm{F}_{K}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ is an inclusion. This is the theorem of Magnus [20] (cf. Witt [39]; the critical case is $K=\mathbb{Z}$ ) on the representability of free Lie $K$ -algebras. But this is a consequence of the PBW theorem ( $\boldsymbol{m}^{(1)}$ level). Now, the PBW theorem does, indeed, hold in the case when $\mathfrak{g}$ is a free Lie $K$ -algebra; this is due to the fact that free Lie $K$ -algebras are free $K$ -modules. This latter fact, however, is not entirely trivial if $K$ is not a field (but it can be derived easily from the theorem of Magnus…). In Appendix B, we include some elementary facts regarding free Lie $K$ -algebras, and we show how to prove that free Lie $K$ -algebras are free $K$ -modules using a sufficiently strong version of the PBW theorem itself. In Appendix C a more informative account is given, which shows how to do this without the use of the PBW theorem. An important point is, however, that free Lie $K$ -algebras are notable special cases of the PBW theorem.

The first part of this article is concerned with proving the symmetric PBW theorem using Magnus commutators (cf. Magnus [20]). Extensions of the Dynkin–Specht–Wever lemma and a general theorem of Nouazé–Revoy type are also obtained. The second part focuses on free Lie $K$ -algebras and the basic PBW theorem. Appendices are provided in order make the discussion self-contained, and also putting it into context. Arguments of such type where considered before; see Cartier [5], Bonfiglioli, Fulci [3], Ch. 6. A difference compared to them is that in the first part we use Magnus commutators instead of BCH series (the former is just more on target); and in the second part we consider general rings $K$ ( $\mathbb{Q}$ and $\mathbb{R}$ are relatively easy). We keep our arguments elementary. We do not consider the several generalizations of the PBW theorem (but see Grivel [14] for some). Nor we consider proofs which use higher algebraic or topological methods; for those we refer to the general literature.

2. The existence of $\mu$ I

For practical reasons we will use left-iterated higher commutators $\boldsymbol{[}X_{1},\ldots,X_{n}\boldsymbol{]}_{\mathrm{L}}=\boldsymbol{[}X_{1},\boldsymbol{[}X_{2},\ldots,\boldsymbol{[}X_{n-1},X_{n}\boldsymbol{]}\ldots\boldsymbol{]}\boldsymbol{]}$ .

Proposition/Definition 2.1.

There is a series of Lie-polynomials $\mu_{n}(X_{1},\ldots,X_{n})$ , $n\geq 1$ , over $\mathbb{Q}$ such that the following hold:

[TABLE]

where the generating function of the coefficients $\beta_{s}$ is

[TABLE]

where the generating function of the coefficients $\tilde{\beta}_{r}$ is

[TABLE]

for $n\geq 2$ , where the generating function of the coefficients $\alpha_{s,r}$ is

[TABLE]

Proof.

It is easy to see that $\beta(x)$ has only rational coefficients, and the formal rational power series on the RHS of (cgen), line 1 and 2, expand function theoretically to line 3 (which therefore also allows a power series expansion). Thus, we have here three well-defined recursive definitions for $\mu_{n}$ , we just have to show that they give the same Lie-polynomials.

The three definitions are obviously the same for $n=1$ . By induction, assume that the $\mu_{m}$ are well-defined for $m<n$ , $n\geq 2$ . Consider first the definition of $\mu_{n}(X_{1},\ldots,X_{n})$ according to (R). This is

[TABLE]

where we do not fill out the variables in the $\mu(...)$ , but just note that we have to sum for all locally increasing deployments of the variables $X_{1},\ldots,X_{n-1}$ .

In each summand, we consider the $\mu(...)$ containing $X_{1}$ , and expand it using (L). (We can do this according to the induction hypothesis). It yields

[TABLE]

There we have a commutator $\boldsymbol{[}\boldsymbol{[}\underbrace{\mu(...),\ldots,\mu(...)}_{\text{$ s $many$ \mu $}},X_{1}\boldsymbol{]}_{\mathrm{L}},\boldsymbol{[}\underbrace{\mu(...),\ldots,\mu(...)}_{\text{$ r-1-p $many$ \mu $}},X_{n}\boldsymbol{]}_{\mathrm{L}}\boldsymbol{]}$ , which is commutated further by $p$ many $\mu(...)$ . Let us distribute those, using the Leibniz rule, between the two terms of the commutators. We obtain

[TABLE]

This is a formally deterministic process, which gives nonzero contributions only for $\bar{s}+\bar{r}\leq n-2$ (because there are only $n-2$ many variables to distribute). According to our specific method, if $k\leq n-2$ , then

[TABLE]

We see that our manipulations yield $\tilde{\alpha}_{\bar{s},\bar{r}}=\alpha_{\bar{s},\bar{r}}$ for $\bar{s}+\bar{r}\leq n-2$ , which implies that the definitions of $\mu_{n}(X_{1},\ldots,X_{n})$ according to (R) and (C) are the same (again, terms with $\bar{s}+\bar{r}>n-2$ do not appear in either side, as there are only $n-2$ variables to distribute).

The argument that (L) and (C) give the same polynomials is analogous. ∎

Remark 2.2.

Some values of $\beta_{s}$ are given by

[TABLE]

( $B_{s}=s!\beta_{s}$ are the Bernoulli numbers.)

Some values of $\alpha_{s,r}$ are given by

[TABLE]

(The numerators are more complicated for higher indices in both cases.) ∎

Proposition 2.3.

The Lie-polynomials $\mu_{n}(X_{1},\ldots,X_{n})$ , $n\geq 1$ , satisfy the identities

[TABLE]

for $n\geq 2$ , $1<k\leq n$ .

Proof.

It is easy to check that $\mu_{2}(X_{1},X_{2})=\frac{1}{2}\boldsymbol{[}X_{1},X_{2}\boldsymbol{]}$ , which shows the statement for $n=2$ . By induction, assume that $\mu_{m}$ is well-defined for $m<n$ , $n\geq 3$ . If $\ldots_{2}$ is non-empty, then expand $\mu_{n}(\ldots_{1},X_{k-1},X_{k},\ldots_{2})-\mu_{n}(\ldots_{1},X_{k},X_{k-1},\ldots_{2})$ according to the (R)-expansions of the $\mu_{n}$ . Most of the terms cancel each other except those which contain $X_{k-1}$ and $X_{k}$ immediately next to each other. But then the induction hypothesis can be applied to show that it yields the (R)-expansion of $\mu_{n-1}(\ldots_{1},\boldsymbol{[}X_{k-1},X_{k}\boldsymbol{]},\ldots_{2})$ . If $\ldots_{1}$ is non-empty, then the (L)-expansion can be used to prove the identity in the same manner. ∎

3. From $\mu$ to the symmetric PBW theorem

Definition 3.1.

We define the map $\mu_{\Sigma}:\bigotimes\mathfrak{g}\rightarrow\bigotimes_{\Sigma}\mathfrak{g}$ such that

[TABLE]

(and it acts trivially in the [math]th order).

Proposition/Definition 3.2.

$\mu_{\Sigma}:\bigotimes\mathfrak{g}\rightarrow\bigotimes_{\Sigma}\mathfrak{g}$ * descends to a map $\boldsymbol{\mu}_{\Sigma}:\mathcal{U}\mathfrak{g}\rightarrow\bigotimes_{\Sigma}\mathfrak{g}$ .*

Proof.

It is sufficient to check that

[TABLE]

that is $\mu_{\Sigma}$ vanishes on the ideal generated by the elements $X\otimes Y-Y\otimes X-\boldsymbol{[}X,Y\boldsymbol{]}$ . This vanishing, when expanded, however, is a consequence of identities (2). ∎

Lemma 3.3.

Suppose that $P(X_{1},\ldots,X_{n})$ , with $n\geq 2$ , is a combination of Lie-monomials, such that in every Lie-monomial every variable appears exactly once. Then

[TABLE]

In particular, this holds for $\mu_{n}(X_{1},\ldots,X_{n})$ , $n\geq 2$ .

Proof.

This is sufficient to prove for Lie-monomials of $X_{1},\ldots,X_{n}$ . If $P$ is a non-trivial monomial, then it contains an inner Lie-commutator $\boldsymbol{[}X_{k},X_{l}\boldsymbol{]}$ , $k\neq l$ . Now, the permutations from $\Sigma_{n}$ come in pairs $\sigma\in A_{n}$ and $\sigma\circ(k\,\,l)\in\Sigma_{n}\setminus A_{n}$ , which cancel each other in the permuted monomial. ∎

Proposition 3.4.

$\boldsymbol{\mu}_{\Sigma}$ * inverts $\boldsymbol{m}_{\Sigma}$ .*

Proof.

Then

[TABLE]

Indeed, according to the previous definition, $\mu_{\Sigma}=\boldsymbol{\mu}_{\Sigma}\circ\boldsymbol{m}$ , furthermore $\boldsymbol{m}_{\Sigma}$ is just a restriction of $\boldsymbol{m}$ ; this implies the first equality. The second equality is due to the fact that the higher $\mu_{h}$ ( $h\geq 2$ ) vanish under symmetrization (Lemma 3.3). The third one is true due to $\mu_{1}(X_{1})=X_{1}$ and that the symmetrization of the symmetrization is the symmetrization. This proves $\boldsymbol{\mu}_{\Sigma}\circ\boldsymbol{m}_{\Sigma}=\operatorname{id}_{\bigotimes_{\Sigma}\mathfrak{g}}$ . In particular, $\boldsymbol{m}_{\Sigma}$ is injective. Then, the surjectivity of $\boldsymbol{m}_{\Sigma}$ implies bijectivity, and, in fact, the inverse relationship. ∎

This, in particular, proves the symmetric global version of the PBW theorem, i. e. that $\boldsymbol{m}_{\Sigma}$ is an isomorphism.

The facts behind the proof above are known for a long time: It is known that the canonical projections (the components of $(\boldsymbol{m}_{\Sigma})^{-1}\circ\boldsymbol{m}\equiv\boldsymbol{\mu}_{\Sigma}\circ\boldsymbol{m}=\mu_{\Sigma}:\bigotimes\mathfrak{g}\rightarrow\bigotimes_{\Sigma}\mathfrak{g}$ ) can be expressed by Magnus-commutators $\mu_{n}$ , see Solomon [33] and Mielnik, Plebański [23]; which satisfy rational Lie-recursions, see Magnus [21]. Cf. also Reutenauer [26].

The advantage of this proof is that it is constructive and explicit. On the other hand, the unmotivated nature of the definition of the $\mu_{n}$ is a disadvantage. However, we have not used the definition of the $\mu_{n}$ directly, but only that they satisfy

( $\mu 1$ )

$\mu_{1}(X_{1})=X_{1}$ ; 2. ( $\mu 2$ )

$\mu_{n}(X_{1},\ldots,X_{n})$ is a linear combination of Lie-monomials where every variable has multiplicity $1$ ; 3. ( $\mu 3$ )

identities (2) hold.

Thus, it might be useful to obtain a somewhat less explicit existence theorem for $\mu_{n}$ but which is motivated by simple universal algebraic principles. The best principle in that respect would be the global symmetric PBW theorem itself. From it we could obtain $\mu_{n}$ as a canonical projection. This is, of course, not the way we intend to follow (at this point). As it happens, some simpler arguments suffice:

4. The existence of $\mu$ II

We define a Lie-permutation $Ii$ of $\{1,\ldots,n\}$ as the following data. It is a partition $I_{1}\dot{\cup}\ldots\dot{\cup}I_{s}=\{1,\ldots,n\}$ such that $\max I_{1}<\ldots<\max I_{s}$ , and finite sequences $i_{k,1},\ldots,i_{k,p_{k}}$ such that $\{i_{k,1},\ldots,i_{k,p_{k}}\}=I_{k}$ , $p_{k}=|I_{k}|$ and $i_{k,p_{k}}=\max I_{k}$ .

Lemma 4.1.

The number of Lie-permutations of $\{1,\ldots,n\}$ , $n\geq 0$ , is $n!$ .

Proof.

For any Lie-permutation $Ii$ write down the sequence

[TABLE]

This yields a permutation of $\{1,\ldots,n\}$ . From this permutation the Lie-permutation can be reconstructed. Indeed, in the permutation sequence, the first couple of elements up to ‘ $n$ ’ form the last partition set $I_{s}$ with ordering. Then, from the rest, the first couple of elements up to the maximal element form the the partition set $I_{s-1}$ with ordering; etc. It is easy to see that we have a bijection between permutations and Lie-permutations. ∎

In what follows let $\mathbb{Q}X_{\Sigma_{n}}$ be the vector space spanned by the noncommutative monomials $X_{\sigma(1)}\ldots X_{\sigma(n)}$ in the corresponding noncommutative polynomial ring over $\mathbb{Q}$ .

Proposition 4.2.

Any element of $\mathbb{Q}X_{\Sigma_{n}}$ can uniquely be written in the form

[TABLE]

where $a_{Ii}\in\mathbb{Q}$ . (Here we used ordinary commutators and symmetrized products.)

Proof.

Existence is a consequence of the standard symmetrization argument but applied in the non-commutative polynomial algebra. This proves that any element is a sum symmetric products of Lie-monomials. Lie-monomials, on the other hand, can be brought into standard form (highest indices on the right in left-iterated Lie-commutators). Uniqueness follows from dimensional reasons, as the number of Lie-partitions of $\{1,\ldots,n\}$ is $n!$ , the same as the dimension of $\mathbb{Q}X_{\Sigma_{n}}$ . ∎

Let us write the monomial $X_{1}\ldots X_{n}$ into a form like above:

[TABLE]

the $b_{Ii}$ are concrete rational numbers.

Definition 4.3.

Then let us define

[TABLE]

where we now use Lie-commutators instead of commutators.

Now, the $\mu_{n}$ satisfy ( $\mu 1$ ), ( $\mu 2$ ), and we can also prove

Proposition 2.3.

The Lie-polynomials $\mu_{n}(X_{1},\ldots,X_{n})$ , $n\geq 1$ , satisfy the identities

[TABLE]

for $n\geq 2$ , $1<k\leq n$ .

Proof.

Using Lie algebra rules, both sides of (2) can be brought into form

[TABLE]

respectively. However, let us consider the expansion of

[TABLE]

with respect to (4) in each component separately (for $n$ , $n$ and $n-1$ terms, respectively), and apply formally the same standardization procedure (highest indices on the right in left-iterated commutators) to it. In the standardization process, the number of the components in the symmetric products does not change. All we ask to from the standardization process is to proceed in the lowest formal symmetric rank (i. e. 1) with commutators as we did with Lie-commutators before. Then, due to the unicity of the description (Proposition 4.2), the formally lowest symmetric orders agree,

[TABLE]

and again, due to the unicity, $c_{\chi}^{\mathrm{LHS}}=c_{\chi}^{\mathrm{RHS}}$ , and this is what we wanted to prove. ∎

Then one can proceed with the proof of symmetric global PBW theorem as in Section 3.

One can ask if the $\mu_{n}$ defined in Sections 2 and 4 are the same. Of course, they are, as they serve as components in $(\boldsymbol{m}_{\Sigma})^{-1}\circ\boldsymbol{m}$ (in particular, in the case of the free Lie algebra over $\mathbb{Q}$ ), and the inverse is unique.

From the content of Sections 3–4, one can simply develop several properties of the Magnus commutators. Some consequences are presented in Section 5 and Appendix D.

5. Related to $\mu$ I

Assume that expanded in the rational noncommutative polynomial algebra,

[TABLE]

For the moment, we are not interested in the actual values of the $\mu_{\sigma}$ (but see Remark D.1). Let us fix an arbitrary element $k\in\{1,\ldots,n\}$ . Now, $\mu_{n}(X_{1},\ldots,X_{n})$ is a Lie-polynomial, thus, using standard commutator rules, we can write it as linear combination of terms $\boldsymbol{[}X_{i_{1}},\ldots,X_{i_{n-1}},X_{k}\boldsymbol{]}_{\mathrm{L}}$ , where $\{i_{1},\ldots,i_{n-1}\}=\{1,\ldots,n\}\setminus\{k\}$ . However, evaluated in the noncommutative polynomial algebra, such a commutator expression gives only one monomial contribution $X_{i_{1}}\ldots X_{i_{n-1}}X_{k}$ such that the last term is $X_{k}$ . Thus, the coefficient of $\boldsymbol{[}X_{i_{1}},\ldots,X_{i_{n-1}},X_{k}\boldsymbol{]}_{\mathrm{L}}$ can be read off from (5). We find that

[TABLE]

(Cf. Arnal, Casas, Chiralt [1].) Summing this for all possible $k$ , we obtain

[TABLE]

This allows to prove the following general version of the Dynkin–Specht–Wever lemma:

Proposition 5.1.

( $\mathbb{Q}\subset K$ ) Suppose that the $K$ -submodule $W\subset{\textstyle\bigotimes}^{n}\mathfrak{g}$ is closed for actions of $\Sigma_{n}$ inducing permutations in the order of tensor product. Also assume that $\boldsymbol{m}|_{W}:W\rightarrow\mathcal{U}\mathfrak{g}$ is injective. Now, if

[TABLE]

such that $P\in W$ and $\boldsymbol{m}(P)\in\mathfrak{g}$ , then

[TABLE]

Proof.

$\boldsymbol{m}(P)\in\mathfrak{g}$ means that in $\mathcal{U}\mathfrak{g}$

[TABLE]

Due to the injectivity of $\boldsymbol{m}|_{W}$ , we find

[TABLE]

(both sides are in $W$ , because $W$ is permutation-invariant). Then, applying $\boldsymbol{[},\boldsymbol{]}_{\mathrm{L}}$ , and using (7), and, finally, $\boldsymbol{m}(P)\in\mathfrak{g}$ , we find

[TABLE]

This is what we wanted to prove. ∎

The discussion extends to the weighted case. If we assign the weight $w_{i}\in K$ to the variables $X_{i}$ (for accounting purposes), then we can sum (6) for all possible $k$ with weight $w_{k}$ respectively. Then we obtain

[TABLE]

Assume that $\mathfrak{g}$ is $K$ -graded as a $K$ -module (but not necessarily as a Lie $K$ -algebra). Then $\bigotimes\mathfrak{g}$ is also $K$ -graded naturally. Let $w:\bigotimes\mathfrak{g}\rightarrow\bigotimes\mathfrak{g}$ be the map which acts as multiplication by $k$ on the component of grade $k\in K$ .

Proposition 5.2.

( $\mathbb{Q}\subset K$ ) Suppose $W\subset{\textstyle\bigotimes}\mathfrak{g}$ is closed for actions of all $\sigma_{r}\in\Sigma_{r}$ inducing permutations in the $r$ th tensor order. Also assume that $\boldsymbol{m}|_{W}:W\rightarrow\mathcal{U}\mathfrak{g}$ is injective. Now, if

[TABLE]

such that $P\in W$ and $\boldsymbol{m}(P)\in\mathfrak{g}$ , then

[TABLE]

Proof.

We can assume that $a_{i,\lambda}$ is of homogeneous grade $w_{i,\lambda}$ . Then the previous proof works but using (9) instead of (7). ∎

The ordinary (weighted) Dynkin–Specht–Wever lemma is just the case when $\mathfrak{g}$ is the free Lie $K$ -algebra generated by the formal variables $X_{j}$ (with grade $w_{j}$ ), and $W$ is generated by tensor products $X_{j_{1}}\otimes\ldots\otimes X_{j_{n}}$ (multiplicities are possible).

Corresponding statements also hold with respect to the right-iterated Lie-commutators $\boldsymbol{[}X_{1},\ldots,X_{n}\boldsymbol{]}_{\mathrm{R}}=\boldsymbol{[}\boldsymbol{[}\ldots\boldsymbol{[}X_{1},X_{2}\boldsymbol{]},\ldots,X_{n-1}\boldsymbol{]},X_{n}\boldsymbol{]}$ .

The same arguments can also be carried out in the following way. Let us fix $k\neq l\in\{1,\ldots,n\}$ . The Lie-polynomial $\mu_{n}(X_{1},\ldots,X_{n})$ , using standard commutator rules, can be written as a linear combination of terms $\boldsymbol{[}\boldsymbol{[}X_{i_{1}},\ldots,X_{i_{p}},X_{k}\boldsymbol{]}_{\mathrm{L}},\boldsymbol{[}X_{l},X_{i_{p+1}},\ldots,X_{i_{n-2}}\boldsymbol{]}_{\mathrm{R}}\boldsymbol{]}$ , where $\{i_{1},\ldots,i_{n-2}\}=\{1,\ldots,n\}\setminus\{k,l\}$ . However, evaluated in the noncommutative polynomial algebra, such a commutator expression gives only one monomial contribution $X_{i_{1}}\ldots X_{i_{p}}X_{k}X_{l}X_{i_{p+1}}\ldots X_{i_{n-2}}$ such that $X_{k}$ is immediately followed by $X_{l}$ . Compared this to (5), we find that

[TABLE]

Summing this for all possible pairs $k,l$ , we obtain

[TABLE]

Arguing in the same manner as before, in the setting of Proposition 5.1

[TABLE]

is also true. This implies the following version of the Dynkin–Specht–Wever lemma, which holds for arbitrary $K$ :

Proposition 5.3.

Suppose that $P(X_{1},\ldots,X_{k})$ is a Lie-polynomial, i. e. an element of $\mathrm{F}_{K}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ . Assume that $P(X_{1},\ldots,X_{k})$ expands in the commutator-evaluation in the noncommutative polynomial algebra as

[TABLE]

Then

[TABLE]

Proof.

If $\mathbb{Q}\subset K$ , then it follows from the previous argument. Invoking Proposition C.4 from Appendix C, $\mathrm{F}_{\mathbb{Z}}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ embeds to $\mathrm{F}_{\mathbb{Q}}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ naturally. So, it is also true for $K=\mathbb{Z}$ . The general case follows by taking tensor products with an arbitrary $K$ . ∎

This is ‘C’-bracketed version of the well-known statement. Weighted versions are also possible, but they are better to be formulated in a multigraded environment.

6. $\mathcal{U}\mathfrak{g}$ as a direct construction

Still assume $\mathbb{Q}\subset K$ . Let us define the maps $\operatorname{bch}_{n,m}:{\textstyle\bigodot}^{n}\mathfrak{g}\otimes{\textstyle\bigodot}^{m}\mathfrak{g}\rightarrow\mathfrak{g}$ for $n+m\geq 1$ , such that

[TABLE]

(See formula (29) for motivation with respect to the notation.) Considering the natural correspondence between $a_{1}\odot\ldots\odot a_{n}$ and $a_{1}\otimes_{\Sigma}\ldots\otimes_{\Sigma}a_{n}=\frac{1}{n!}a_{\sigma(1)}\otimes\ldots\otimes a_{\sigma(n)}$ , we obtain

Proposition 6.1.

$\mathcal{U}\mathfrak{g}$ * is naturally isomorphic to $\bigodot\mathfrak{g}$ endowed with a product rule $\cdot_{\mathcal{U}}$ such that*

[TABLE]

Proof.

Indeed, we have linear isomorphisms $\boldsymbol{m}_{\Sigma}$ / $\boldsymbol{\mu}_{\Sigma}$ between the two modules. Regarding the product structure, if we resolve $\odot$ as $\otimes_{\Sigma}$ , take the tensor product, evaluate by $\boldsymbol{\mu}_{\Sigma}$ , and resolve $\otimes_{\Sigma}$ back to $\odot$ , then we obtain the product rule as above. ∎

Thus, a direct construction $\mathcal{U}_{\mathrm{dir}}\mathfrak{g}$ for $\mathcal{U}\mathfrak{g}$ , in case $\mathbb{Q}\subset K$ , would simply be $\bigodot\mathfrak{g}$ endowed with the product rule (12). (Cf. Cartier [5].) Checking well-definedness directly is not particularly hard, but checking the arithmetics for associativity is not that easy. Nevertheless, we know that the arithmetics works out, because the proposition above holds for the free Lie algebra over the rational numbers.

In particular, it works out in the case of the free $k$ -nilpotent Lie algebra, where the identity $\boldsymbol{[}X_{1},\ldots,X_{k+1}\boldsymbol{]}_{\mathrm{L}}=0$ holds. In this case, we can consider the evaluator $\operatorname{bch}_{n,m}$ , $n+m\geq k+1$ as identically [math]. In particular, the associativity works out only using $\operatorname{bch}_{n,m}$ , $n+m\leq k$ and the $k$ -nilpotency rule. Now, $\operatorname{bch}_{n,m}$ , $n+m\leq k$ can be defined using only the ring $\mathbb{Z}[\frac{1}{k!}]$ ; indeed, in the “symmetric rearrangement procedure” leading to $\mu_{n+m}$ we use symmetrizations up to $k$ elements only, and also in the definitions of $\operatorname{bch}_{n,m}$ . (For a more quantitative argument regarding $\mu_{k}$ , see (5)–(7) and Remark D.1.) Now, the free $k$ -nilpotent Lie algebra over $\mathbb{Z}[\frac{1}{k!}]$ naturally embeds into the free $k$ -nilpotent Lie algebra over $\mathbb{Q}$ . In fact, the free $k$ -nilpotent Lie algebra (but not its universal enveloping algebra) naturally embeds to the $k$ -nilpotent noncommutative polynomial algebra by the commutator representation. This implies that the associativity computation works out in the free $k$ -nilpotent Lie algebra over $\mathbb{Z}[\frac{1}{k!}]$ . However, this implies that it works out in any $k$ -nilpotent Lie algebra with $\frac{1}{k!}\in K$ . Thus, in that case, $\mathcal{U}_{\mathrm{dir}}\mathfrak{g}$ yields an associative algebra. $\mathcal{U}_{\mathrm{dir}}\mathfrak{g}$ is generated by $\mathfrak{g}$ , thus we have a natural factorization map $\mathcal{U}\mathfrak{g}\rightarrow\mathcal{U}_{\mathrm{dir}}\mathfrak{g}$ . Regarding the filtration induced by the image of ${\textstyle\bigotimes}^{n}\mathfrak{g}$ , this induces a natural factor map ${\textstyle\bigodot}^{n+1}\mathfrak{g}/Z_{n}\rightarrow{\textstyle\bigodot}^{n+1}\mathfrak{g}$ . This however, implies that $Z_{n}$ is [math]. In particular, we obtain

Proposition 6.2.

If $\mathfrak{g}$ is $k$ -nilpotent and $1,\ldots,\frac{1}{k}\in K$ , then

(o) $\mathcal{U}_{\mathrm{dir}}\mathfrak{g}$ can be defined formally;

(a) $\mathcal{U}\mathfrak{g}$ is naturally isomorphic to $\mathcal{U}_{\mathrm{dir}}\mathfrak{g}$ ; and

(b) the (local) PBW theorem holds for $\mathfrak{g}$ .

Proof.

(a) and (b) are both implied by $Z_{n}=0$ . ∎

This is a generalization of the result of Nouazé, Revoy [24].

7. $\mathrm{F}_{K}^{\operatorname{Lie}}$ via $\mathrm{F}_{\mathbb{Q}}^{\operatorname{Lie}}$

First, we give a direct proof of the PBW theorem for free Lie algebras over $\mathbb{Q}$ . (The argument works for any field of characteristic [math].) We will use the fact that free Lie algebras are multigraded. The proof will be sketchy as we rely on familiar arguments.

Proposition 7.1.

The PBW theorem holds for $\mathfrak{g}=\mathrm{F}_{\mathbb{Q}}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ .

Sketch of proof.

We will prove the symmetric global formulation. Consider

[TABLE]

Both sides are naturally multigraded, and the map is compatible with them. Thus, it is sufficient to prove isomorphism (that is injectivity) between them in every multigrade separately. If in the multigrade every variable has multiplicity at most one, then the injectivity holds due to Proposition 4.2. Regarding higher multigrades, assume that in multigrade $X_{1}^{i_{1}}\ldots X_{s}^{i_{s}}$ ,

[TABLE]

is such that it is not zero but evaluates to zero in $\mathrm{F}_{\mathbb{Q}}[X_{\lambda}:\lambda\in\Lambda]$ . Here $P$ was written such that every variable appears according to the multiplicity (for a monomial decomposition). Then the polarization

[TABLE]

( $n=i_{1}+\ldots+i_{s}$ ) is also not zero, because its depolarization is nonzero. On the other hand, it evaluates to the polarization of [math], i. e. to [math] as a noncommutative polynomial. This contradicts to the injectivity of the multigrades without multiplicity $\geq 2$ .

Remark. The proof is not particular to the symmetric formulation. We could have used a variant of Proposition 4.2 with respect to ordinary products, not with symmetrized products. The only place where $\operatorname{char}K=0$ was used is in (13), where we made a polarization such that its depolarization is the original. ∎

The PBW theorem for $\mathfrak{g}=\mathrm{F}^{\operatorname{Lie}}_{\mathbb{Q}}[X_{1},\ldots,X_{n}]$ yields $\mu_{n}(X_{1},\ldots,X_{n})$ as the component of degree 1 of $(\boldsymbol{m}_{\Sigma})^{-1}(X_{1}\ldots X_{n})$ , that is, as a canonical projection. Properties ( $\mu$ 1)–( $\mu$ 3) are straightforward to develop. (“The existence of $\mu$ III”.)

We will use Proposition C.4 from Appendix C in order to prove

Proposition 7.2.

The PBW theorem holds for $\mathfrak{g}=\mathrm{F}_{K}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ .

Proof.

First, let us consider the case $K=\mathbb{Z}$ . We know that $\mathrm{F}_{\mathbb{Z}}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ is a free $\mathbb{Z}$ -module, and $\mathrm{F}_{\mathbb{Z}}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]\otimes\mathbb{Q}\simeq\mathrm{F}_{\mathbb{Q}}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ naturally. Thus, starting from a basis of $\mathrm{F}_{\mathbb{Z}}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ , we see that $\bigotimes_{\leq}\mathrm{F}_{\mathbb{Z}}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ embeds to $\bigotimes_{\leq}\mathrm{F}_{\mathbb{Q}}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ . The latter one evaluates in $\mathcal{U}\mathrm{F}_{\mathbb{Q}}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ injectively, thus $\bigotimes_{\leq}\mathrm{F}_{\mathbb{Z}}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ evaluates (taking Lie-commutator into commutator, tensor product into ordinary product) in some ring injectively. This implies that $\bigotimes_{\leq}\mathrm{F}_{\mathbb{Z}}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ must evaluate in the universal enveloping $\mathcal{U}\mathrm{F}_{\mathbb{Z}}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ algebra injectively.

Let us now consider the general case. We know that the evaluation yields an isomorphism $\bigotimes_{\leq}\mathrm{F}_{\mathbb{Z}}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]\simeq\mathrm{F}_{\mathbb{Z}}[X_{\lambda}:\lambda\in\Lambda]$ of free $\mathbb{Z}$ -modules. But then evaluation (i. e. the process which sends Lie-commutators to commutators and tensor product to products) gives an isomorphism $K\otimes\bigotimes_{\leq}\mathrm{F}_{\mathbb{Z}}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]\simeq\mathrm{F}_{K}[X_{\lambda}:\lambda\in\Lambda]$ . On the other hand, $K\otimes\bigotimes_{\leq}\mathrm{F}_{\mathbb{Z}}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]\simeq\bigotimes_{\leq}\mathrm{F}_{K}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ naturally, and compatibly with evaluation. That proves that $\bigotimes_{\leq}\mathrm{F}_{K}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ evaluates to $\mathrm{F}_{K}[X_{\lambda}:\lambda\in\Lambda]$ isomorphically. ∎

8. The $\mathrm{F}_{K}^{\operatorname{Lie}}$ case directly

We say that a free PBW word basis is the following data. We will consider words formed from an alphabet $\Lambda$ .

(A1)

Some words should be called primitive. 2. (A2)

To any primitive word $w$ a $\boldsymbol{[},\boldsymbol{]}$ -monomial $P_{w}^{\operatorname{Lie}}$ should be associated such that the variables $X_{\lambda}$ of $P_{w}^{\operatorname{Lie}}$ correspond to the $\lambda$ in $w$ with the same multiplicity. 3. (A3)

The Lie-polynomials $P_{w}^{\operatorname{Lie}}$ should generate $\mathrm{F}_{K}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ as a $K$ -module. 4. (A4)

The primitive words should be endowed by an ordering $\preccurlyeq$ such that every word $w$ uniquely decomposes to a concatenation of primitive words $=w_{1}\ldots w_{s}$ such that $w_{1}\succcurlyeq w_{2}\succcurlyeq\ldots\succcurlyeq w_{s}$ . 5. (A5)

To any word decomposed as above we associate the noncommutative polynomial

[TABLE]

where $P_{w_{i}}$ is the commutator evaluation $P_{w_{i}}^{\operatorname{Lie}}$ . 6. (A6)

The noncommutative polynomials $P_{w}$ should be independent in $\mathrm{F}_{K}[X_{\lambda}:\lambda\in\Lambda]$ .

Our first observation is that the existence of a free PBW word basis implies the basic PBW theorem for $\mathrm{F}_{K}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ . Indeed, considering (A3) and (A6), we see that that $P_{w}^{\operatorname{Lie}}$ should be a basis of $\mathrm{F}_{K}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ . Then, due to (A3), every element in $\mathcal{U}\mathrm{F}_{K}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ can be brought into a combination of products $P_{w_{1}}^{\operatorname{Lie}},\ldots,P_{w_{s}}^{\operatorname{Lie}}$ such that $w_{1}\succcurlyeq w_{2}\succcurlyeq\ldots\succcurlyeq w_{s}$ , by the usual basic rearrangement process. Such products are then independent in the universal enveloping algebra, as they are independent in the noncommutative polynomial evaluation, due to (A6). This establishes the global form of the basic PBW theorem with respect to a specific ordering. But then the local form holds, which implies the global form in general.

Next, we will find a free PBW word basis. In order to this, we will rely on the content of Appendix C. The argument is quite combinatorial; we will be somewhat sketchy. First we establish the case $\Lambda=\mathbb{Z}$ .

Firstly, we need a breaking pattern. We will use an ordering $\sqsubseteq$ on the words made from $\mathbb{Z}$ which is monotone with respect to monotone maps of $\mathbb{Z}$ . (Lexicographic ordering suffices). We break a finite $\mathbb{Z}$ -word $w$ as follows. The definition is recursive with respect to the length of the words. We identify the greatest number $n$ in $w$ . Then $w$ reads as

[TABLE]

( $s$ many occurrences of $n$ ). If the word contains only $n$ , then we break the word to letters completely. If not, then we surely break after the last occurrence after of $n$ , and we might break ( $\mid_{?}$ ) after other occurrences of $n$ , and we might break after the last occurrence of $n$ ( $\mid_{??}$ ):

[TABLE]

Regarding $\mid_{??}$ the rule is simple: we break as we would break $i_{s+1,1},\ldots,i_{s+1,p_{s+1}}$ . Regarding $\mid_{?}$ the rule is more difficult: Consider the sequence of sequences

[TABLE]

Replace it by a sequence of integers

[TABLE]

such that the internal ordering pattern of (16) with respect to $\leq$ is the same as the ordering pattern of (15) with respect to $\sqsubseteq$ . (We say that (15) is condensed to (16)). Then the breaking places of $\mid_{?}$ are defined to be the breaking places of (16). One can prove by induction that the breaking mechanism is well-defined, and it depends only on the internal ordering pattern of $w$ with respect to $\leq$ . A word is primitive if it does not break (so, in particular, the latest cipher is the maximal). Then there is a natural ordering $\preccurlyeq$ defined between primitive $\mathbb{Z}$ -words as follows: $w_{1}\preccurlyeq w_{2}$ is the greatest (i. e. last) cipher of $w_{1}$ is smaller than the greatest (i. e. last) cipher of $w_{2}$ ; or if the last ciphers are equal, then the simultaneous condensations of $w_{1}$ are $w_{2}$ are in $\preccurlyeq$ relation. By induction, one can prove that the decomposition is non-strictly $\preccurlyeq$ -decreasing, so (A1) and (A4) are established.

Secondly, we need an evaluation pattern. We do not go into various possibilities, but we simply define $\langle X_{1},\ldots,X_{n}\rangle_{\mathrm{L}}:=[X_{1},\ldots,X_{n}]_{\mathrm{L}}$ . To every sequence $w$ as above we associate a noncommutative polynomial $P_{w}$ such that its multidegree is given by the $X_{i}$ coming from the ciphers $i$ of $w$ , with multiplicities. If $w$ contains only the cipher $n$ , in length $l$ , then $P_{w}=(X_{n})^{l}$ . Otherwise we let

[TABLE]

where

[TABLE]

and $P_{A}$ is

[TABLE]

but $X_{j_{i_{f,1},\ldots,i_{f,p_{f}}}}$ is substituted by $\langle{X_{i_{f,1}},\ldots,X_{i_{f,p_{f}}},X_{n}\rangle_{\mathrm{L}}}$ . (Remark: A recursive evaluation pattern would be $\langle{X_{i_{1}},\ldots,X_{i_{p}},X_{n}\rangle_{\mathrm{L}}}=(P_{i_{1},\ldots,i_{p}})^{\operatorname{ad}}X_{n}$ , where $\operatorname{ad}$ indicates the replacement of $X_{i}$ by $\operatorname{ad}X_{i}$ .) By induction, one can see that $P_{w}$ is a product of commutator monomials corresponding to primitive words. If $w$ is a primitive, then $P_{w}^{\operatorname{Lie}}$ can be defined as a Lie-monomial. This establishes (A2) and (A5).

Then the generating statement (A3) follows by induction (on formal multigrade in $\mathrm{F}_{K}^{\operatorname{n-a}}$ ) using the standard fact that Lie-monomials containing $X_{n}$ are Lie-polynomials of $\boldsymbol{[}X_{j_{1}},\ldots,X_{j_{k}},X_{n}\boldsymbol{]}_{\mathrm{L}}$ . The independence statement (A6) follows by induction using Corollary C.2’. By that the construction is finished for $\Lambda=\mathbb{Z}$ .

Now, $\Lambda$ is not necessarily the same as $\mathbb{Z}$ . For that reason we introduce an ordering $\lesssim$ on $\Lambda$ . Then word breaking and evaluation is induced by replacing the letters $\lambda$ in $w$ by integers $i_{\lambda}$ such that the order structure of the replacement with respect to $\leq$ is compatible with the order structure in $w$ with respect to $\lesssim$ ; then $i_{\lambda}$ is replaced back to $\lambda$ . (This is well-defined because the word breaking and evaluation structure over $\mathbb{Z}$ was invariant for monotone maps of $\mathbb{Z}$ .)

One can fine-tune the construction combinatorially by choosing various breaking patterns (e. g. one can make $\sqsubseteq$ depend on condensation history) or evaluation patterns. In fact, such constructions were developed in great depth by Hall [15], Chen, Fox, Lyndon [8], Širšov [32], Schützenberger [29], Viennot [37], Melançon, Reutenauer [22], etc.; and it is recognized that these constructions imply the PBW theorem in the free case, cf. Širšov [32], Reutenauer [26]. For us, however, variety has little benefit, one construction is sufficient, and the PBW theorem works ultimately with respect to any basis.

9. From $\mathrm{F}_{K}^{\operatorname{Lie}}$ to the basic PBW theorem

Here we assume to know that free Lie $K$ -algebras are free $K$ -modules in every multigrade separately, and that the PBW theorem holds for them.

Proposition 9.1.

The PBW theorem holds if $\mathfrak{g}$ is a free $K$ -module.

Proof.

Consider a base $\{Z_{\lambda}:\lambda\in\Lambda\}$ for $\mathfrak{g}$ . Take $\mathrm{F}^{\operatorname{Lie}}_{K}[X_{\lambda}:\lambda\in\Lambda]$ . Let us extend $\{X_{\lambda}:\lambda\in\Lambda\}$ by $\{P_{\omega}:\omega\in\Omega\}$ obtained from higher multigrades to a basis

[TABLE]

of $\mathrm{F}^{\operatorname{Lie}}_{K}[X_{\lambda}:\lambda\in\Lambda]$ . Assume that

[TABLE]

( $X_{\lambda}$ is substituted by $Z_{\lambda}$ in $P_{\omega}$ ). Then let

[TABLE]

Now

[TABLE]

is still a basis. Take any ordering $\leq$ on that; and assume, say, that elements belonging $\Lambda$ precede the ones belonging to $\Omega$ . The elements $P_{\omega}^{\prime}$ span an ideal $\mathcal{I}$ in $\mathrm{F}^{\operatorname{Lie}}_{K}[X_{\lambda}:\lambda\in\Lambda]$ . Indeed, they span exactly the kernel of the evaluation map $X_{\lambda}\mapsto Z_{\lambda}$ . If $[Z_{\lambda},Z_{\mu}]=c_{\lambda,\mu}^{\nu}Z_{\nu}$ , then $[X_{\lambda},X_{\mu}]-c_{\lambda,\mu}^{\nu}X_{\nu}\in\mathcal{I}$ ; thus there is a homomorphism

[TABLE]

where $\mathcal{I}^{\prime}$ is the ideal generated by the image of $\mathcal{I}$ . We claim that $\mathcal{I}^{\prime}$ is $K$ -linearly generated by the elements

[TABLE]

$\lambda_{1}\leq\ldots\leq\lambda_{n}\leq\omega_{1}\leq\ldots\leq\omega_{m}$ such that $n\geq 0$ , $m\geq 1$ . Indeed, if we take an arbitrary product of base elements which contains at least one $P_{\omega}^{\prime}$ and we apply the basic rearrangement procedure, then at least one element in any formal product monomial will be from $\mathcal{I}$ . A base element from $\{P_{\omega}^{\prime}:\omega\in\Omega\}$ is either unaffected in a step, or it gets commutated, but then the commutator is a $K$ -linear combination of elements of $\{P_{\omega}^{\prime}:\omega\in\Omega\}$ .

Now, the injectivity of $\boldsymbol{m}_{\leq}$ with respect to $\mathcal{B}^{\prime}$ means that the evaluation map given by

[TABLE]

( $\lambda_{1}\leq\ldots\leq\lambda_{n}$ ) is injective. Thus the evaluation map into $\mathcal{U}\mathfrak{g}$ must also be injective. (Remark: Actually, $\mathcal{U}\mathfrak{g}\simeq\mathrm{F}^{\operatorname{Lie}}_{K}[X_{\lambda}:\lambda\in\Lambda]/\mathcal{I}^{\prime}$ by universal algebraic reasons.) ∎

It seems to be a drawback that we obtained only the basic PBW theorem for free $K$ -modules. This can be remedied as follows. Due to the relatively transparent structure of $\mathrm{F}^{\operatorname{Lie}}_{K}[X_{\lambda}:\lambda\in\Lambda]$ , one can define free Lie algebras $\mathrm{F}^{\operatorname{Lie}}_{K_{\Lambda}}[X_{\lambda}:\lambda\in\Lambda]$ with variable coefficient structure. This means that in multigrade $X_{\lambda_{1}}^{i_{1}}\ldots X_{\lambda_{s}}^{i_{s}}$ ( $i_{k}\geq 1$ ) the coefficient ring is ${\textstyle\bigotimes}^{i_{1}}(K/I_{\lambda_{1}})\otimes\ldots\otimes{\textstyle\bigotimes}^{i_{s}}(K/I_{\lambda_{i}})$ ( $\simeq K/(I_{\lambda_{1}}+\ldots+I_{\lambda_{s}})$ ). This has the same monomial structure as $\mathrm{F}^{\operatorname{Lie}}_{\mathbb{Z}}[X_{\lambda}:\lambda\in\Lambda]$ ; except that some multigrades are deselected (where the coefficient ring is [math]), but this makes no essential difference. This evaluates in the noncommutative polynomial algebra $\mathrm{F}_{K_{\Lambda}}[X_{\lambda}:\lambda\in\Lambda]$ , and the PBW theorem remains valid, as in every multigrade we have the same evaluation structure as in the free Lie algebra with with respect to the appropriate coefficient ring. But then the arguments of the previous proof can be modified in order to obtain the basic PBW theorem for sums of cyclic $K$ -modules.

10. Conclusions

If one is interested in the PBW theorem per se, then the approach of Witt and Lazard is rather satisfactory (as a starting point). If one is interested in Lie groups, then an existence argument for $\mu$ + Section 3 for the symmetric PBW theorem might be a good approach. Section 7 + Section 9 specialized to the case when $K$ is of characteristic [math] gives a relatively straightforward proof in the spirit of Poincaré. One interested in a deeper study of free Lie $K$ -algebras can obtain the basic PBW theorem essentially as a byproduct.

Appendix A The Witt–Lazard proof of the global PBW theorems

Although the classical proofs of the PBW theorem which work for general fields are quite similar to each other; the approach due to Witt [39] and Lazard [19] is characterized (as opposed to Birkhoff [2] and Bourbaki [4]) by (a) an emphatic appearance of the symmetric group, and (b) a more explicit description of the ideal structure of the universal factorization. In short terms, it algebraizes the combinatorics quite well. In fact, it allows to formulate the proof of the PBW theorem simultaneously in (i) the basic case (sum of cyclic $K$ -modules) and (ii) the symmetric case ( $\mathbb{Q}\subset K$ ).

(I) Actions of symmetric groups. The symmetric group $\Sigma_{n}$ acts naturally on ${\textstyle\bigotimes}^{n}\mathfrak{g}$ by the presription

[TABLE]

Then ${\textstyle\bigotimes}^{n}_{(0)}\mathfrak{g}$ is generated by $v-\sigma*v$ where $v\in{\textstyle\bigotimes}^{n}\mathfrak{g}$ , $\sigma\in\Sigma_{n}$ . Let $W_{k,n}$ denote the permutation $(k\,\,k+1)$ in $\Sigma_{n}$ . Then it is also true that ${\textstyle\bigotimes}^{n}_{(0)}\mathfrak{g}$ is generated by $v-W_{k,n}*v$ where $v\in{\textstyle\bigotimes}^{n}\mathfrak{g}$ , $k<n$ . Let us define the $W_{k,n}\bullet:{\textstyle\bigotimes}^{n}\mathfrak{g}\rightarrow{\textstyle\bigotimes}^{n-1}\mathfrak{g}$ by taking a Lie-commutator between the $k$ th and $(k+1)$ th positions. So,

[TABLE]

We can extend $W_{k,n}*$ and $W_{k,n}\bullet$ to $\bigotimes\mathfrak{g}$ . In the first case the action is identity outside ${\textstyle\bigotimes}^{n}\mathfrak{g}$ , and in the second case it is the zero map.

We define the action $W_{k,n}\diamond$ as $W_{k,n}*+W_{k,n}\bullet$ (extended sense). Then $v-W_{k,n}\diamond v$ vanishes if $v\in{\textstyle\bigotimes}^{k}\mathfrak{g}$ , $n\neq k$ . Let $J^{n}\mathfrak{g}$ be the module generated by $v-W_{k,n}\diamond v$ , $v\in{\textstyle\bigotimes}^{n}\mathfrak{g}$ . We see that $\displaystyle{J\mathfrak{g}=\sum J^{n}\mathfrak{g}}$ . Let us extend $\diamond$ to $\Sigma_{n}$ as follows. For $1\in\Sigma_{n}$ let $1\diamond$ be the identity. For $\sigma\in\Sigma_{n}\setminus\{1,W_{1,n},\ldots,W_{n-1,n}\}$ we choose an arbitrary (but fixed) decomposition $\sigma=W_{a_{1},n}\ldots W_{a_{s},n}$ , and we let $(\sigma\diamond)=(W_{a_{1},n}\diamond)\ldots(W_{a_{s},n}\diamond)$ . Now, $\sigma\diamond$ still acts as identity outside ${\textstyle\bigotimes}^{n}\mathfrak{g}$ , but it does not necessarily define an associative action of $\Sigma_{n}$ . However, it is not very far from it:

Lemma A.1.

$W_{k,n}\diamond$ * acts trivially on $J^{n-1}\mathfrak{g}$ (thus invariantly), and*

[TABLE]

Proof.

The triviality property follows from $J^{n-1}\mathfrak{g}\subset{\textstyle\bigotimes}^{n-1}\mathfrak{g}\oplus{\textstyle\bigotimes}^{n-2}\mathfrak{g}$ . The equalities follow from the identities

[TABLE]

if $l-k\geq 2$ ;

[TABLE]

which, checked against $v_{1}\otimes\ldots\otimes v_{n}$ , follow from the Lie-identities. ∎

Corollary A.2.

$\diamond$ * extends to an associative action of $\Sigma_{n}$ modulo $J^{n-1}\mathfrak{g}$ .*

Proof.

In (P1)–(P3) we recognize the semigroup presentation of $\Sigma_{n}$ based on $W_{k,n}$ (Cf. Dickson [10], P. 2, Ch. XIII). The relations are satisfied according to the previous lemma, thus the action descends to $\Sigma_{n}$ . ∎

(II) The tensorial splittings. We define the forgetting map $\mathrm{f}^{n}:\Sigma_{n}\otimes{\textstyle\bigotimes}^{n}\mathfrak{g}\rightarrow{\textstyle\bigotimes}^{n}\mathfrak{g}$ such that

[TABLE]

and we define the evaluation map $\mathrm{e}^{n}:\Sigma_{n}\otimes{\textstyle\bigotimes}^{n}\mathfrak{g}\rightarrow{\textstyle\bigotimes}^{n}\mathfrak{g}$ such that

[TABLE]

In case (i), we take a basis a $\{g_{\alpha}\,:\,\alpha\in A\}$ , and introduce an ordering $\leq$ on $A$ . We define $\eta_{n}:{\textstyle\bigotimes}^{n}\mathfrak{g}\rightarrow\Sigma_{n}\otimes{\textstyle\bigotimes}^{n}\mathfrak{g}$ such that

[TABLE]

where $\alpha_{\sigma^{-1}(1)}\leq\ldots\leq\alpha_{\sigma^{-1}(n)}$ and $i<j$ , $\alpha_{i}=\alpha_{j}$ implies $\sigma(i)<\sigma(j)$ . I. e. $\sigma$ is the permutation which orders $\alpha_{1},\ldots,\alpha_{n}$ with the least number of involutions.

In case (ii), we simply define

[TABLE]

Then

[TABLE]

It easy to see from the definition that

[TABLE]

for any $\sigma\in\Sigma_{n}$ , $v\in{\textstyle\bigotimes}^{n}\mathfrak{g}$ . This is the same thing to say as $\mathrm{e}^{n}\circ\eta_{n}\circ(\mathrm{f}^{n}-\mathrm{e}^{n})=0$ . Then

[TABLE]

Indeed, $\mathrm{e}^{n}\circ\eta_{n}\circ(\operatorname{id}-\mathrm{e}^{n}\circ\eta_{n})=\mathrm{e}^{n}\circ\eta_{n}\circ(\mathrm{f}^{n}-\mathrm{e}^{n})\circ\eta_{n}=0$ .

This idempotence yields the direct sum decomposition

[TABLE]

where the first factor is named so by definition, and regarding the identification of the second factor we note that $\operatorname{im}(\operatorname{id}-\mathrm{e}^{n}\circ\eta_{n})=\operatorname{im}\mathrm{(}f^{n}-\mathrm{e}^{n})\circ\eta_{n}\subset{\textstyle\bigotimes}^{n}_{(0)}\mathfrak{g}\equiv\operatorname{im}\mathrm{f}^{n}-\mathrm{e}^{n}\subset\ker\mathrm{e}^{n}\circ\eta_{n}$ .

(III) The PBW splittings. Note that in case (i), ${\textstyle\bigotimes}_{\eta}\mathfrak{g}={\textstyle\bigotimes}_{\leq}\mathfrak{g}$ ; and in case (ii), ${\textstyle\bigotimes}_{\eta}\mathfrak{g}={\textstyle\bigotimes}_{\Sigma}\mathfrak{g}$ . Thus, the statement of the PBW theorem is that $\bigotimes_{\eta}\mathfrak{g}$ and $J\mathfrak{g}$ do not intersect each other (and, in fact, they are complementer spaces in $\bigotimes\mathfrak{g}$ ).

Also note that very little happens in (II). It only algebraizes familiar combinatorial content which is otherwise accepted without much ado. The point is that we can modify this content as follows:

We define the evaluation map $\mathrm{e}\mathrm{e}^{n}:\Sigma_{n}\otimes{\textstyle\bigotimes}^{n}\mathfrak{g}\rightarrow{\textstyle\bigotimes}^{n}\mathfrak{g}\oplus{\textstyle\bigotimes}^{n-1}\mathfrak{g}$ such that

[TABLE]

Lemma A.3.

For $v\in{\textstyle\bigotimes}^{n}\mathfrak{g}$ ,

[TABLE]

Proof.

Let us note that $\sigma\diamond$ , $\sigma\in\Sigma_{n}$ , acts trivially on $W_{k,n}\diamond v-W_{k,n}*v\in{\textstyle\bigotimes}^{n-1}\mathfrak{g}$ ; so $(\operatorname{id}-\sigma\diamond)(W_{k,n}\diamond v-W_{k,n}*v)=0$ . This implies $(\operatorname{id}-\sigma\diamond)(W_{k,n}*v)=(\operatorname{id}-\sigma\diamond)(W_{k,n}\diamond v)$ .

Case (i): Assume that $v=g_{\alpha_{1}}\otimes\ldots\otimes g_{\alpha_{n}}$ . If $\alpha_{k}=\alpha_{k+1}$ , then $W_{k,n}*v=v$ , $W_{k,n}\diamond v=v$ ,

[TABLE]

If $\alpha_{k}\neq\alpha_{k+1}$ , $\eta_{n}(v)=\sigma\otimes v$ , then $\eta_{n}(W_{k,n}*v)=(\sigma W_{k,n})\otimes(W_{k,n}*v)$ . Thus

[TABLE]

Case (ii):

[TABLE]

Remark A.4.

The elements $v-\sigma\diamond v$ with $v\in{\textstyle\bigotimes}^{n}\mathfrak{g}$ , $\sigma\in\Sigma_{n}$ still generate only $J^{n}\mathfrak{g}$ . Indeed, if the canonical decomposition is $\sigma=W_{a_{1},n}\ldots W_{a_{s},n}$ , then

[TABLE]

Using similar arguments, $W_{k,n}$ can be replaced by an arbitrary $\sigma\in\Sigma_{n}$ in equation (20). Actually, the discussion yields constructive maps $\mathrm{h}_{n}:\Sigma_{n}\otimes{\textstyle\bigotimes}^{n}\mathfrak{g}\rightarrow W\otimes{\textstyle\bigotimes}^{n-1}\mathfrak{g}$ such that

[TABLE]

If we consider only the ${\textstyle\bigotimes}^{n}\mathfrak{g}$ -part, then this yields $v-\sigma*v=(\operatorname{id}-\mathrm{e}^{n}\circ\eta_{n})(v-\sigma*v)$ , which simplifies to $\mathrm{e}^{n}\circ\eta_{n}(v)=\mathrm{e}^{n}\circ\eta_{n}(\sigma*v)$ , cf. equation (19).

(This remark is not needed to the proof.) ∎

Let $J_{\eta}^{n}\mathfrak{g}$ be the image of ${\textstyle\bigotimes}^{n}_{(0)}\mathfrak{g}$ under $\operatorname{id}-\mathrm{e}\mathrm{e}^{n}\circ\eta_{n}=(\mathrm{f}^{n}-\mathrm{e}\mathrm{e}^{n})\circ\eta_{n}$ . (Strictly speaking, $J_{\eta}^{n}\mathfrak{g}$ depends not only on $\eta$ but also on $\diamond$ .) Note that any element $w\in J_{\eta}^{n}\mathfrak{g}$ can be reconstructed from its projection to ${\textstyle\bigotimes}^{n}\mathfrak{g}$ which is in ${\textstyle\bigotimes}^{n}_{(0)}\mathfrak{g}$ . Indeed, the projection of $v-W_{k,n}\diamond v$ is $v-W_{k,n}*v$ , and the projection of $J^{n-1}\mathfrak{g}$ is [math]; and formula (20) implies $w=(\operatorname{id}-\mathrm{e}\mathrm{e}^{n}\circ\eta_{n})(\operatorname{pr}_{\,\bigotimes^{n}\mathfrak{g}}w)$ .

Remark A.5.

One can show that $J_{\eta}^{n}\mathfrak{g}\subset J^{n}\mathfrak{g}$ . Indeed, the LHS is the image of ${\textstyle\bigotimes}^{n}_{(0)}\mathfrak{g}$ under $(\mathrm{f}^{n}-\mathrm{e}\mathrm{e}^{n})\circ\eta_{n}$ , while the RHS is the image of $\mathrm{f}^{n}-\mathrm{e}\mathrm{e}^{n}$ (cf. the beginning of the previous Remark). Now, $J^{n}\mathfrak{g}$ projects to ${\textstyle\bigotimes}^{n}_{(0)}\mathfrak{g}$ , thus $(\operatorname{id}-\mathrm{e}\mathrm{e}^{n}\circ\eta_{n})\circ\operatorname{pr}_{\,\bigotimes^{n}\mathfrak{g}}$ yields an idempotent on $J^{n}\mathfrak{g}$ . It is straightforward to see from Lemma A.3 that the corresponding inner direct sum decomposition is

[TABLE]

(E. g. $[x,y]\otimes z+[y,z]\otimes x+[z,x]\otimes y-z\otimes[x,y]-x\otimes[y,z]-y\otimes[z,x]\in J^{3}\mathfrak{g}\cap J^{2}\mathfrak{g}$ .)

(This remark is not needed to the proof.) ∎

Let $J^{\leq n}\mathfrak{g}=J^{0}\mathfrak{g}+\ldots+J^{n}\mathfrak{g}$ .

Corollary A.6.

The following inner direct sum decompositions hold:

(a) $J^{\leq n}\mathfrak{g}=J^{\leq n-1}\mathfrak{g}\oplus J_{\eta}^{n}\mathfrak{g}$ ;

(b) $J^{\leq n}\mathfrak{g}=J_{\eta}^{0}\mathfrak{g}\oplus\ldots\oplus J^{n}_{\eta}\mathfrak{g}$ ;

(c) $J\mathfrak{g}=J_{\eta}^{0}\mathfrak{g}\oplus\ldots\oplus J^{n}_{\eta}\mathfrak{g}\oplus\ldots$ ;

(d) ${\textstyle\bigotimes}^{\leq n}\mathfrak{g}={\textstyle\bigotimes}^{\leq n-1}\mathfrak{g}\oplus{\textstyle\bigotimes}^{n}\mathfrak{g}={\textstyle\bigotimes}^{\leq n-1}\mathfrak{g}\oplus{\textstyle\bigotimes}^{n}_{\eta}\mathfrak{g}\oplus J^{n}_{\eta}\mathfrak{g}$ ;

(e) $\underbrace{{\textstyle\bigotimes}^{0}\mathfrak{g}\oplus\ldots\oplus{\textstyle\bigotimes}^{n}\mathfrak{g}}_{\textstyle{\bigotimes}^{\leq n}\mathfrak{g}}=\underbrace{{\textstyle\bigotimes}^{0}_{\eta}\mathfrak{g}\oplus\ldots\oplus{\textstyle\bigotimes}^{n}_{\eta}\mathfrak{g}}_{\textstyle{\bigotimes}^{\leq n}_{\eta}\mathfrak{g}}\oplus\underbrace{J_{\eta}^{0}\mathfrak{g}\oplus\ldots\oplus J^{n}_{\eta}\mathfrak{g}}_{\textstyle{J^{\leq n}\mathfrak{g}}}$ ;

*(f) $\textstyle{\bigotimes}\mathfrak{g}=\textstyle{\bigotimes}_{\eta}\mathfrak{g}\oplus J\mathfrak{g}$ .

Remark: Here ${\textstyle\bigotimes}^{0}\mathfrak{g}={\textstyle\bigotimes}^{0}_{\eta}\mathfrak{g}=K$ , $J^{0}\mathfrak{g}=J^{0}_{\eta}\mathfrak{g}=0$ , $J^{1}\mathfrak{g}=J^{1}_{\eta}\mathfrak{g}=0$ .*

Proof.

(a) Lemma A.3 implies $J^{n-1}\mathfrak{g}+J^{n}\mathfrak{g}=J^{n-1}\mathfrak{g}+J_{\eta}^{n}\mathfrak{g}$ . Adding $J^{\leq n-2}\mathfrak{g}$ to both sides, we obtain $J^{\leq n}\mathfrak{g}=J^{\leq n-1}\mathfrak{g}+J_{\eta}^{n}\mathfrak{g}$ . The two factors in the sum must be disjoint, as projected to ${\textstyle\bigotimes}^{n}\mathfrak{g}$ , the first factor projects to [math], while the second factor projects faithfully.

(b) follows from (a) inductively.

(c) follows from (b) by taking increasing unions.

(d) The first equality in obvious. The second one follows from the fact that on the RHS, as projected to ${\textstyle\bigotimes}^{n}\mathfrak{g}$ , the second factor projects to ${\textstyle\bigotimes}^{n}_{\eta}\mathfrak{g}$ faithfully, and $J^{n}_{\eta}\mathfrak{g}$ projects to ${\textstyle\bigotimes}_{(0)}^{n}\mathfrak{g}$ faithfully.

(e) follows from (d) inductively, the labeling uses (b).

(f) follows from (e) by taking increasing unions. ∎

In particular, we find the statement of the PBW theorem in (f).

Appendix B About free Lie algebras. Version 1

Free Lie $K$ -algebras (or any other kinds of free algebras) do not really require specific constructions. Nevertheless, it is very useful to have some structure theorems which provide some control over them, even if minimal. Let us think about the free Lie $K$ -algebra $\mathrm{F}_{K}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ as the free nonassociative $K$ -algebra $\mathrm{F}^{\operatorname{n-a}}_{K}[X_{\lambda}:\lambda\in\Lambda]$ factorized further by the $K$ -submodule (ideal) $\mathrm{I}_{K}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ . Additively, $\mathrm{F}^{\operatorname{n-a}}_{K}[X_{\lambda}:\lambda\in\Lambda]$ is just the free $K$ -module generated by the $\boldsymbol{[},\boldsymbol{]}$ -monomials of the $X_{\lambda}$ .

Proposition B.1.

$\mathrm{I}^{\operatorname{Lie}}_{K}[X_{\lambda}:\lambda\in\Lambda]$ * is generated by the elements*

(F1)

$kM(\boldsymbol{[}Z_{1},Z_{1}\boldsymbol{]},X_{\lambda_{1}},\ldots,X_{\lambda_{s}})$ * ,* 2. (F2)

$kM(\boldsymbol{[}Z_{1},Z_{2}\boldsymbol{]},X_{\lambda_{1}},\ldots,X_{\lambda_{s}})+kM(\boldsymbol{[}Z_{2},Z_{1}\boldsymbol{]},X_{\lambda_{1}},\ldots,X_{\lambda_{s}})$ * ,* 3. (F3)

$kM(\boldsymbol{[}\boldsymbol{[}Z_{1},Z_{2}\boldsymbol{]},Z_{3}\boldsymbol{]},X_{\lambda_{1}},\ldots,X_{\lambda_{s}})$ * $+kM(\boldsymbol{[}\boldsymbol{[}Z_{2},Z_{3}\boldsymbol{]},Z_{1}\boldsymbol{]},X_{\lambda_{1}},\ldots,X_{\lambda_{s}})$ *

$+kM(\boldsymbol{[}\boldsymbol{[}Z_{3},Z_{1}\boldsymbol{]},Z_{2}\boldsymbol{]},X_{\lambda_{1}},\ldots,X_{\lambda_{s}})$ ;

where $Z_{1},Z_{2},Z_{3}$ are monomials of the $X_{\lambda}$ , and $M(\ldots)$ is a $\boldsymbol{[},\boldsymbol{]}$ -bracketing with $s+1$ many positions (but not necessarily in the indicated order), and $k\in K$ .

Proof.

Such elements are clearly in the ideal $\mathrm{I}^{\operatorname{Lie}}_{K}$ . Conversely, whenever we take elements from $\mathrm{F}^{\operatorname{n-a}}_{K}$ and apply the Lie-identities, then they expand to sums of cases (F1)–(F3) with trivial $M$ . (Notice that case (F2) cannot be omitted.) Thus the primary relations (coming form the Lie-identities) are generated. The secondary relations (coming from $x\sim y\rightarrow\boldsymbol{[}x,z\boldsymbol{]}\sim\boldsymbol{[}y,z\boldsymbol{]},\boldsymbol{[}z,x\boldsymbol{]}\sim\boldsymbol{[}z,y\boldsymbol{]}$ are also generated due to linearity and that nontrivial $M$ are allowed. ∎

Corollary B.2.

$\mathrm{F}^{\operatorname{Lie}}_{K}[X_{\lambda}:\lambda\in\Lambda]$ * is multigraded by the number of various variables.*

In any multigrade, corresponding to finite multiset of the $X_{\lambda}$ , $\mathrm{F}^{\operatorname{Lie}}_{K}[X_{\lambda}:\lambda\in\Lambda]$ is generated by finitely many monomials.

The structure in a given multigrade depends only on its multiplicity structure (independently of the presence of other variables, etc.).

Proof.

The multigradedness will be inherited from $\mathrm{F}^{\operatorname{n-a}}_{K}[X_{\lambda}:\lambda\in\Lambda]$ , because the relations from Proposition B.1 are multigrade-homogeneous. Furthermore, every finite multiset of the $X_{\lambda}$ can be bracketed only in finitely many ways, so finitely generatedness is true even in $\mathrm{F}^{\operatorname{n-a}}_{K}[X_{\lambda}:\lambda\in\Lambda]$ . The structure of $\mathrm{I}^{\operatorname{Lie}}_{K}[X_{\lambda}:\lambda\in\Lambda]$ also depends only on the multiplicity pattern. ∎

The following result, the Dynkin–Specht–Wever lemma (cf. Dynkin [13], Specht [34], Wever [38]) is a simple consequence of the gradedness of the free Lie $K$ -algebra. We present the weighted version. Suppose that we assign the weight $w_{\lambda}\in K$ to every variable $X_{\lambda}$ . Let $w:\mathrm{F}^{\operatorname{Lie}}_{K}[X_{\lambda}:\lambda\in\Lambda]\rightarrow\mathrm{F}^{\operatorname{Lie}}_{K}[X_{\lambda}:\lambda\in\Lambda]$ be the map which multiplies by ${m_{1}}w_{\lambda_{1}}+\ldots+{m_{s}}w_{\lambda_{s}}$ in multigrade $X_{\lambda_{1}}^{m_{1}}\ldots X_{\lambda_{s}}^{m_{s}}$ .

Proposition B.3.

(Weighted Dynkin–Specht–Wever lemma.) Suppose that $P(X_{1},\ldots,X_{n})$ is a Lie-polynomial, i. e. an element of $\mathrm{F}^{\operatorname{Lie}}[X_{\lambda}:\lambda\in\Lambda]$ . Assume that $P(X_{1},\ldots,X_{n})$ expands in the commutator-evaluation to the noncommutative polynomial

[TABLE]

Then

[TABLE]

Proof.

Consider $\mathrm{F}^{\operatorname{Lie}}_{K}[X_{1},\ldots,X_{n}]\oplus K\mathbf{u}$ , and extend the Lie bracket such that $\boldsymbol{[}\mathbf{u},\mathbf{u}\boldsymbol{]}=0$ , and $\boldsymbol{[}Q,\mathbf{u}\boldsymbol{]}=w(Q)$ , $\boldsymbol{[}\mathbf{u},Q\boldsymbol{]}=-w(Q)$ . This yields a Lie $K$ -algebra. (It is sufficient to check $[x,x]=0$ , $[x,y]+[y,x]=0$ , $[[x,y],z]+[[y,z],x]+[[z,x],y]=0$ when $x,y,z$ are Lie-monomials or $\mathbf{u}$ ). Then

[TABLE]

The “unweighted” Dynkin–Specht–Wever lemma is when every weight $w_{i}$ is equal to $1$ . Similar statements hold with respect to right-iterated higher commutators.

The gradedness also allows to apply the PBW theorem (for sum of cyclic submodules) to obtain the representability theorem Magnus [20] (cf. also Witt [39]) for free Lie $K$ -algebras.

Proposition B.4.

(Theorem of Magnus about the representability of free Lie algebras.)

(a) $\mathrm{F}^{\operatorname{Lie}}_{K}[X_{1},\ldots,X_{n}]$ is a free $K$ -module (in every multigrade). In fact, $\mathrm{F}^{\operatorname{Lie}}_{K}[X_{1},\ldots,X_{n}]\simeq\mathrm{F}^{\operatorname{Lie}}_{\mathbb{Z}}[X_{1},\ldots,X_{n}]\otimes K$ naturally (in every multigrade).

(b) $\mathrm{F}^{\operatorname{Lie}}_{K}[X_{1},\ldots,X_{n}]$ embeds to the noncommutative polynomial algebra $\mathrm{F}_{K}[X_{1},\ldots,X_{n}]$ by the commutator-evaluation.

Proof.

Assume $K=\mathbb{Z}$ . Then $\mathrm{F}^{\operatorname{Lie}}_{\mathbb{Z}}[X_{1},\ldots,X_{n}]$ is a finitely generated $\mathbb{Z}$ -module in every multigrade, thus it is a sum of cyclic $\mathbb{Z}$ -modules. Then the PBW theorem (for sums of cyclic submodules) can be applied to show that $\mathrm{F}^{\operatorname{Lie}}_{\mathbb{Z}}[X_{1},\ldots,X_{n}]$ embeds into $\mathcal{U}\mathrm{F}^{\operatorname{Lie}}_{\mathbb{Z}}[X_{1},\ldots,X_{n}]\simeq\mathrm{F}_{\mathbb{Z}}[X_{1},\ldots,X_{n}]$ . It is immediate that (b) the image is the commutator subalgebra; and (a) the image of $\mathrm{F}^{\operatorname{Lie}}_{\mathbb{Z}}[X_{1},\ldots,X_{n}]$ has no torsion, so $\mathrm{F}^{\operatorname{Lie}}_{\mathbb{Z}}[X_{1},\ldots,X_{n}]$ is a free $\mathbb{Z}$ -module in every multigrade.

In fact, we observe that additively $\mathrm{F}^{\operatorname{n-a}}_{\mathbb{Z}}[X_{1},\ldots,X_{n}]\simeq(\text{a free$ \mathbb{Z} $-module})\oplus\mathrm{I}^{\operatorname{Lie}}_{\mathbb{Z}}[X_{1},\ldots,X_{n}]$ (in every multigrade). This decomposition structure survives by tensoring with $K$ , so general case (a) follows. Then general case (b) follows using the PBW theorem. ∎

Remark B.5.

The approach of the proof of the Proposition B.4 is sort of the minimal if one wants to amend the basic PBW theorem (case (i)) to free Lie $K$ -algebras; although it is not very informative regarding the possible bases of free Lie $K$ -algebras. However, a generalization of the techniques used in the proof of the Dynkin–Specht–Wever lemma can be applied as an alternative:

Elimination by derivations. (We only sketch this approach.) It is easy to see that derivations $D$ of $\mathrm{F}^{\operatorname{n-a}}_{K}[X_{\lambda}:\lambda\in\Lambda]$ are determined by arbitrary prescriptions $D(X_{\lambda})=P^{\operatorname{n-a}}_{\lambda}$ . Then Proposition B.1 implies easily that these derivations descend to $\mathrm{F}^{\operatorname{Lie}}_{K}[X_{\lambda}:\lambda\in\Lambda]$ . In particular, its derivations are also given by arbitrary prescriptions $D(X_{\lambda})=P_{\lambda}$ . For any Lie $K$ -algebra $\mathfrak{g}$ , we can defined the extension $\mathfrak{g}\rtimes\operatorname{Der}_{K}\mathfrak{g}$ such that $\boldsymbol{[}D,x\boldsymbol{]}=D(x)$ . We can apply this in the setting when $\Lambda$ is a the set of words $\operatorname{Words}(A)$ on an alphabet, $\mathfrak{g}=\mathrm{F}^{\operatorname{Lie}}_{K}[X_{\lambda}:\lambda\in\Lambda]$ , and the derivations $\partial_{a}$ $(a\in A)$ are given by $\partial_{a}(X_{\lambda})=X_{a\lambda}$ .

Assume that $P$ is a Lie-polynomial of multigrade $X_{1}^{i_{1}}\ldots X_{n}^{i_{n}}$ , $i_{1},\ldots,i_{n}\geq 1$ ; $A=\{1,\ldots,n\}$ . Then, using the universal properties of Lie-polynomials, we can substitute $\partial_{i}$ to $X_{i}$ for $1\leq i<n$ . As $P$ is a Lie-polynomial of some $\boldsymbol{[}X_{j_{1}},\ldots,X_{j_{s}},X_{n}\boldsymbol{]}$ ( $j_{k}<n$ ), we see that the result is a Lie-polynomials of some $X_{j_{1},\ldots,j_{s},n}$ . Back substitution of $\boldsymbol{[}X_{j_{1}},\ldots,X_{j_{s}},X_{n}\boldsymbol{]}$ into $X_{j_{1},\ldots,j_{s},n}$ also works, so we obtain that Lie-polynomials of multigrade $X_{1}^{i_{1}}\ldots X^{n}_{i_{n}}$ , $i_{1},\ldots,i_{n}\geq 1$ , are in bijective correspondence to Lie-polynomials of $X_{j_{1},\ldots,j_{s},n}$ satisfying some simple multigrade conditions. This allow to clarify the structure of free Lie $K$ -algebras inductively. (In particular, Proposition C.4 can be proven.) We do not pursue this approach, because if it comes to elimination, then it is just simpler to use noncommutative polynomials.

Regarding the pattern of eliminations, we remark that we could have left $X_{1},\ldots,X_{n-1}$ intact, but substituted $\partial_{n}$ into $X_{n}$ . As $P$ can be expressed as a Lie-polynomial of some $\boldsymbol{[}X_{n},\ldots,X_{n},X_{j}\boldsymbol{]}$ ( $j<n$ ), the result is a Lie-polynomial of some $X_{n,\ldots,n,j}$ , etc. In fact, this is the traditional Lazard–Shirshov elimination process, cf. Širšov [30], Reutenauer [26]. (Or, we could have eliminated other subsets of variables.) It is merely the preference of the author to eliminate not one but all but one variables. ∎

Appendix C About free Lie algebras. Version 2

Elimination by polynomials. Consider the noncommutative polynomial algebra $\mathrm{F}_{K}[X,E_{1}\ldots E_{n}]$ . Let $\theta$ be the operation which sends the monomial

[TABLE]

into the polynomial

[TABLE]

Lemma C.1.

The map $\theta$ leaves the multigrading of $\mathrm{F}_{K}[X,E_{1}\ldots E_{n}]$ invariant. It acts as an isomorphism in every multigrade.

Proof.

It is obvious that the multigrading is left invariant. If $A$ is an alphabet with ordering $\preccurlyeq$ , then let $\preccurlyeq^{\operatorname{mlex}}$ be the ordering on the words of $A$ such that longer words are greater, and equally long words are ordered lexicographically. Now let $\leq$ be an arbitrary ordering on the alphabet $\{E_{1},\ldots,E_{n}\}$ . To any monomial (21) we assign the word of words

[TABLE]

Let us order the monomials (21) in the order (22) with respect to $(\leq^{\operatorname{mlex}})^{\operatorname{mlex}}$ . Then it is easy to see that in that basis the action of $\theta$ is triangular with $1$ ’s in the diagonal, thus it is an isomorphism. ∎

Corollary C.2.

Suppose that $P(X_{1},\ldots,X_{r})$ is a noncommutative polynomial over $K$ , and assume that the noncommutative polynomial

[TABLE]

yet the monomials

[TABLE]

are different from each other. Then

[TABLE]

In fact, we can also prove the following stronger statement. A divided noncommutative polynomial $P(X_{1},\ldots,X_{r}|Y_{1},\ldots,Y_{n})$ is a noncommutative polynomial where the monomials are of shape $X_{i_{1}}\ldots X_{i_{p}}Y_{j_{1}}\ldots Y_{j_{q}}$ .

Corollary C.2’.

Suppose that $P(X_{1},\ldots,X_{r}\mid Y_{1},\ldots Y_{n})$ is a noncommutative polynomial over $K$ , and assume that the noncommutative polynomial

[TABLE]

yet the monomials

[TABLE]

are different from each other. Then

[TABLE]

Proof.

Let us apply the isomorphism $\theta^{-1}$ . This gives

[TABLE]

But then the difference in the $E$ -monomials implies $P(X_{1},\ldots,X_{r}\mid Y_{1},\ldots Y_{n})=0$ . ∎

Proposition C.3.

$\mathrm{F}^{\operatorname{Lie}}_{K}[X_{\lambda}:\lambda\in\Lambda]$ * embeds to the noncommutative polynomial algebra $\mathrm{F}_{K}[X_{1},\ldots,X_{n}]$ by the commutator-evaluation.*

Proof.

We have to prove that if $P\in\mathrm{F}^{\operatorname{Lie}}_{K}[X_{1},\ldots,X_{n}]$ evaluates to [math] in the commutator expansion in $\mathrm{F}_{K}[X_{1},\ldots,X_{n}]$ , then $P$ simplifies to [math] in $\mathrm{F}^{\operatorname{Lie}}_{K}[X_{1},\ldots,X_{n}]$ . We can assume that $P$ is expanded to Lie-monomials, thus it is represented by a non-associative polynomial $P^{\operatorname{n-a}}$ . In that viewpoint, we have to prove that if $P^{\operatorname{n-a}}\in\mathrm{F}^{\operatorname{n-a}}_{K}[X_{1},\ldots,X_{n}]$ evaluates to [math] in the commutator expansion in $\mathrm{F}_{K}[X_{1},\ldots,X_{n}]$ , then $P^{\operatorname{n-a}}$ can be simplified to [math] using Lie rules. We prove the statement by induction on the maximal length $\deg(P^{\operatorname{n-a}})$ of the $\boldsymbol{[},\boldsymbol{]}$ -monomials in $P^{\operatorname{n-a}}$ . If $\deg P^{\operatorname{n-a}}=0$ , then the statement is obvious. Let us gather the terms of $P^{\operatorname{n-a}}$ into groups $P_{\omega}^{\operatorname{n-a}}$ corresponding to multigrades. The various $P_{\omega}^{\operatorname{n-a}}$ expand to different multigrades in $\mathrm{F}_{K}[X_{1},\ldots,X_{n}]$ , thus the various $P_{\omega}^{\operatorname{n-a}}$ must also expand to [math] in $\mathrm{F}_{K}[X_{1},\ldots,X_{n}]$ independently. Thus it is sufficient to consider the cases $P_{\omega}^{\operatorname{n-a}}$ separately. We can assume that $P^{\operatorname{n-a}}_{\omega}$ has monomials with variables $X_{1},\ldots,X_{n}$ with multiplicities $i_{1},\ldots,i_{n}\geq 1$ respectively. If $n=1$ , then the statement is very easy: in the $i_{1}=1$ case the commutator expansion is identical, in the $i_{1}>1$ case $P_{\omega}^{\operatorname{n-a}}$ obviously reduces to [math] using Lie rules. So, assume $n\geq 2$ . Then by standard Lie rules we can expand $P_{\omega}^{\operatorname{n-a}}$ to a Lie-polynomial of some $\boldsymbol{[}X_{j_{1}},\ldots,X_{j_{p}},X_{n}\boldsymbol{]}_{\mathrm{L}}$ ( $j_{k}<n$ ) but so that formally the multiplicities of the variables remain. Thus

[TABLE]

where the sequences $(X_{j_{1,1}},\ldots,X_{j_{1,p_{1}}}),\ldots,(X_{j_{s,1}},\ldots,X_{j_{r,p_{r}}})$ are different from each other, while the multiplicities of the variables $X_{i}$ on the two sides are the same. Nevertheless, the RHS of (23) must also expand to [math] in the commutator evaluation. But then according to Corollary C.2, $Q^{\operatorname{n-a}}_{\omega}(Y_{1},\ldots,Y_{r})$ also expands to [math] in the commutator expansion. Now $\deg Q^{\operatorname{n-a}}_{\omega}=i_{n}<\deg P^{\operatorname{n-a}}$ due to the multiplicity structure, thus by induction we know that $Q^{\operatorname{n-a}}_{\omega}(Y_{1},\ldots,Y_{r})$ expands to [math] using Lie rules. But this implies that the RHS of (23) simplifies to [math] using Lie rules. So, consequently, also the LHS of (23). ∎

Then $\mathrm{F}^{\operatorname{Lie}}_{K}[X_{1},\ldots,X_{n}]$ is multigraded induced from the multigrading in $\mathrm{F}_{K}[X_{1},\ldots,X_{n}]$ through commutator evaluation.

Proposition C.4.

(The uniformity of free Lie $K$ -algebras.)

(i) $\mathrm{F}^{\operatorname{Lie}}_{K}[X_{1},\ldots,X_{n}]$ is a free $K$ -module (in every multigrade). In fact, we can choose a set of $\boldsymbol{[},\boldsymbol{]}$ -monomials which acts as a basis (in every multigrade), independently from $K$ .

(ii) $\mathrm{F}^{\operatorname{Lie}}_{K}[X_{1},\ldots,X_{n}]\simeq\mathrm{F}^{\operatorname{Lie}}_{\mathbb{Z}}[X_{1},\ldots,X_{n}]\otimes K$ naturally.

Proof.

(i) We can assume that in a multigrade we have variables $X_{1},\ldots,X_{n}$ with multiplicities $i_{1},\ldots,i_{n}\geq 1$ respectively. We proceed by induction on the degree $i_{1}+\ldots+i_{n}=i$ . (Due to the previous statement we will use the terms Lie-polynomial and commutator polynomial synonymously.) If $i=0$ , then the statement is obvious. Assume that $i\geq 1$ . If $n=1$ , then the statement is trivial. So assume $n\geq 2$ . Using standard Lie rules, any Lie-polynomial $P$ of multigrade $X_{1}^{i_{1}}\ldots X_{n}^{i_{n}}$ can be written in form

[TABLE]

such that $Q(Y_{1},\ldots,Y_{r})$ is a Lie-polynomial, and the sequences $(X_{j_{k,1}},\ldots,X_{j_{k,p_{k}}})$ run through every word of length at most $i$ made from $\{X_{1},\ldots,X_{n-1}\}$ .

Regarding the multigrade structure of $Q$ , not every multigrade $Y_{1}^{j_{1}}\ldots Y_{r}^{j_{r}}$ is allowed to appear nontrivially. But if an multigrade $Y_{1}^{j_{1}}\ldots Y_{r}^{j_{r}}$ is allowed, then every commutator polynomial of multigrade $Y_{1}^{j_{1}}\ldots Y_{r}^{j_{r}}$ is allowed to be used in $Q$ . Thus we obtain that $P$ is of shape

[TABLE]

such that $Q_{Y_{1}^{j_{1}}\ldots Y_{r}^{j_{r}}}$ is of multigrade $Y_{1}^{j_{1}}\ldots Y_{r}^{j_{r}}$ . On the other hand, this description is unique in terms of the commutator polynomials $Q_{Y_{1}^{j_{1}}\ldots Y_{r}^{j_{r}}}$ due to Corollary C.2. Hence, the situation decomposes in allowed multigrades. Thus, in particular, if

[TABLE]

form systems of base monomials in the allowed multigrades $Y_{1}^{j_{1}}\ldots Y_{r}^{j_{r}}$ , then the elements

[TABLE]

form a system of base monomials for multigrade $X_{1}^{i_{1}}\ldots X_{n}^{i_{n}}$ . However, for allowed multigrades, $\deg Y_{1}^{j_{1}}\ldots Y_{r}^{j_{r}}=i_{n}<\deg X_{1}^{i_{1}}\ldots X_{n}^{i_{n}}$ , so by induction we have monomial bases in allowed multigrades. It is also clear that this process can be made independent from the actual coefficients $K$ . (ii) This is transparent form the fact that formally the same monomial base can be chosen, independently from $K$ . ∎

Thinking algorithmically, the method described by (24)–(25) allows to construct bases rather easily. In fact, there are several choices due to the arbitrariness of the labeling the variables $Y_{k}$ . Another thing is that we descended using simple $[]_{\mathrm{L}}$ -commutators but even those can be twisted by some multidegree-compatible maps on noncommutative polynomials. Due to this wealth of possibilities, free Lie algebra bases are interesting only as long as they have some additional combinatorial properties. E. g., accountability with respect to the PBW theorem.

Appendix D Related to $\mu$ II

Considering $\mathcal{U}\mathrm{F}_{\mathbb{Q}}^{\operatorname{Lie}}[X_{1}\ldots,X_{n}]\simeq\mathrm{F}_{\mathbb{Q}}[X_{1}\ldots,X_{n}]$ , as $\boldsymbol{m}_{\Sigma}$ inverts $\boldsymbol{\mu}_{\Sigma}$ , we see that in the noncommutative polynomial algebra

[TABLE]

Taking this for any subsequences of $X_{1},\ldots,X_{n}$ , and summing up, we find that

[TABLE]

i. e. modulo monomials where some variable has multiplicity more than $1$ . Taking logarithm, we find

[TABLE]

i. e. modulo monomials where some variable has multiplicity more than $1$ . This implies that

[TABLE]

Formally,

[TABLE]

Remark D.1.

According to a simple combinatorial argument of Strichartz [35], which we do not reproduce here (cf. also [18]), (27) quickly implies that in (5)

[TABLE]

where $\operatorname{asc}(\sigma)$ denotes the number of ascents, i. e. the number of pairs such that $\sigma(i)<\sigma(i+1)$ ; and $\operatorname{des}(\sigma)$ denotes the number of descents, i. e. the number of pairs such that $\sigma(i)>\sigma(i+1)$ . (This is originally a result of Solomon [33] and Mielnik, Plebański [23].) In conjunction to (6), (7), (10), (11), this results several explicit formulas for $\mu_{n}$ . Taking (26) into account, this also allows to obtain the coefficients $b_{Ii}$ in (4). ∎

Substituting $X$ to the first $r$ many variables, and $Y$ to the last $n-r$ many variables, we find that

[TABLE]

Inspecting the power series in $(t_{1}+\ldots+t_{r})$ and $(\tau_{1}+\ldots+\tau_{n-r})$ , we can quickly identify the coefficients of $t_{1}\ldots t_{r}$ and $\tau_{1}\ldots\tau_{s}$ , respectively. This yields

[TABLE]

As a consequence, regarding to the (formal) Taylor series of $\log(\exp(tX)\exp(\tau Y))$ around $(t,\tau)=(0,0)$ , evaluated at $(t,\tau)=(1,1)$ , one finds

[TABLE]

In particular, we find that the Baker–Campbell–Hausdorff terms are commutator polynomials:

[TABLE]

(This is the viewpoint of Magnus [21], Chen [7], Cartier [5] on the BCH formula.) One obtains the full expansion of $\log(\exp(X_{1})\ldots\exp(X_{n}))$ analogously.

Once we know that the components of $\log(\exp(X)\exp(Y))$ are commutator polynomials (which can also be shown in other ways), we can apply the Dynkin–Specht–Wever lemma to (the homogeneous parts) of the power series expansion

[TABLE]

In this standard manner, commas in $[]_{\mathrm{L}}$ omitted, we obtain

[TABLE]

the formula of Dynkin [13].

Some works, e. g. Kolář, Michor, Slovák [17], or Duistermaat, Kolk [12] present

[TABLE]

as the BCH formula/ “Dynkin’s formula”, which they prove by differential equational/ geometric means, but formally just by using the old Schur(–Poincaré) argument

[TABLE]

(Cf. Schur [28], Poincaré [25], Duistermaat [11], Bonfiglioli, Fulci [3] Ch. 1, and references therein. This is also the line of reasoning which leads to the natural derivations of the (R) and (L) recursions in Proposition/Definition 2.1, cf. Magnus [21] and [18].)

Now, (32) can also be realized algebraically from (30) but by applying the weighted Dynkin–Specht–Wever lemma with weight prescription $\deg X=1$ , $\deg Y=0$ : The part when the total weight is [math] can be seen to be $Y$ easily. Then relabel $k$ to $j+1$ , and notice that only the $q_{j+1}=0$ , $p_{j+1}=1$ part survives weighting and commutatoring, respectively.

One can also apply the weight prescription $\deg X=0,\deg Y=1$ . Another possibility is to apply $\log((\exp X)(\log Y))=-\log((\exp-Y)(\exp-X))$ , which also corresponds to the rewriting of the $[]_{\mathrm{R}}$ -version to $[]_{\mathrm{L}}$ -terminology. Altogether, this yields six formulas of Dynkin type. These formulas (in power series form) are all highly redundant, though. This is due to the particular inefficiency of expansion (30) and the general nature of commutators. Nevertheless, one can do (naive) convergence estimates as usual.

Bibliography39

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Arnal, Ana; Casas, Fernando; Chiralt, Cristina: A general formula for the Magnus expansion in terms of iterated integrals of right-nested commutators J. Phys. Commun. 2 (2018) 035024
2[2] Birkhoff, Garrett: Representability of Lie algebras and Lie groups by matrices. Ann. of Math. 38 (1937), 526–532.
3[3] Bonfiglioli, Andrea; Fulci, Roberta: Topics in noncommutative algebra. The theorem of Campbell, Baker, Hausdorff and Dynkin. Lecture Notes in Mathematics, 2034. Springer-Verlag; Berlin, Heidelberg, 2012.
4[4] Bourbaki, Nicolas: Groupes et algebres de Lie. Chapitre 1: Algebres de Lie. Hermann, Paris, 1960.
5[5] Cartier, P.: Démonstration algébrique de la formule de Hausdorff. Bull. Soc. Math. France 84 (1956), 241–249.
6[6] Cartier, P.: Remarques sur le théoreme de Birkhoff-Witt. Ann. Scuola Norm. Sup. Pisa (sér. 3) 12 (1958) 1–4.
7[7] Chen, Kuo-Tsai: Integration of paths, geometric invariants and a generalized Baker-Hausdorff formula. Ann. of Math. 65 (1957), 163–178.
8[8] Chen, K.-T.; Fox, R. H.; Lyndon, R. C. Free differential calculus, IV. The quotient groups of the lower central series. Ann. of Math. 68 (1958), 81–95.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Some proofs of the Poincaré–Birkhoff–Witt theorem and related matters

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

1. Introduction

2. The existence of μ\muμ I

Proposition/Definition 2.1**.**

Proof.

Remark 2.2**.**

Proposition 2.3**.**

Proof.

3. From μ\muμ to the symmetric PBW theorem

Definition 3.1**.**

Proposition/Definition 3.2**.**

Proof.

Lemma 3.3**.**

Proof.

Proposition 3.4**.**

Proof.

4. The existence of μ\muμ II

Lemma 4.1**.**

Proof.

Proposition 4.2**.**

Proof.

Definition 4.3**.**

Proposition 2.3.

Proof.

5. Related to μ\muμ I

Proposition 5.1**.**

Proof.

Proposition 5.2**.**

Proof.

Proposition 5.3**.**

Proof.

6. Ug\mathcal{U}\mathfrak{g}Ug as a direct construction

Proposition 6.1**.**

Proof.

Proposition 6.2**.**

Proof.

7. FKLie⁡\mathrm{F}_{K}^{\operatorname{Lie}}FKLie​ via FQLie⁡\mathrm{F}_{\mathbb{Q}}^{\operatorname{Lie}}FQLie​

Proposition 7.1**.**

Sketch of proof.

Proposition 7.2**.**

Proof.

8. The FKLie⁡\mathrm{F}_{K}^{\operatorname{Lie}}FKLie​ case directly

9. From FKLie⁡\mathrm{F}_{K}^{\operatorname{Lie}}FKLie​ to the basic PBW theorem

Proposition 9.1**.**

Proof.

10. Conclusions

Appendix A The Witt–Lazard proof of the global PBW theorems

Lemma A.1**.**

Proof.

Corollary A.2**.**

Proof.

Lemma A.3**.**

Proof.

Remark A.4**.**

Remark A.5**.**

Corollary A.6**.**

Proof.

Appendix B About free Lie algebras. Version 1

Proposition B.1**.**

Proof.

Corollary B.2**.**

Proof.

Proposition B.3**.**

Proof.

Proposition B.4**.**

Proof.

Remark B.5**.**

Appendix C About free Lie algebras. Version 2

Lemma C.1**.**

Proof.

2. The existence of $\mu$ I

Proposition/Definition 2.1.

Remark 2.2.

Proposition 2.3.

3. From $\mu$ to the symmetric PBW theorem

Definition 3.1.

Proposition/Definition 3.2.

Lemma 3.3.

Proposition 3.4.

4. The existence of $\mu$ II

Lemma 4.1.

Proposition 4.2.

Definition 4.3.

5. Related to $\mu$ I

Proposition 5.1.

Proposition 5.2.

Proposition 5.3.

6. $\mathcal{U}\mathfrak{g}$ as a direct construction

Proposition 6.1.

Proposition 6.2.

7. $\mathrm{F}_{K}^{\operatorname{Lie}}$ via $\mathrm{F}_{\mathbb{Q}}^{\operatorname{Lie}}$

Proposition 7.1.

Proposition 7.2.

8. The $\mathrm{F}_{K}^{\operatorname{Lie}}$ case directly

9. From $\mathrm{F}_{K}^{\operatorname{Lie}}$ to the basic PBW theorem

Proposition 9.1.

Lemma A.1.

Corollary A.2.

Lemma A.3.

Remark A.4.

Remark A.5.

Corollary A.6.

Proposition B.1.

Corollary B.2.

Proposition B.3.

Proposition B.4.

Remark B.5.

Lemma C.1.

Corollary C.2.

Corollary C.2’.

Proposition C.3.

Proposition C.4.

Appendix D Related to $\mu$ II

Remark D.1.