This paper defines quadratic forms on bimodules and proves a classification theorem for subgroups of the general linear group normalized by the elementary unitary group, under certain conditions.
Contribution
It introduces a new definition of quadratic forms on bimodules and establishes a sandwich classification theorem for specific subgroup structures.
Findings
01
Classification theorem for subgroups normalized by elementary unitary groups
02
Extension of quadratic form theory to bimodules
03
Conditions for hyperbolic parts in bimodules
Abstract
We will give a definition of quadratic forms on bimodules and prove the sandwich classification theorem for subgroups of the general linear group GL(P) normalized by the elementary unitary group EU(P) if P is a nondegenerate bimodule with large enough hyperbolic part.
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
We will give a definition of quadratic forms on bimodules and prove the sandwich classification theorem for subgroups of the general linear group GL(P) normalized by the elementary unitary group EU(P) if P is a nondegenerate bimodule with large enough hyperbolic part.
1 Introduction
The classical groups over fields, i. e. linear, orthogonal and symplectic groups (and their twisted forms, including all infinite families of simple Lie groups) were studied for quite a long time. In Emil Artin’s book [1] they are constructed from the point of view of projective geometry. Many properties of these groups over arbitrary division ring, such as normal subgroups, automorphisms and presentations, are written in Dieudonné’s papers [11, 13, 12].
Later the definitions of classical groups were generalized to the case of arbitrary underlying commutative rings. The most complicated case of orthogonal groups can be found in famous book [20]. Many authors proposed a general definition of unitary groups that should include all classical groups considered before and that should work for arbitrary non-commutative rings (with involutions). The most important papers of this type are [30, 41, 42].
Nevertheless, all definitions of generalized unitary groups were unsatisfactory (first of all, they were different for rings with invertible 2 and for rings of characteristic 2). In his dissertation [2] Antony Bak gave the definition of form parameters for a non-commutative ring with an involution and used this to define the unitary group as the family of all module automorphisms that preserve a hermitian form and a quadratic form (defined through a form parameter) simultaneously. The definition allowed to transfer all results of the classical theory over division rings to the general case of even groups (this means the families Al, Cl, and Dl for reductive group schemes) for those problems that have any sense in such generality. All these results were published in H. Bass’s paper [10].
Later Victor Petrov proposed a generalization of form parameters and corresponding groups to the odd case in paper [24]. Odd form parameters introduced there allows to give the right definition of unitary groups in full generality and to transfer many results about even groups to them. Unfortunately, odd form parameters in his definition depend on a module itself. This does not allow to construct the category of all such modules and, for example, to define an orthogonal sum operation on the category.
First several sections of the current paper, i. e. from 2 to 6, further generalize these ideas. For a give hermitian form we will define corresponding quadratic forms with values in special algebraic objects called quadratic structures (which generalize factors by a form parameter in Bak’s theory or factor-groups of the Heisenberg groups by an odd form parameter in Petrov’s theory). Moreover, our quadratic forms will be constructed not for right modules, but for bimodules, as in books [3, 20] in the case of hermitian forms. All triples consisting of a bimodule, a hemitian form and a corresponding quadratic form constitute a bicategory (with respect to tensor products) satisfying natural properties.
One of the most important problems about unitary groups is the classification of their normal subgroups (it is so-called sandwich classification because of the type of results). In the case of groups over division rings and non-degenerate forms these results are classical, including the simplicity theorems of the corresponding projective groups. In the case of general rings the problem is closely related with lower unitary K-theory (that is, generalizing of algebraic K-functors from linear to unitary groups)
A lot of papers deal with the sandwich classification of normal subgroups of classical groups (i. e. symplectic and orthogonal groups) and their generalizations. We pick out only [43, 14, 21, 22, 33, 34, 35]. Later these results were generalized for unitary Bak’s groups in [2, 26] and even for odd Petrov’s groups in [6]. Also Raimund Preusser gave new short proofs of the sandwich classification of normal subgroups in the case of commutative rings in his recent papers [25, 27].
We should also mention the construction and the proof of normality of the elementary unitary group, since this is a basis for sandwich classification theorems. For the linear groups over commutative rings this is a famous Suslin’s normality theorem [28] and in the case of unitary groups there are separate papers [7, 8] concerning this problem.
Lower unitary K-theory is a theme of many papers. Let us mention only [10, 9, 16, 29], overview [17], and book [15]. Similar results for classical groups were published in [31, 32].
Another problem of big interest is the sandwich classification of intermediate subgroups between various classical groups, for example, between Sp(2l,K) and GL(2l,K). If K is a field, then the problem leads to the description of many maximal subgroups of classical groups. The sandwich classification theorems for overgroups of classical groups in the corresponding general linear group were proved for an arbitrary commutative ring K in papers [37, 38, 39, 18, 19]. Nowadays similar problems for various pairs of group schemes are a popular theme of research in St. Petersburg mathematical school, this is written in overviews [36] and [40]. In the case of Bak’s unitary groups the sandwich classification for overgroups was obtained in Petrov’s paper [23].
The mail goal of this paper is to generalize all these results for odd unitary groups over a bimodule P with hermitian and quadratic forms. More precisely, we a going to classify all subgroups of the general linear groups GL(P) normalized by the elementary unitary group EU(P). The exact statement is given below, all necessary definitions will given in the main text.
Theorem**.**
Suppose that K is a commutative ring with an involution, S and R are quadratic K-algebras, and P is a quadratic S-R-bimodule with non-degenerate hermitian form. Suppose also that End(P) is a quasi-finite K-algebra, P=P0⊥H(P1)⊥…H(Pl), l⩾4, End(P0) is generated by less than 4l2 elements as a K-module, and the bimodule Pi is isomorphic to a direct summand of PjN for N big enough and for any i,j∈{−l,…,−1,1,…,l} (where P−i=Pi∨d for i>0). Then there exists explicitly described levels L and groups EU(P,L),GU(P,⌊L⌋)⩽GL(P) such that every subgroup G⩽GL(P) normalized by EU(P) satisfies the inequality EU(P,L)⩽G⩽GU(P,⌊L⌋) for unique level L. Conversely, every group satisfying such an inequality is normalized by EU(P).
In sections 7–9 we will give the definitions of elementary unitary groups, levels and level subgroups. We will also prove their simplest properties. These lemmas are interested not only because of the main theorem, but also since they may help to prove other results about unitary groups, such as lower K-theory.
The proof of the sandwich classification theorem occupies sections 10–12. The main techniques are the absolute Noetherian reduction and the extraction of transvections taken and modified from paper [6]. The sufficient information about inductive limits used (i. e. about quasi-finite algebras over commutative rings) can be found in [4]. Further development of these ideas is in [5], including the localization-completion method applied to unitary K-theory.
In the final section we will derive several known classification theorems. These theorems include the sandwich classification of overgoups of symplectic and split orthogonal groups in the general linear group, and also the classification of normal subgroups of split orthogonal and symplectic groups.
The author wants to express his gratitude to his advisor Nikolai Vavilov and also to Antony Bak for ideas that lead to the definition of quadratic forms on bimodules.
The research is supported by «Native towns», a social investment program of PJSC «Gazprom Neft».
2 General notions
We will assume that all ring to be considered are associative with unity. For an arbitrary ring R let R∗ be the group of invertible elements in R, R∙ — the multiplicative monoid of R (i. e. the set R with multiplication and unity), C(R) — the center of R. The center of a group G will also be denoted as C(G). If H⩽G a subgroup, then NG(H) and CG(H) are the corresponding normalizer and centralizer. For a commutative ring K with involution k↦kd let H(K)={k∈K∣k=kd} be the subring of symmetric elements.
Recall the standard group theoritic identities [xy,z]=x[y,z][x,z] and [x,yz]=[x,y]y[x,z], which will be used further. If G is a group and F,H⩽G are subgroups, then [F,H] and FH are the subgroups of G generated by elements [f,h] and fh for all f∈F and h∈H. Clearly, FH=[F,H]H.
We will also use the language of bicategories. Let us sketch one of possible definitions of these objects. A bicategory B is a set of its objects Ob(B) such that for all X,Y∈Ob(B) a category B(X,Y) is given. These categories are called categories of morphisms from X to Y, morphisms in them are called 2-morphisms. Moreover, there are composition polyfunctors ⊗:B(Xn−1,Xn)×…×B(X0,X1)→B(X0,Xn) in order to multiply morphisms for all n⩾0 and X0,…,Xn∈Ob(B). Finally, these polyfunctors are coherent up to certain natural isomorphisms that are also coherent through some identitites. The [math]-ary compositions (i. e. the identity morphisms) are also denoted by IX∈B(X,X).
Let BimK be the bicategory of K-algebras and bimodules, where K is a commutative ring. Objects in BimK are K-algebras, morphisms from R to S are those S-R-bimodules for which left and right K-module structures coincide, 2-morphisms are bimodule homomorphisms. The composition of bimodules is given by the tensor product and neutral morphisms are the bimodules RRR.
An adjunction in a bicategory B is a pair of morphisms f∈B(Y,X) and g∈B(X,Y) with 2-morphisms ε:f⊗g→IX and η:IY→g⊗f such that ε and η satisfy the ordinary unit and counit equations of an adjunction. An adjunction is called an adjoint equivalence if ε and η are isomorphisms.
3 Hermitian bimodules
Most part of this section is taken from books [3] and [20].
A pseudo-involution with symmetry λ∈R∗ on a ring R is a map (−)d:R→R,a↦ad such that r+r′d=rd+r′d, 1d=1, rr′d=r′drd, and rdd=λrλd. It is easy to see that in this case λd=λ−1. If λ=1, then we obtain an involution in the usual sense. If R is a K-algebra for commutative ring K with involution, then we also will assume that f(kd)=f(k)d, where f:K→R is the structure homomorphism.
Suppose that K is a commutative ring with involution. By iBimK we denote the bicategory of K-algebras with pseudo-involutions and bimodules over them. We omit the index K in the case K=Z.
Let R and S be rings with pseudo-involutions and SMR be a bimodule. One can construct a bimodule RMdS={md∣m∈M} with operations md+m′d=m+m′d,rdmdsd=smrd. Clearly, there exists coherent isomorphisms
[TABLE]
and they form a contravariant bifunctor iBimK→iBimK for all K (which is an identity on objects). In this case we also have isomorphisms HomR(SMR,TNR)d≅RHom(Md,Nd), RHom(RMS,RNT)d≅HomR(Md,Nd). Finally, it is easy to see that there are natural isomorphisms SMRdd≅SMR,mdd↦λSmλRd.
A sesquilinear form B on a bimodule SMR is a biadditive map B:M×M→R such that B(mr,m′r′)=rdB(m,m′)r′ and B(sm,m′)=B(m,sdm′). Equivalently, B is a bimodule homomorphism Md⊗SM→R. The form B is called nondegenerate (or regular) if MR is finitely generated projective and B induces a bimodule isomorphism B:Md≅HomR(M,R).
A sesquilinear form B is called hermitian if the composition Md⊗SM≅Md⊗SMd→BdRd≅R equals B. In the language of elements it means that B(m,λSm′)=B(m′,m)dλR. Such a pair (M,B) will be called a hermitian bimodule (and a hermitian space if B is nondegenerate).
Let (SM1R,BM1), …, (SMnR,BMn) be hermitian bimodules. The orthogonal sum of them is the hermitian bimodule \mathop{\mathchoice{\vbox{\hbox{\scalebox{2.1}{\displaystyle\perp}}}}{\vbox{\hbox{\scalebox{1.5}{\textstyle\perp}}}}{\vbox{\hbox{\scalebox{1.5}{\scriptstyle\perp}}}}{\vbox{\hbox{\scalebox{1.5}{\scriptscriptstyle\perp}}}}}\displaylimits_{i=1}^{n}(M_{i},B_{M_{i}})=(\bigoplus_{i=1}^{n}M_{i},\mathop{\mathchoice{\vbox{\hbox{\scalebox{2.1}{\displaystyle\perp}}}}{\vbox{\hbox{\scalebox{1.5}{\textstyle\perp}}}}{\vbox{\hbox{\scalebox{1.5}{\scriptstyle\perp}}}}{\vbox{\hbox{\scalebox{1.5}{\scriptscriptstyle\perp}}}}}\displaylimits_{i=1}^{n}B_{M_{i}}), where (\mathop{\mathchoice{\vbox{\hbox{\scalebox{2.1}{\displaystyle\perp}}}}{\vbox{\hbox{\scalebox{1.5}{\textstyle\perp}}}}{\vbox{\hbox{\scalebox{1.5}{\scriptstyle\perp}}}}{\vbox{\hbox{\scalebox{1.5}{\scriptscriptstyle\perp}}}}}\displaylimits_{i=1}^{n}B_{M_{i}})((m_{i})_{i=1}^{n},(m^{\prime}_{i})_{i=1}^{n})=\sum_{i=1}^{n}B_{M_{i}}(m_{i},m^{\prime}_{i}). It is easy to see that the orthogonal sum is a monoidal operation and the zero bimodule is a neutral element. If BMi are nondegenerate, then their orthogonal sum is also nondegenerate.
The tensor product of hermitian bimodules (R0M1R1,BM1), …, (Rn−1MnRn,BMn) is the hermitian bimodule ⨂i=1n(Ri−1MiRi,BMi)=(⨂i=1nMi,⨂i=1nBMi), where (⨂i=1nBMi)(⨂i=1nmi,⨂i=1nmi′)=BMn(mn,…BM1(m1,m1′)…mn′) (and the form is biadditive). These tensor products are coherent with neutral elements (R,B1), where B1(r,r′)=rdr′. Hence one can construct the bicategory hBimK of K-algebras with pseudo-involutions, hermitian bimodules and isometries (a homomorphism f:M→N is called an isometry if BN(f(m),f(m′))=BM(m,m′)).
Recall that all adjunctions in BimK are of type (RP∨S,SPR) up to isomorphisms, where ε:P∨⊗SP→R and η:S→EndR(P)≅P⊗RP∨ are the canonical homomorphisms, PR is finitely generated projective and P∨=HomR(P,R). The adjunction is an adjoint equivalence (or Morita equivalence) iff PR is fully projective (i. e. P is finitely generated projective and R is a direct summand of Pn for n big enough) and S≅EndR(P).
Lemma 1**.**
Let (TMS,BM) and (SNR,BN) be hermitian bimodules, MS and NR are finitely generated projective. If BM and BN are nondegenerate, then BM⊗BN is nondegenerate. Conversely, if BM⊗BN is nondegenerate, MS is fully projective, NR is fully projective and S≅EndR(N), then BM and BN are nongenerate.
Proof.
Clearly, (M⊗SN)R is finitely generated projective. Note that BM⊗BN can be expressed as the composition
[TABLE]
and the first claim follows.
To prove the second claim note that SMd, S(HomS(M,S)), NdS and HomR(N,R)S are faithfully flat. Since the isomorphism BN⊗BM can be presented as Nd⊗SMd→id⊗BMNd⊗SHomS(M,S)→BN⊗idHomR(N,R)⊗SHomS(M,S) and as Nd⊗SMd→BN⊗idHomR(N,R)⊗SMd→id⊗BMHomR(N,R)⊗SHomS(M,S), both BM and BN are injective and surjective.
∎
Lemma 2**.**
Let PR be a finitely generated projective module over a ring with pseudo-involution, BP:P×P→R is a nondegenerate sesquilinear form (if we consider P as the bimodule KPR). Then S=EndR(P) has unique pseudo-involition such that B is hermitian, in this case (P∨,P) is an adjunction in hBimK (the hermitian form on P∨=HomR(P,R) is uniquely determined). All adjoint equivalences in hBimK are of this type for fully projective P (up to unique isomorphism).
Proof.
Let us construct the pseudo-involition on S. The necessary ans sufficient conditions for BP to be a hermitian form are equalities BP(sp,p′)=BP(p,sdp′) and BP(p,λSp′)=BP(p′,p)dλR for all p,p′∈P. Since BP is nondegenerate, for all s∈S the element sd is uniquely determined from the first equality. It is easy to see that s↦sd is an antihomomorphism of K-algebras and is consistent with the involution on K. Similarly, the element λS∈S∗ can be found from the second equality. It remains to prove that this is a pseudo-involution.
From equalities BP(p′,λSsp)=BP(sp,p′)dλR=BP(p,sdp′)dλR=BP(sdp′,λSp) it follows that sddλS=λSs. Besides, equalities BP(p,p′)=BP(p,p′)dλRdλR=BP(p′,λSp)dλR=BP(λSp,λSp′)=BP(p,λSdλSp′) imply λSd=λS−1, hence S has unique pseudo-involution.
In order to obtain an adjunction, it remains to construct an appropriate hermitian form on P∨. Let
[TABLE]
i. e. BP∨((p′↦BP(pλR,λSp′)),q)=p⊗q in the language of elements. A simple check shows that this is a hermitian form and the pair (P∨,P) is an adjunction (where ε:P∨⊗SP→R and η:S≅P⊗RP∨ are canonical). The uniqueness of such a form on P∨ follows from the general uniqueness theorem for an adjoint morphism to a given one.
If (Q,P) is an adjoint equivalence in hBimK, then it is also an adjoint equivalence in BimK, hence by lemma 1 both P and Q are hermitian spaces. Finally, Q≅P∨ by the uniqueness theorem for an adjoint morphism to a given one.
∎
Note that lemma 1 implies that nondegeneracy of hermitian forms is preserved under adjoint equivalences.
4 Quadratic structures
Before we define quadratic forms on hermitian bimodules, we should define the objects where quadratic forms maps to. As a motivating example, let us consider the module F2n (as a Z-F2-bimodule) and the quadratic form q(b1,…,bn)=∑i=1nbi2 on it. In turns out that the values of this form can be considered as elements of Z/4Z instead of F2, which has application in the coding theory in the definition of doubly even codes.
Let R be a ring with pseudo-involution. The monoid R∙ acts of R from the right by the formula r⋅r′=r′drr′. A quadratic structure on R is a triple (AR,φ,tr), where AR is a right R∙-module and φ:R→AR, tr:AR→R are R∙-module homomorphisms that satisfy the following axioms:
QS1.
φ(r)=φ(rdλR),
2. QS2.
tr(φ(r))=r+rdλR,
3. QS3.
tr(a)=tr(a)dλR,
4. QS4.
a⋅(r+r′)=a⋅r+φ(r′dtr(a)r)+a⋅r′.
Note that the axiom QS4 is symmetric with respect to r and r′ under assumptions QS1 and QS3. Besides, axioms QS3 and QS4 follows from QS1 and QS2 for a in the image of φ. The axioms imply the useful equality φ(tr(a))=a+a⋅(−1).
Below we will be interested not in ordinary rings with pseudo-involutions, but in algebras over a fixed commutative ring K (also with involution). Hence we need to generalize the notion of algebra for quadratic structures.
A quadratic ring is a commutative ring K with involution and quadratic structure AK such that AK itself is a commutative ring (with the additive structure from the K∙-module structure) satisfying additional axioms
QR1.
a(a′⋅k)=aa′⋅k,
2. QR2.
aφ(k)=φ(tr(a)k),
3. QR3.
tr(1)=1,
4. QR4.
tr(aa′)=tr(a)tr(a′).
Finally, a quadratic algebra over a quadratic ring (K,AK) is a K-algebra with pseudo-involution and quadratic structure (R,AR) such that AR is a left AK-module (with the same addition as in the R∙-module structure) satisfying the axioms
QA1.
aK(aR⋅r)=aKaR⋅r,
2. QA2.
(aK⋅k)aR=aKaR⋅k,
3. QA3.
aKφR(r)=φR(trK(aK)r),
4. QA4.
φK(k)aR=φR(ktrR(aR)),
5. QA5.
trR(aKaR)=trK(aK)trR(aR).
Morphisms between quadratic rings are defined in a natural way: morphism (K,AK)→(L,AL) is a pair of ring homomorphisms (f:K→L,g:AK→AL) such that g(aK⋅k)=g(aK)⋅f(k), g(φK(k))=φL(f(k)) and trL(g(aK))=f(trK(aK)). Clearly, such a morphism makes (L,AL) a quadratic algebra over (K,AK) (in particular, any quadratic ring is a quadratic algebra over itself). Conversely, if (L,AL) is a quadratic ring and simultaneously a quadratic algebra over (K,AK), and aK(aLaL′)=(aKaL)aL′ is satisfied, then there exists unique morphism (K,AK)→(L,AL) that induces this quadratic algebra structure. Moreover, any quadratic algebra over (L,AL) becomes a quadratic algebra over (K,AK) through a morphism (K,AK)→(L,AL) (this is exactly the restriction of scalars).
Let K be a commutative ring with involution. It turns out that K has the universal quadratic structure making it a quadratic algebra (more precisely, the initial quadratic structure in the category of all quadratic structures with this property), and all K-algebras with quadratic structures becomes quadratic algebras over it. Let the Heisenberg group of K be the set Heis(K)=K×K with the addition operation (x,y)∔(x′,y′)=(x+x′,y+y′−xdx′) (in general this operation is noncommutative, but it is always a group operation with inverses −˙(x,y)=(−x,−y−xdx)). This group has a right action of K∙ given by the formula (x,y)⋅k=(xk,kdyk) and also the multiplication (x,y)×˙(x′,y′)=(xx′,xdy′x+x′dyx′+yy′+ydy′), which is associative with neutral element 1˙=(1,0). Finally, there are group homomorphisms φ:K→Heis(K),k↦(0,k) and tr:Heis(K)→K,(x,y)↦xdx+y+yd.
Lemma 3**.**
The group AK=Heis(K)ab=Heis(K)/{(0,k−kd)} makes K a quadratic ring (all operations defined above are well-defined on AK). The pair (K,AK) is the universal quadratic algebra for K fixed. All algebras over K with quadratic structures have unique quadratic algebra structure over (K,AK).
Proof.
It is easy to see that ×˙ is distributive over ∔ from the left, {(0,k−kd)} is the commutant of Heis(K), and ×˙ is well-defined on AK=Heis(K)ab. Moreover, ×˙ is commutative on AK, hence AK is a commutative ring. Clearly, tr(h∔h′)=tr(h)+tr(h′) and tr(h×˙h′)=tr(h)tr(h′), hence tr:AK→K is well-defined ring homomorphism. It can be checked directly that φ and tr are K∙-linear and (K,AK) satisfies all quadratic ring axioms. If AR is a quadratic structure on a K-algebra R, then axioms QA2 and QA4 imply that (R,AR) had at most one quadratic algebra structure on (K,AK) (because AK=1⋅K∔φ(K)), and one can easily prove that this structure is well-defined. The universality property is obvious.
∎
As a corollary, any ring R with pseudo-involution and quadratic structure AR is a quadratic algebra over (Z,Heis(Z)) in a unique way. We have the ring isomorphism Heis(Z)≅Z[T]/(T2−2T), (x,y)↦x+(y+(2x))T.
5 Quadratic forms
Now we are able to define quadratic forms. Let (SMR,B) be a hermitian bimodule, (S,AS) and (R,AR) be quadratic algebras over a quadratic ring (K,AK). A map q:AS×M→AR is called a quadratic form on (M,B), if
The triple (M,B,q) will be called quadratic bimodule (and quadratic space, if B is nondegenerate). Note that QF7 is always satisfied in case AK=Heis(K)ab. Moreover, the axioms imply the relation q(aS,m)+q(aS,−m)=φR(B(m,trS(aS),m))
Now let us explain the relation between our definition and odd quadratic forms from paper [24]. First of all, we will reformulate our definition in the case of a right hermitian module (MR,B), where R is a quadratic algebra over K and B is a hermitian form on KMR.
Lemma 4**.**
There is a canonical one-to-one correspondence between quadratic forms on a right hermitian module (MR,B) and maps q:M→AR such that q(mr)=q(m)⋅r, q(m+m′)=q(m)+φR(B(m,m′))+q(m′), and trR(q(m))=B(m,m).
Proof.
Recall that we consider any right module MR as the bimodule KMR, where K is the base commutative ring. It is easy to see that if q is a quadratic form on the module KMR, then the map q:m↦q(1,m) satisfies the required identities. Conversely, any such a map q is obtained in this way from unique form q given by the formula q(aK,m)=aKq(m).
∎
In particular, this lemma implies that when we work with right modules we can forget about K and work only with the ring R with quadratic structure AR. It turns out that in some sense one can describe all possible structures and quadratic forms for a fixed module MR.
Recall the notions from paper [24]. If MR is a module with a hermitian form B, then its Heisenberg group Heis(B) is the set M×R with the operation (m,r)∔(m′,r′)=(m+m′,r+r′−B(m,m′)) (it is a group operation with inverses −˙(m,r)=(−m,−r−B(m,m))). The group Heis(B) has a right action of R∙ given by the formula (m,r)⋅r′=(mr′,r′drr′). Moreover, there are R∙-linear maps φ:R→Heis(B),r↦(0,r) and tr:Heis(B)→R,(m,r)↦B(m,m)+r+rdλR. Finally, there is a map q:M→Heis(B),m↦(m,0). A subgroup L⩽Heis(B) is called an odd form parameter, if it is closed under the action of R∙ and Lmin⩽L⩽Lmax, where Lmin={(0,r−rdλR)}, Lmax=Ker(tr) (it is true that Lmin⩽Lmax, these subgroups of Heis(B) are closed under the action of R∙, and Lmin contains the commutant Heis(B)).
Lemma 5**.**
The abelian group Heis(B)/L is a quadratic structure, a pair (R,Heis(B)/L) is a quadratic algebra over (K,AK), where AK=Heis(K)ab, and q induces unique quadratic form q on KMR such that q(1,m)=q(m)∈Heis(B)/L. Conversely, any quadratic form q on KMR with a quadratic structure on R can be obtained in this way (uniquely up to unique isomorphism), if AR is generated by the images of q and φR as an abelian group.
Proof.
All claims can be checked directly. The quadratic form q can be constructed by the explicit formula
[TABLE]
The odd form parameter L can be recovered as the kernel of R∙-linear map Heis(B)→AR,(m,r)↦q~(1,m)+φ(r), and the same map gives the isomorphism between Heis(B)/L and AR.
∎
If (M1,BM1,qM1), …, (Mn,BMn,qMn) are quadratic bimodules over (S,AS) and (R,AR), the their orthogonal sum is the quadratic bimodule \mathop{\mathchoice{\vbox{\hbox{\scalebox{2.1}{\displaystyle\perp}}}}{\vbox{\hbox{\scalebox{1.5}{\textstyle\perp}}}}{\vbox{\hbox{\scalebox{1.5}{\scriptstyle\perp}}}}{\vbox{\hbox{\scalebox{1.5}{\scriptscriptstyle\perp}}}}}\displaylimits_{i=1}^{n}(M_{i},B_{M_{i}},q_{M_{i}})=(\bigoplus_{i=1}^{n}M_{i},\mathop{\mathchoice{\vbox{\hbox{\scalebox{2.1}{\displaystyle\perp}}}}{\vbox{\hbox{\scalebox{1.5}{\textstyle\perp}}}}{\vbox{\hbox{\scalebox{1.5}{\scriptstyle\perp}}}}{\vbox{\hbox{\scalebox{1.5}{\scriptscriptstyle\perp}}}}}\displaylimits_{i=1}^{n}B_{M_{i}},\mathop{\mathchoice{\vbox{\hbox{\scalebox{2.1}{\displaystyle\perp}}}}{\vbox{\hbox{\scalebox{1.5}{\textstyle\perp}}}}{\vbox{\hbox{\scalebox{1.5}{\scriptstyle\perp}}}}{\vbox{\hbox{\scalebox{1.5}{\scriptscriptstyle\perp}}}}}\displaylimits_{i=1}^{n}q_{M_{i}}), where the quadratic form is given by the formula (\mathop{\mathchoice{\vbox{\hbox{\scalebox{2.1}{\displaystyle\perp}}}}{\vbox{\hbox{\scalebox{1.5}{\textstyle\perp}}}}{\vbox{\hbox{\scalebox{1.5}{\scriptstyle\perp}}}}{\vbox{\hbox{\scalebox{1.5}{\scriptscriptstyle\perp}}}}}\displaylimits_{i=1}^{n}q_{M_{i}})(a_{S},(m_{i})_{i=1}^{n})=\sum_{i=1}^{n}q_{M_{i}}(a_{S},m_{i}). Clearly, this is a monoidal operation (the zero bimodule is a neutral element). The orthogonal sum of quadratic spaces is a quadratic space.
One can also define the tensor product of quadratic bimodules. If (R0,AR0), …, (Rn,ARn) are quadratic algebras and (R0M1R1,BM1,qM1), …, (Rn−1MnRn,BMn,qMn) are quadratic bimodules, then their tensor product is ⨂i=1n(Mi,BMi,qMi)=(⨂i=1nMi,⨂i=1nBMi,⨂i=1nqMi), where
[TABLE]
This tensor product is associative up to isomorphism with neutral elements (R,B1,q1), where q1(aR,r)=aR⋅r. Therefore we obtain a bicategory qBimK, where objects are quadratic algebras over a quadratic ring (K,AK), morphisms are quadratic bimodules, and 2-morphisms are isometries (which preserve both hermitian and quadratic forms). Note that morphisms (K,AK)→(L,AL) induces the scalar restriction bifunctors qBimL→qBimK.
Proposition 1**.**
Let (PR,BP) be a finitely generated projective module with a nondegenerate sesquilinear form, (R,AR) be a quadratic algebra. Then S=EndR(P) has unique (up to unique isomorphism) quadratic structure with a quadratic form qP on P such that (P,BP,qP) has a left adjoint in qBimK. All adjoint equivalences are obtained in this way (up to unique isomorphism).
Proof.
The pseudo-involution on S is constructed uniquely by lemma 2, and this lemma also implies the second claim (after proving uniqueness in the first one). Let (Q,BQ) be the left adjoint to (P,QP) in hBimK (from now on P is an S-R-bimodule). Note that adjointness for quadratic forms means aS⋅(p⊗q)=qQ(qP(aS,p),q) and aR⋅⟨q,p⟩=qP(qQ(aR,q),p).
Let us start from existence. Let
[TABLE]
where qQ(aR,q) and φS(s) are formal symbols (from now on we can consider them as values of the maps qQ:AR×Q→AS and φS:S→AS). The abelian group AS has a right S∙-module structure given by the formulas φS(s)⋅s′=φS(s′dss′) and qQ(aR,q)⋅s=qQ(aR,qs) (this structure is well-defined). Moreover, let us define an additive map trS:AS→S by the formulas trS(φS(s))=s+sdλS and trS(qQ(aR,q))=BQ(q,trR(aR)q) (this map is also well-defined). Finally, AS has a well-defined structure of a left AK-module given by aKφS(s)=φS(trK(aK)s) and aKqQ(aR,q)=qQ(aKaR,q).
One can directly check that (S,AS) is a quadratic algebra and qQ is a quadratic form. The quadratic form on the bimodule P is given by the formulas qP(φS(s),p)=φR(BP(p,sp)) and qP(qQ(aR,q),p)=aR⋅⟨q,p⟩, it is also well-defined. Another check shows that this is a quadratic form and (Q,BQ,qQ) is the left adjoint to (P,BP,qP).
Finally, let us prove uniqueness. Let AS′ be another quadratic structure on S, qQ′ and qP′ be the corresponding quadratic forms. Clearly, there exists a natural map π:AS→AS′ such that all needed diagrams are commutative, and we have to prove that it is one-to-one. Note that a (Q,AS)-(Q,AS′)-bimodule of type (S,B1,q) with some quadratic form q is the same as a map f:AS→AS′ making the obvious diagrams commutative (it is true for any ring S with two quadratic structures). The map π is obtained from the bimodule (P,BP,qP)⊗R(Q,BQ,qQ′) in this way. But this bimodule has a quasi-inverse (P,BP,qP′)⊗R(Q,BQ,qQ) (their composition in any order has an isometry into the identity morphism, which is an isomorphism in hBimK, and, consequently, in qBimK). Therefore, π is an isomorphism (and it is clearly unique if we require commutativity of all necessary diagrams).
∎
6 Tensor products
Here we will give definitions of tensor products of quadratic algebras and their localizations with respect to a multiplicative subset of the base ring K. These notions are given in the sake of completeness and will not be used further in the paper.
Let R and S be algebras with pseudo-involutions over K. Their tensor product R⊗KS has a pseudo-involution r⊗sd=rd⊗sd with symmetry λR⊗λS. This operation is monoidal on iBimK with neutral element K. For any commutative algebra L with involution we have a well-defined functor L⊗K−:iBimK→iBimL (which can be called the scalar extension functor), it preserves the tensor product of algebras up to isomorphism. In particular, if S=Sd⊆K is a multiplicative subset and L=S−1K we have L⊗KR=S−1R, that is the scalar extension coincides with the localization. Note that the localization with respect to such S is the same as the localization with respect to {ssd∣s∈S}, hence we can consider only localizations with respect to subsets of H(K).
One can also define tensor products over K and scalar extension functors for the bicategory hBimK. The hermitian form on the tensor product of (SMR,BM) and (TNU,BN) is given by the obvious formula BM⊗KN(m⊗n,m′⊗n′)=BM(m,m′)⊗BN(n,n′). It is easy to see that tensor products and scalar extensions preserve nondegeneracy of forms. Next we will generalize this to qBimK.
Let (R,AR) and (S,AS) be quadratic algebras on (K,AK). Their tensor product is (R,AR)⊗(K,AK)(S,AS)=(T,AT), where T=R⊗KS and
[TABLE]
This AT is a right T∙-module with the multiplication
[TABLE]
φ(t)⋅t′=φ(t′dtt′) and a left AK-module with the multiplication aK(aR⊗aS)=aKaR⊗aS, aKφ(t)=φ(trK(aK)t). Let φT(t)=φ(t), trT(aR⊗aS)=trR(aR)⊗trS(aS), trT(φ(t))=t+tdλT. A direct check shows that these operations are well-defined and (T,AT) is actually a quadratic algebra. If (SMR,BM,qM) and (TNU,BN,qN) are quadratic bimodules, then we can construct the quadratic bimodule (S⊗T(M⊗N)R⊗U,BM⊗N,qM⊗N), where
[TABLE]
and qM⊗N(φ(t),∑1⩽i⩽nmi⊗ni)=φ(BM⊗N(∑1⩽i⩽nmi⊗ni,t∑1⩽j⩽nmj⊗nj)). Therefore, we obtain the bifunctor qBimK×qBimK→qBimK, which is commutative and associative up to isomorphism, (K,AK) is a neutral element.
Now let us consider scalar extensions. If (L,AL) is a quadratic ring and (K,AK)→(L,AL) is a morhism, then there is a functor (L,AL)⊗(K,AK)−:qBimK→qBimL. Indeed, for any quadratic algebra (R,AR) over (K,AK) the tensor product (L,AL)⊗(K,AK)(R,AR) is a quadratic algebra on (L,AL), the left AL-module structure on AL⊗KR is given by the formulas aL(aL′⊗aR)=(aLaL′)⊗aR and aLφ(t)=φ(trL(aL)t). It is easy to check that this construction is well-defined and (L⊗KR,AL⊗KR) is a quadratic algebra on (L,AL). Moreover, any bimodule (L,B1,q1)⊗(M,BM,qM) is automatically a morphism in qBimL. Note that the scalar extension is functorial on (K,AK)→(L,AL) and preserves tensor products of quadratic algebras.
It is easy to see that the tensor product (L,AL)⊗(K,AK)(O,AO) of quadratic rings is a pushout. Hence the category of quadratic rings is finitely cocomplete with an initial object (Z,Heis(Z)).
Finally, we consider localizations. Let (K,AK) be a quadratic ring, S=Sd⊆K be a multiplicative subset, (R,AR) be a quadratic algebra over (K,AK). The localization of (R,AR) with respect to S is the pair (S−1R,AS−1R), where AS−1R=(1⋅S)−1AR (note that 1⋅S is a multiplicative subset in AK). Obviously, this pair is a quadratic algebra on (K,AK) with the operations 1⋅saR⋅s′r=1⋅ss′aR⋅r, φS−1R(sr)=1⋅sφR(rsd), and trS−1R(1⋅saR)=ssdtrR(aR). The quadratic algebra (S−1K,AS−1K) is actually a quadratic ring. Moreover, there is an isomorphism (S−1R,AS−1R)≅(S−1K,AS−1K)⊗(K,AK)(R,AR). The localization on quadratic bimodules is given by S−1(RMT,BM,qM)=(S−1M,BS−1M,qS−1M), where qS−1M(1⋅sr,s′m)=1⋅ss′qM(r,m).
At the end let us make a remark. If K=Kf1+…+Kfn, where f1,…,fn∈H(K), then AK=AK(1⋅f1)+…+AK(1⋅fn). Indeed, we have ∑i=1nkifi=1 and after simplifying 1=1⋅(∑i=1nkifi)N for N big enough all summands of type φK(…) will have at least two identical factors fi inside the brackets, therefore they can be pushed outside by the formula φK(gfi2)=φK(g)⋅fi. Then one can, for example, construct sheaves of quadratic algebras on Spec(H(K)).
7 Elementary transvections
Let us modify several standard definitions for hermitian bimodules in the our context. If (TPR,BP,qP) is a quadratic bimodule and PR is finitely generated projective, then the quadratic bimodule (M(P),BM(P),qM(P)) is called the metabolic space over P, where M(P)=P⊕P∨d is a T-R-bimodule, B_{\operatorname{M}(P)}(\bigl{(}\begin{smallmatrix}p\\
\!\;\overline{\!\!\>f\vphantom{d}\!\!\>}\;\!\end{smallmatrix}\bigr{)},\bigl{(}\begin{smallmatrix}p^{\prime}\\
\!\;\overline{\!\!\>f^{\prime}\vphantom{d}\!\!\>}\;\!\end{smallmatrix}\bigr{)})=\lambda_{R}f(\!\;\overline{\!\!\>\lambda\vphantom{d}\!\!\>}\;\!_{T}p^{\prime})+\!\;\overline{\!\!\>f^{\prime}(p)\vphantom{d}\!\!\>}\;\!+B_{P}(p,p^{\prime}) and q_{\operatorname{M}(P)}(a_{T},\bigl{(}\begin{smallmatrix}p\\
\!\;\overline{\!\!\>f\vphantom{d}\!\!\>}\;\!\end{smallmatrix}\bigr{)})=\varphi_{R}(\lambda_{R}f(\!\;\overline{\!\!\>\lambda_{T}\vphantom{d}\!\!\>}\;\!\operatorname{tr}_{T}(a_{T})p))+q_{P}(a_{T},p). This bimodule is actually a quadratic space. In the case BP=qP=0 the metabolic space is called the hyperbolic space and is denoted by H(P).
Let (TPR,BP,qP) be a quadratic space. Its automorphism group is called the unitary group and is denoted by U(P)=U(P,BP,qP). We will be interested in the case P=P0⊥H(P1)⊥…⊥H(Pl), where P0 is a quadratic space (the odd part of P). Let P−i=Pi∨d for all 1⩽i⩽l, then P=⨁i=−llPi as a T-R-bimodule. From now on we will assume that for all i,j=0 the bimodule Pi is isomorphic to a direct summand in PjN for N big enough.
Let E=EndR(PR), then by proposition 1 there is an adjunction (P∨,P) in qBimK. In this case E is a quadratic space (as a bimodule TEE) and the adjunction gives the isomorphism AutBim(E)≅AutBim(P)=GL(P) (since E⊗EP⊗RP∨≅E and P⊗RP∨⊗EP≅P), which induces the isomorphism U(E)≅U(P). Let C=CE(T)=EndBim(E)=EndBim(P), it is a K-algebra with involution (that is obtained as the restriction of the pseudo-involution on E). The algebra C commutes with λE, because λE equals to the image of λT by lemma 2). Then AutBim(E)=AutBim(P)=C∗.
The condition P=P0⊥H(P1)⊥…⊥H(Pl) is equivalent to existence of a complete family of pairwise orthogonal idempotents {ei}−l⩽i⩽l in C such that eid=e−i, q(AT,ei)=0 for all i=0, and our additional assumption is equivalent to (1−e0)CeiC(1−e0)=(1−e0)C(1−e0) for all i=0.
Let
[TABLE]
for all x∈eiCej and i=j, similarly to the elementary transvections in the general linear group. It is easy to check the identities
LT1.
ti,j(x)ti,j(y)=ti,j(x+y);
2. LT2.
[ti,j(x),tj,k(y)]=ti,k(xy) for all i=k;
3. LT3.
[tj,i(x),tk,j(y)]=tk,i(−yx) for all i=k;
4. LT4.
[ti,j(x),tk,l(y)]=1 for all j=k and l=i.
The elementary transvections are
[TABLE]
for all x,y∈eiCej, 0=i=±j=0 and
[TABLE]
for all x∈e0Cei, y∈e−iCei, z∈e−iCe0, i=0. The elementary unitary group is the group
[TABLE]
For comfort work with τi,j(x,y) and τi(x,y,z) let us introduce the K-algebra A=C×C with the diagonal embedding C↪A and the involution (x,y)d=(yd,xd), and the group H=e0C×C×Ce0 with the group operation (x,y,z)∔(x′,y′,z′)=(x+x′,y+zx′+y′,z+z′), the neutral element 0˙=(0,0,0) and the inverses −˙(x,y,z)=(−x,zx−y,−z). There is the right action of A∙ on H given by the formula (x,y,z)⋅(p,q)=(xp,qdyp,qdz), and there are the group homomorphisms π:H→A,(x,y,z)↦(x,−zd) and φ:A→H,(x,y)↦(0,x−yd,0). Finally, there is the map tr:H→A,(x,y,z)↦(y,zx−yd). All these operations satisfy
τi,j:eiAej→GL(P)* and τi:H⋅ei→GL(P) are homomorphisms;*
2. T2.
τi,j(a)=τ−j,−i(−ad);
3. T3.
[τi,j(a),τk,l(a′)]=1, if i=l=−j=−k=i;
4. T4.
[τi,j(a),τj,k(a′)]=τi,k(aa′)* and [τj,i(a),τk,j(a′)]=τk,i(−a′a), if i=±k;*
5. T5.
[τ−i,j(a),τj,i(a′)]=τi(φ(aa′));
6. T6.
[τi(h),τj(h′)]=τ−i,j(−π(h)dπ(h′)), if i=±j;
7. T7.
[τi(h),τj,k(a)]=1, if j=i=−k;
8. T8.
[τi(h),τi,j(a)]=τ−i,j(tr(h)a)τj(−˙h⋅(−a)).
Proof.
This directly follows from the relations for ti,j(x).
∎
8 Level groups
Let Λi={h∈H⋅ei∣τi(h)∈U(P)} for i=0. This group depends on q, for any (x,y,z)∈Λi there are the equalities z=−xd and y+yd=zx (they are equivalent to τi(x,y,z)d=τi(−x,zx−y,−z)). Let also Λ=∑i=0⋅Λi∔φ(C)⋅(1−e0).
Lemma 7**.**
Let l⩾3. Then π(Λ)⩽e0C(1−e0), tr(Λ)+π(Λ)dπ(Λ)⩽(1−e0)C(1−e0), and Λ⋅(1−e0)C(1−e0)∔φ(C)⋅(1−e0)⩽Λ. Moreover, Λi=Λ⋅ei for all i=0.
Proof.
This is followed from lemma 6, because τi,j(a)∈U(P) iff a∈eiCej.
∎
As an example let us find explicitly Λ in the case T=K and the the quadratic form is given by an odd form parameter L⩽Heis(B). The condition q(1,ei)=0 means (ei,0)∈L for all 0=i. The element τi(x,y,z) is in U(P), if z=−xd and (xp,B(p,yp))∈L for all p∈Pi. Therefore, Λi={(x,y,−xd)∈H⋅ei∣(xp,B(p,yp))∈L for p∈⨁i=0Pi}. Conversely, if Λ satisfies all conditions from lemma 7, T=K and P=E=C, then Λ is obtained from the odd form parameter L={(x,y)∣(x,y,−xd)∈Λ}⋅C∔⟨(ei,0)∣i=0⟩∔Lmin.
An augmented level is a pair L=(I,Γ), where I=Id⩽A, Γ⩽H and I(I+K+(1−e0)C(1−e0)+π(Λ)+π(Λ)d)⩽I, π(Γ)⩽I, tr(Γ)+π(Γ)dπ(Γ)⩽I, Γ⋅(I+K+(1−e0)C(1−e0)+π(Λ)+π(Λ)d)∔Λ⋅I∔φ(I)⩽Γ, e0I(1−e0)=π(Γ⋅(1−e0)). Two augmented levels L=(I,Γ) and L′=(I′,Γ′) are called equivalent if I(1−e0)=I′(1−e0) and Γ⋅(1−e0)=Γ′⋅(1−e0), the equivalence classes are called levels. We will often denote the class of an augmented level L also as L. For a level L its enveloping level is the level L, where I(1−e0)=I(1−e0)+(1−e0)C(1−e0)+π(Λ) and Γ⋅(1−e0)=Γ⋅(1−e0)∔Λ. Any level L as a class contains the smallest and the biggest augmented levels ⌊L⌋ and ⌈L⌉, where e0⌊I⌋e0=e0I(1−e0)Ie0+e0I(1−e0)Ie0, ⌊Γ⌋⋅e0=Γ⋅(1−e0)Ie0∔Λ⋅Ie0∔φ(⌊I⌋)⋅e0, e0⌈I⌉e0={a∈e0Ae0∣aI(1−e0)+(1−e0)Ia⩽I}, and ⌈Γ⌉⋅e0={h∈H⋅e0∣π(h)∈⌈I⌉,tr(h)∈⌈I⌉,h⋅I(1−e0)+φ(I)⋅(1−e0)⩽Γ}.
Lemma 8**.**
Let l⩾3 and G⩽GL(P) is normalized by EU(P). Then there exists unique level L(G)=(I(G),Γ(G)) such that τi,j(a)∈G iff a∈I(G) and τi(h)∈G iff h∈Γ(G).
Proof.
The proof is completely similar to the one of lemma 7, if we note that [G,G]⩽G and [G,EU(P)]⩽G.
∎
Therefore, any group G normalized by EU(P) corresponds to the level L(G). If L=(I,Γ) is a level, then one can define the group
[TABLE]
and the elementary level group
[TABLE]
Let us define α(g)=(g,g−1d) and ε=(e0,e+,−e0), where e+=∑i>0ei and e−=∑i<0ei. It is easy to see that π(ε)=e0 and tr(ε)=e+−ζ, where ζ=(0,1).
Lemma 9**.**
The map α:C∗→A∗ is a homomorphism, α(g)d=α(g−1), and there are the following identities
[TABLE]
Proof.
All claims are followed from the definitions.
∎
Now let L be an augmented level. The principal level group of augmented level L is the group
[TABLE]
This is a group by lemma 9: indeed, α(g−1)−1=(α(g)−1)d, α(gg′)−1=(α(g)−1)α(g′)+(α(g′)−1), ε⋅α(g−1)−˙ε=−˙(ε⋅α(g)−˙ε)⋅α(g−1), and ε⋅α(gg′)−˙ε=(ε⋅α(g)−˙ε)⋅α(g′)∔(ε⋅α(g′)−˙ε). Moreover, the lemma implies that U(P,L) is normalized by EU(P) (because α(h)α(g)−1=α(h)(α(g)−1) and (ε⋅α(h)α(g)−˙ε)⋅α(h)=(ε⋅α(h)−˙ε)⋅α(g)∔(ε⋅α(g)−˙ε)−˙(ε⋅α(h)−˙ε)) and its level is L. Therefore, the level of EU(P,L) is also L. We also will use the general level group
[TABLE]
Clearly, it is a group normalized by EU(P).
Lemma 10**.**
Let l⩾3 and L be an augmented level. The group GU(P,L) contains U(P,L) and its level is L. Besides, GU(P,L) is contained in the set GU′(P,L)={g∈GL(P)∣[{g,g−1},EU(P,L)]⊆U(P,L)}, which can also be defined through the equations
[TABLE]
Moreover, [GU′(P,⌊L⌋),U(P,⌊I⌋+K,⌊Γ⌋)]⊆U(P,⌊L⌋) and GU(P,⌊L⌋)=GU′(P,⌊L⌋).
Proof.
Suppose that g∈C∗ satisfies [g,EU(P,L)]⊆U(P,L). This is equivalent to α([g,τi,j(a)])−1∈I, ε⋅α([g,τi,j(a)])−˙ε∈Γ for all a∈eiIej (where 0=i=±j=0) and α([g,τi(h)])−1∈I, ε⋅α([g,τi(h)])−˙ε∈Γ for all h∈Γ⋅ei (where i=0). Then α(g)(a−ad)≡a−admodI and α(g)(tr(h)+π(h)−π(h)d)≡tr(h)+π(h)−π(h)dmodI. After multiplying the congruences of the first kind for various i and j (we use only that l⩾2) we obtain α(g)a≡amodI for all a∈(1−e0)I(1−e0). Then, using the congruences of the second kind, we obtain the first equation on g, that is α(g)a≡amodI for all a∈⌊I⌋+K.
Further, we know that ε⋅α(g)(1+a−ad)≡ε⋅(1+a−ad)modΓ for all a∈eiIej (where 0=i=±j=0). It follows that ε⋅α(g)a∈Γ (note that \Gamma\mathbin{\hbox to0.0pt{\displaystyle\text{\raisebox{1.0pt}{⊲}}\hss}\mspace{-5.0mu}\leqslant}\langle\varepsilon\cdot(I+\lfloor\widehat{I}\rfloor+K)\rangle\dotplus\Gamma and φ(ζ(a−ad))=φ(a) for all a∈A). Therefore, ε⋅α(g)a≡ε⋅aα(g−1)modΓ for all a∈(1−e0)I. Moreover, ε⋅α(g)(1+tr(h)+π(h)−π(h)d)≡ε⋅(1+tr(h)+π(h)−π(h)d)modΓ for all h∈Γ⋅ei and i=0. After simplifications we obtain ε⋅α(g)π(h)∔ε⋅α(g)(−π(h)d)≡−˙h⋅(−1)∔φ(ζα(g)tr(h))modΓ. Recall that ε⋅α(g)(−π(h)d)≡ε⋅(−π(h)dα(g−1))=0, hence ε⋅α(g)π(h)−˙ε⋅π(h)α(g−1)−˙h∗⋅(−α(g−1))∔h⋅(−1)∈Γ. After changing h into −˙h⋅(−1) the third equation on g follows.
The level of GU(P,L) can be found from the equations (it is sufficient to use the first and the second ones). If we prove [GU′(P,⌊L⌋),U(P,⌊I⌋+K,⌊Γ⌋)]⊆U(P,⌊L⌋), then it will be clear that GU′(P,⌊L⌋) is a group and is equal to GU(P,⌊L⌋). Let [g,EU(P,L)]⊆U(P,⌊L⌋) and x∈EU(P,⌊I⌋+K,⌊Γ⌋), then
[TABLE]
and if h=ε⋅α(x)−˙ε∈⌊Γ⌋, then
[TABLE]
9 Localization and roots
Let S⩽H(K)∙ be a multiplicative subset, L be a level. Then S−1L⊆S−1A×S−1H can be considered as a level (if instead of Λ we will take S−1Λ), hence one can define the corresponding subgroups EU(S−1P,S−1L), U(S−1P,S−1L) and GU(S−1P,S−1L) in (S−1C)∗ (in general, they are not related to the bimodule S−1P, because its endomorphism ring can be strictly bigger than S−1C). Let L0 be the level of EU(P), EU(S−1P) be the group EU(S−1P,S−1L0), then EU(S−1P,S−1L)=EU(S−1P)EU(S−1L). The localization homomorphism will be denoted by ΨS.
If L=(I,Γ) is a level and s∈S, then let L⋅s=(Is,Γ⋅s+φ(Is)), it is also a level. Let us introduce the system of subgroups ΩS(P,L)={EU(P,L⋅s)∣s∈S} in EU(P,L). We a going to prove that ΩS(P,L) is a neighborhood base of the identity for some group topology (it will be called a base for short) on EU(P,L), not necessary Hausdorff, and that EU(P,L) acts continuously by conjugation on EU(P,L). Moreover, we a going to prove similar statements for EU(S−1P,S−1L) with the subgroup system ΨS(ΩS(P,L)).
Let Φ={±ei±ej,±ek,±2ek∣1⩽i<j⩽l;1⩽k⩽l}⊆Rl, it is a non-reduced (crystallographic) root system of type BCl. Its elements (i. e. roots) of length 2 are called long, of length 2 — the short ones, and of length 1 — the ultrashort ones. If α∈Φ is a root and L=(I,Γ) is a level, then one can defines a subgroup Uα(L)⩽EU(P,L) in the following way (where ei=−e−i for i<0):
[TABLE]
Clearly, U_{2\mathrm{e}_{i}}(L)\mathbin{\hbox to0.0pt{\displaystyle\text{\raisebox{1.0pt}{⊲}}\hss}\mspace{-5.0mu}\leqslant}U_{\mathrm{e}_{i}}(L). From lemma 6 it follows that
[TABLE]
if iα+jβ=0 for all i,j>0 (the lemma also implies that the product in the right hand side is a group and is independent from the order of factors).
[TABLE]
Lemma 11**.**
If l⩾3 and L is a level, then [EU(P,L),EU(P)]=EU(P,L).
Proof.
It is sufficient to prove that EU(L)⩽[EU(L),EU(L0)]. If α is a short root, then it can be presented as α=β+γ, where β and γ are also short, hence Uα(L)=[Uβ(L),Uγ(L0))]. If α is ultrashort, then α=β+γ, where β is short and γ is ultrashort, hence
[TABLE]
Note that if H,H′∈ΩS(P,L), then there exists H′′∈ΩS(P,L) such that H′′⩽H∩H′ (if H=EU(P,L⋅s) and H′=EU(P,L⋅s′), then one can take H′′=EU(P,L⋅ss′)). Therefore, in order to prove that ΩS(P,L) is a base and EU(P,L) acts continuously on EU(P,L), is is sufficient to prove continuity in 1 (or in (1,1)) of maps EU(P,L)→EU(P,L),g↦[g,x] for x∈EU(P,L) fixed, EU(P,L)→EU(P,L),x↦[g,x] for g∈EU(P,L) fixed, and EU(P,L)×EU(P,L)→EU(P,L),(g,x)↦[g,x].
Lemma 12**.**
If l⩾3 and L is a level, then EU(P,L)=⟨U−α(L0)Uα(L)∣α∈Φ⟩. If S⩽H(K)∙ is a multiplicative subset, then ΩS(P,L) is a base of EU(P,L) and ΨS(ΩS(P,L)) is a base of EU(S−1P,S−1L). Moreover, EU(P,L) acts continuously by conjugation on EU(P,L) and EU(S−1P,S−1L) acts continuously by conjugation on EU(S−1P,S−1L).
Proof.
Let EU′(L)=⟨U−α(L0)Uα(L)∣α∈Φ⟩, then EU(L)⩽EU′(L)⩽EU(P,L). At first we will prove that EU′(L) is closed under conjugation by elements x∈Uδ(L) and these conjugations are continuous at 1∈EU′(L)=EU(P,L). It is sufficient to prove the claim for subgroups U−α(L0)Uα(L) (with the base {U−α(L0)Uα(L⋅s)∣s∈S}), which are mapped into EU(P,L) by conjugation.
If α and δ are linearly independent, then
[TABLE]
If α=±δ is short, then α=β+γ, where β and γ are also short, hence
[TABLE]
If α=±δ is ultrashort, then α=β+γ, where β is short and γ is ultrashort, hence
[TABLE]
and
[TABLE]
It can be proved similarly that x(−):EU(S−1P,S−1L)→EU(S−1P,S−1L) is continuous at 1 for x∈EU(S−1P,S−1L) and that both maps (g,x)↦[g,x] are continuous at (1,1). It remains to prove continuity of the map x↦[g,x], where g∈EU(P,L) (and similarly for EU(S−1P,S−1L)). Without loss of generality, g∈Uα(L), then [Uδ(L⋅s),Uα(L)]⩽∏iα+jδ∈Φi,j>0Uiα+jδ(L⋅s) for linearly independent or codirectional α and δ. If α=−β is short and α=β+γ, where both β and γ are short, then
[TABLE]
and similarly in the case when α=−β is ultrashort.
∎
Note that the embeddings EU(P,L)→EU(P,L′) and EU(S−1P,S−1L)→EU(S−1P,S−1L′) are continuous, if the level L is contained in L′ (that is ⌊I⌋⩽⌊I′⌋ and ⌊Γ⌋⩽⌊Γ′⌋).
Lemma 13**.**
Let l⩾3 and L be a level. Then the group EU(P,L) is generated by the subgroups U±el±ei(L) for 1−l⩽i⩽l−1 as a subgroup of C∗=GL(P) normalized by EU(P,L) (and EU(P,L) itself is generated by similar subgroups as an abstract group).
Recall that a quasi-finite K-algebra is a direct limit of finite K-algebras (one can consider it as a noncommutative integrality). This condition on an algebra has equivalent reformulations (which are written in [4]), and these reformulations imply that a subalgebra B of a quasi-finite algebra A is quasi-finite itself (we also will use the equality B∗=A∗∩B). It is also useful to note that quasi-finiteness over K is equivalent to quasi-finiteness over H(K), since K is integral over H(K) (the integral dependence equation for k∈K is k2−k(k+kd)+(kkd)=0). Finally, we will use that in a quasi-finite algebra one-sided invertibility is equivalent to a two-sided one.
We a going to prove the inclusion [GU(P,L),EU(P,L)]⊆EU(P,L) for all augmented levels L if C is quasi-finite over K under certain additional assumption. First of all we will prove one lemma for K local.
Recall that GU′(P,L)={g∈GL(P)∣[{g,g−1},EU(P,L)]⊆U(P,L)} for an augmented level L. Clearly, this set is closed under conjugation by EU(P,L) and under multiplication by EU(P,L) from both sides. Let GUk′(P,L)={g∈GU′(P,L)∣ekα(g)=ekα(g)ek=α(g)ek} for all k=0. Obviously, [GUk′(P,L),EU(P,L)]⊆EU(P,L) by lemma 13.
Lemma 14**.**
Suppose that l⩾3, K is a local ring with the maximal ideal m, C is a finite K-algebra, α(g±1)a≡amodI and ε⋅α(g±1)a∈Γ for all a∈e±kIe∓k, and also ekα(g)ek is invertible in ekAek. Then there exist h,h′∈EU(P,L) such that ekα(hgh′)=ekα(hgh′)ek=α(hgh′)ek.
Proof.
Note that always eiα(g)ekIe−kα(g−1)e−kIek⩽I and ek∈ekIe−kα(g−1)e−kIekα(g)ek for all 0=i=±k (since e−kα(g−1)e−k is invertible in e−kAe−k). If ek=xα(g)ek for x∈ekIe−kα(g−1)e−kIek, then eiα(τi,k(−eiα(g)x)g)ek=0. Since multiplication by such a transvection preserves the condition, we may assume that eiα(g)ek=0. Now let h=ε⋅α(g)x∈Γ, then (1−ek)α(τk(h⋅(−1))g)ek=0 and multiplication by this transvection also preserves the condition on g.
Applying the same transformations for g−1 instead of g (and −k instead of k) we can obtain the equality ekα(g)=ekα(g)ek.
∎
Proposition 2**.**
Suppose that l⩾3, K is a local ring with the maximal ideal m, C is a finite K-algebra and e0Ce0 is generated by less than 4l2 elements as a K-module. Then GU′(P,L)=EU(P,L)GUk′(P,L) for all k=0.
Proof.
Without loss of generality, k=l. We will prove that every g∈GU′(P,L) is in the set EU(P,L)GUl′(P,L). We will conjugate it by elements from EU(P,L) and multiply it from both sides by elements from EU(P,L), until we obtain an element from GUl′(P,L). For convenience we will work in A, applying conjugations and multiplications directly to α(g).
At first let us prove that there exist p∈elI and q∈Iel such that pα(g)elq in invertible in elIel. In order to do it we note that α(g)((1−e0)I(1−e0))⊆(1−e0)I(1−e0)+I. Without loss of generality, K is a field and A is semi-simple (since we can take the factor by the Jacobson radical J(A)), then A is a finite-dimensional K-algebra. Let A=I(1−e0)I+K and f0,f1,…,fm be liftings to A of the primitive central idempotents of A/J(A) (such that fi form a complete family of orthogonal idempotents), one can assume that fi∈I(1−e0)I for all i=0, f0I(1−e0)I⩽J(A) and f0=e0f0e0 (if A=I(1−e0)I, then let f0=0). Let also B=I+I(1−e0)I+K. It is easy to show that the classes of fi in B/J(B) are primitive central idempotents for all i=0. Moreover, (1−e0)A(1−e0)=(1−e0)B(1−e0) is semi-simple.
Let us fix an index 1⩽i⩽m. Clearly, X=α(g)((1−e0)fiA(1−e0))⊆B is a semi-simple subalgebra (generally their identity elements are different) and fjX⊆fjA[ζ]fiA=0 for all 1⩽j⩽m and j=i (since fjA(1−e0)fi⊆A[ζ]). Moreover, f0X⊆J(B), because the image of f0X in B/J(B) is a semi-simple subalgebra contained in e0Be0/J(e0Be0), all simple factors of X have dimension at least 4l2, and e0Be0/J(e0Be0) can be embedded in the product e0Bζe0/J(e0Bζe0)×e0B(1−ζ)e0/J(e0B(1−ζ)e0), where both factors have dimension strictly less than 4l2. It follows that X⊆fiX+J(B)⊆IfiI+J(B), hence there are p∈elI and q∈Iel such that pα(g)elq is invertible in elIel.
Now let us show that p can be taken of type elα(h), where h∈EU(P,L) and e−lα(h)=e−l. We will work in the algebra T=B/J(B). Let p′=elα(h) be such that h∈EU(P,L), e−lα(h)=e−l and the ranks of p′α(g)elq are maximal possible (in every simple factor of T). Let us denote α(hg)elα(g−1)q as f. It is easy to see that eifel∈Telfel for all 0=i=±l, otherwise we can multiply f by some element of type τl,i(a) from the left, increasing some rank of elf. Next, e−lfel∈Telfel, since otherwise we can multiply f by τl,2−l(a)τl−1,l−2(b)τl,l−2(c) from the left, increasing some rank of e+fel. Finally, e0fel∈Telfel, because otherwise we can multiply f by an element of type τ−l(h) from the left, increasing some rank of (1−e0)fel. Hence fel∈Telfel, then elfel has the same ranks as el, that is this element is invertible in elTel.
Suppose that l⩾3, C is a quasi-finite K-algebra and e0Ce0 is generated by less than 4l2 elements as a K-module. Then for any augmented level L there is the inclusion [GU′(P,L),EU(P,L)]⊆EU(P,L).
Proof.
Fix g∈GU′(P,L), then without loss of generality H(K) is Noetherian (for examply it is finitely generated over Z), K and C are finite H(K)-algebras. Consider the ideal
[TABLE]
It is trivial to check that it is an ideal. Let \mathfrak{m}\mathbin{\hbox to0.0pt{\displaystyle\text{\raisebox{1.0pt}{⊲}}\hss}\mspace{-5.0mu}\leqslant}\operatorname{H}(K) be a maximal ideal, then Km is local and proposition 2 can be applied to Cm as a Km-algebra. An element Ψm(g) is a product of elements from EU(Pm,Lm) and D(Pm,Lm), hence for all H∈Ωm(P,L) there exists H′∈Ωm(P,L) such that Ψm([g,H′])⊆Ψm(H). We can take H=EU(P,L⋅s) and H′=EU(P,L⋅ss′) for such s that Ψm∣Cs is injective, then [g,H′]⊆H and ss′∈a⩽m. Therefore, a=(1).
∎
Therefore, GU(P,L)⩽GU′(P,L)={g∈GL(P)∣[{g,g−1},EU(P,L)]⊆EU(P,L)}=GU(P,⌊L⌋) under assumptions of the theorem.
Let us give an example that shows necessity of the condition on e0Ce0. Let S=R=K with the trivial involution, 2∈K∗, λS=λR=1, Pi are free K-modules of rank 1 for all i=0 and P0≅H(Pl) as a hermitian space. Let also AS=AK=Heis(K) and AR=Heis(B)/∑i=0⋅q(Pi) (in this case both Heisenberg groups are abelian). Then U(P)={g∈GL(P)∣gTb=bg−1,g0,i=0 for i=0}, where gT is the transpose, b is the matrix of the split symmetric bilinear form, and g0,i is e0gei in out previous notation. Since g∈U(P) is an orthogonal operator, it follows that gi,0=0. In other words, U(P)=O(2l,K)×O(P0), where the first factor means the split orthogonal group and EU(P)=EO(2l,K). Further, U(P,⌈L0⌉)=O(2l,K)×GL(P0). There is some augmented level L0′ (which corresponds to the level L0) such that U(P,L0′)=U(P) and GU(P,L0′)=(GO(2l,K)×GO(P0))⋊⟨σ⟩, where σ isometrically swaps P0 and ⨁i=0Pi, σ2=id. Clearly, [GU(P,L0′),EU(P)]⩽EU(P). Worse, there does not exists the greatest group with the level L0, since ⟨GU(P,L0′),U(P,⌈L0⌉)⟩=(GL(2l,P)×GL(P0))⋊⟨σ⟩ has a strictly greater level.
11 Global extraction of transvections
Now we will prove the main theorem. Let G⩽GL(P) be a subgroup normalized by EU(P) and L be its level. We want to prove that G⩽GU(P,⌊L⌋). More precisely, it will be proved that if EU(P,L)⩽G, then either G⩽GU(P,⌊L⌋), or G contains an elementary transvection not from EU(P,L).
It is simpler to prove the assertion in the local case, hence we need an auxiliary result that allows us to pass from the local case to the global one. Suppose that S⩽H(K)∙ is a multiplicative subset, H∈ΨS(ΩS(P,L)), H′∈ΨS(ΩS(P,L)) (and H′⩽H), g∈G and suppose that we found a sequence of elements g0=ΨS(g),g1,…,gN in GL(S−1P)=(S−1C)∗ such that gN∈/EU(S−1P,S−1L) is an elementary transvections (such a transvection will be called nontrivial) and every gi+1 can be obtained from gi by one of the rules
EX1.
gi+1=fi,0higifi,1, where hi∈EU(S−1P,S−1L) and fi,0,fi,1∈EU(S−1P,S−1L);
2. EX2.
gi+1=fi,0∏k=1ni([gi,ti,k]±1fi,k), where ti,k∈EU(S−1P,S−1L)), fi,k∈EU(S−1P,S−1L)), ti,khi−1′…h0′∈H and [ti,k,bi]hi−1′…h0′,fi,khi−1′…h0′∈H′ (the elements hj′∈EU(S−1P,S−1L) and bi∈GU(S−1P,⌊S−1L⌋) will be defined below, their definitions use only the previous gj);
3. EX3.
gi+1=gixi, where xi∈GU(S−1P,⌊S−1L⌋);
4. EX4.
gi+1=gi−1, and if before this rule there was an application of the third rule, then somewhere after that application there was an application of the second rule.
Usually we will not use the third rule at all until a certain moment, and after this moment we will not use the fourth one. In all lemmas about local extraction of transvections we will assume that g is obtained as result of such a sequence and that in this sequence after every application of the third rule there is an application of the second one. In the second rule we will usually use only one factor with fi,0=fi,1=1 and ti,1 will usually be taken as a transvection from a small enough element of ΨS(ΩS(P,L)) (depending of the previous gj in order to satisfy the necessary conditions), that is transvections of type τp,q(as) or τp(h⋅s) for s∈S big enough.
It is easy to prove by induction that gi=aibi for some ai and bi∈GU(S−1P,⌊S−1L⌋). Also a0=ΨS(g) and either ai+1=hi′ai±1, or ai+1=fi,0′∏k=1ni′([ai,ti,k′]fi,k′) (in this case hi′=1 and bi=1), where hi′∈EU(S−1P,S−1L), ti,k′∈EU(S−1P,S−1L), fi,k′∈EU(S−1P,S−1L), fi,k′hi−1′,…,h0′∈H′, and ti,k′hi−1′…h0′∈H. In the unique nontrivial case of the second rule one may use the formulas [aibi,ti,k]=[bi,ti,k][[ti,k,bi],ai][ai,ti,k] and [ti,k,aibi]=[ti,k,ai][ai,[ti,k,bi]][ti,k,bi]. Clearly, these ai can be obtained using the same rules (after including additional steps), and for them in the first rule always fi,0=fi,1=1, all bi constructed from this new sequence are equal to 1 (if fi,0=fi,1=1 in all applications of the first rule, one can take hi′=hi).
Let gN be a nontrivial transvection of a root type α. Clearly, there exists a short root β∈Φ such that the angle between α and β is obtuse. There also exists a∈I⋅s for s∈S big enough such that gN+1=[gN,τβ(1a)] still is not in EU(S−1P,S−1L), but τβ(1a)hN′…h0′∈H and [bN,τβ(1a)]hN′…h0′∈H′. Clearly, gN+1hN′…h0′∈⟨[ΨS(g),H],H′⟩ (the commutant can be written because we take a commutator at least at the (N+1)-st step).
In the following lemma a nontrivial transvection is a product of at most two elementary transvections (with an angle 4π between roots) if the product is not in EU(S−1P,S−1L).
Lemma 15**.**
Suppose that l⩾3, H(K) is Noetherian, K and C are finite H(K)-algebras, L is a level, and ΩSL(P)={EU(P,⟨a⟩)∣a∈(1−e0)A(1−e0);∀s∈Sas∈/I}∪{EU(P,⟨h⟩)∣h∈H⋅(1−e0);∀s∈Sh⋅s∈/Γ}, where ⟨a⟩ and ⟨h⟩ are the least levels containing a and h. Then:
•
For all H∈ΨS(ΩS(P,L)) and for all nontrivial transvections g∈/EU(S−1P,S−1L) there exists H′∈ΨS(ΩSL(P)) such that H′⩽H⟨g⟩.
•
For all H∈ΨS(ΩSL(P)) and for all elementary transvections f∈EU(S−1P,S−1L) there is a nontrivial transvection in fH.
•
For all H∈ΨS(ΩS(P,L)), f∈EU(S−1P,S−1L) and for all nontrivial transvection g∈/EU(S−1P,S−1L) there is H′∈ΨS(ΩSL(P)) such that H′⩽H⟨fg⟩.
Proof.
The first two claims easily follow from lemma 6. In the third claim let f=fn…f1, where all fi are elementary transvections. We will prove the claim by induction on n, the case n=0 is exactly the first claim. If the claim is true for n−1, then
[TABLE]
Let us prove the main theorem. Let G⩽C∗ be a subgroup normalized by EU(P) and L be its level. We will prove G⩽GU(P,⌊L⌋) by contradiction. Suppose that it is false, then we want to find a nontrivial transvection in G, i. e. an elementary transvection not in EU(P,L). In proposition 3 (which will be proved in the next section) we will prove the assertion if K is local and C is a finite K-algebra, and the resulting nontrivial transvection can be obtained from some g∈G∖GU(P,⌊L⌋) by the rules EX1 – EX4.
Theorem 2**.**
Suppose that l⩾4, C is a quasi-finite K-algebra, and that e0Ce0 is generated by less than 4l2 elements as a K-module.
If G⩽GL(P) is normalized by the group EU(P) and has a level L, then EU(P,L)⩽G⩽GU(P,⌊L⌋). Conversely, all subgroups satisfying these inequalities are normalized by EU(P).
Proof.
Suppose that g∈G∖GU′(P,⌈L⌉) (recall that GU′(P,⌈L⌉)=GU(P,⌊L⌋) by theorem 1). Without loss of generality, H(K) is Noetherian (for example, it is finitely generated over Z), both K and C are finite K-algebras. It is easy to see that Ψm(g)∈/GU′(Pm,⌈Lm⌉) for some maximal ideal \mathfrak{m}\mathbin{\hbox to0.0pt{\displaystyle\text{\raisebox{1.0pt}{⊲}}\hss}\mspace{-5.0mu}\leqslant}\operatorname{H}(K) (here ⌈Lm⌉=⌈L⌉m because of Noetherian condition). Let us apply proposition 3, then there is a sequence of elements g0=Ψm(g),…,gN+1 such that gN+1 is a nontrivial transvection (it can also be a product of two elementary transvections with an angle 4π between their roots) and gN+1h∈⟨[ΨS(g),H],H′⟩ for some h∈EU(S−1P,S−1L). The groups H∈Ψm(Ωm(P,L)) and H′∈Ψm(Ωm(P,L)) can be chosen arbitrary, if only H′⩽H. We take H=Ψm(EU(P,L⋅s)) and H′=Ψm(EU(P,L⋅s)) for such s that Ψm∣sC in injective (it suffices to take such s that Ann(s)=Ker(Ψm)). By lemma 15 there is H′′∈Ψm(ΩmL(P)) such that H′′⩽H⟨gN+1f⟩. But then G itself has a nontrivial transvection (it is some expression of g modulo Ker(Ψm), and we can eliminate the modulus because of our choice of s0), this is a contradiction.
Let K be local with the maximal ideal m, C be a finite K-algebra, L be a level, and g∈C∗. We want to prove that either g∈GU(P,⌊L⌋), or it is possible to obtain a nontrivial transvection from g using the rules EX1 – EX4.
The idea of the extraction of transvections is the following: first of all, we will do it modulo the Jacobson radical J(C) of the algebra C note that (C/J(C) is a semi-simple (K/m)-algebra, because C is semi-local). After that if we have found a nontrivial transvection then it can be used to contsruct a nontrivial transvection in C∗. Otherwise, we still need to extract a transvection in C∗, but we can use that [g]∈GU(P/J(C),⌊L/J(C)⌋) (here the right hand side means the corresponding subgroup of (C/J(C))∗). For simplicity let E=el+el−1.
Lemma 16**.**
Suppose that l⩾4 and Edα(g)=Ed. Then either g∈U(P,⌈L⌉), or it is possible to extract a nontrivial transvection.
Proof.
Suppose that it is impossible to extract a nontrivial transvection. Firstly suppose that (1−E)(α(g)−1)(1−Ed)=1−Ed−E. In this case g=∏α∈Φ,(α,E)<1,∣α∣<2τα(uα) for some uniquely determined uα. Then all factors are products of certain transvections that can be obtained from g by multiple applying of commutators and multiplications by transvections, hence they all are in EU(P,L).
In general case Edα(g)=Ed and α(g)E=E. If we prove that x=Ed+(1−E)g(1−Ed)+E∈U(P,⌈L⌉), then the considered case can be applied to gx−1. Let h=[g,τi,−l(a)] (or h=[g,τi,1−l(a)]) for a∈L and 2−l⩽i⩽l−2, i=0, then (1−E)α(h)(1−Ed)=1−Ed−E. Therefore, h∈U(P,⌈L⌉) for all such i and a, i. e. (α(x)−1)ei∈⌈I⌉. From consideration of [g,τl,l−1(a)] and [g,τl−1,l(a)] it follows that eiα(g)Ed∈I. Returning to h, it is easy to see that ε⋅h−˙ε∈⌈Γ⌉ implies ε⋅α(x)ei−˙ε⋅ei∈⌈Γ⌉, i. e. all columns x with nonzero index satisfy the required equalities. For the remaining column it can be proved using similar reasoning, if we consider τi(h)g instead of g for arbitrary 2−l⩽i⩽l−2, i=0 and h∈Γ (note that (1−e0)α(x)e0∈⌈I⌉ because e0α(x−1)(1−e0)∈⌈I⌉).
∎
Lemma 17**.**
Suppose that l⩾4 and Edα(g)(Ed+e−1+e−2)=Ed. Then either g∈U(P,⌈L⌉), or it is possible to extract a nontrivial transvection.
Proof.
Assume that we cannot extract a nontrivial transvection. Let y=[τl−1,−l(a),x−1]. Clearly, α(y)(e−1+e−2)=e−1+e−2, hence y∈U(P,⌈L⌉) by lemma 16. Since (1−E)α(y)Ed=Ed+(1−E)α(x)−1(ad−a), then Edα(x)(1−Ed)∈I. Therefore, if we multiply x from the right by some element from EU(P,L), we can assume that Edα(x)(1−E)=Ed. Next, ε⋅α(y)Ed∈Γ and EdxE+Edx−1E=0, so it easily follows that (0,EdxE,0)∈Γ. Now after another multiplication of x from the right by an element from EU(P,L), we can assyme that Edx=Ed, hence lemma 16 can be applied.
∎
Lemma 18**.**
Suppose that l⩾4, (1−E)gE=0 or Edg(1−Ed)=0, and that EdgEd and EgE are invertible in EdCEd and ECE. Then either g∈GU(P,⌊L⌋), or it is possible to extract a nontrivial transvection.
Proof.
Assume that one cannot extract a nontrivial transvection. If (1−E)gE=0, then using commutators of type x=[g−1,τl,1−l(a,b)] it is easy to see that g can be multiplied from the right by such an element f∈EU(P,L) that the product g′=gf satisfies the equalities (1−E)g′E=0=Edg′(1−Ed). If Edg(1−Ed)=0, then one can proceed similarly, using commutators of type x=[g,τl,1−l(a,b)] and multiplying by f from the left in the definition of g′. Now if we prove that d=Edg′Ed+(1−E−Ed)g′(1−E−Ed)+Eg′E is in GU(P,⌊L⌋), then for g′d−1 the claim will be obvious.
We will prove that d∈GU′(P,⌈L⌉) using explicit equations from lemma 10. Since [d±1,τi,j(a)]∈U(P,⌈L⌉) for a∈I, 0=i=±j=0, and i,−j=l,l−1 (by lemma 16) and since [d±1,τl,l−1(a)],[d±1,τl−1,l(a)]∈U(P,⌈L⌉) for a∈I (by lemma 17), then α(d)±1a≡amod⌈I⌉ for a∈IEd+(1−Ed)⌊I⌋(1−E)+EI. For a∈(1−E)IE it follows from the fact that all b∈EI(1−E) satisfy congruences ba≡α(d−1)bα(d)a≡bα(d)aα(d−1)mod⌈I⌉. The remaining equations easily follow from this.
∎
Lemma 19**.**
Suppose that l⩾4, e0Ce0 is generated by less than 4l2 elements as a K-module, (e−1α(g)e−1 and e−2α(g)e−2 are invertible in e−1Ae−1 and e−2Ae−2. Suppose also that e−1α(g)(1−e−1),(1−e−1)α(g)e−1,e−2α(g)(1−e−2),(1−e−2)α(g)e−2∈J(A). Then either g∈GU(P,⌊L⌋), or it is possible to extract a nontrivial transvection.
Proof.
Suppose that a nontrivial transvection cannot be extraced. Note that if we swap indices 1 and −1, and also swap 2 and −2, then the condition on g will be preserved. Let us prove that [g,τ−1,2(a)]∈U(P,⌈L⌉) for all a∈I.
Consider the element x=f[g,τ−1,2(a,b)] for some f∈EU(P,L). We will assume that f=e−fe−+e0+e+fe+ and f≡1modJ(C). If there exists such f that Edα(x)(e−l+…+e0)=Ed, then we may apply lemma 17. Let us take any such f∈EU(P) that also Edfg(e−1+e−2)=0 (it is possible, because e−1ge−1 and e−2ge−2 are invertible). It is easy to see that Edx(e−l+…+e0)=Ed, hence we only need to prove that rd=Ed(fg)−1d(e−1+e−2)∈I0={x∈C∣(x,0)∈I}. In order to do this we may assume that a=b (we will need that all such a together generate the two-sided ideal (1−e0)C(1−e0)). A direct calculation shows that Edx(e1+e2)=w and (e−1+e−2)x−1E=wd+pr for some p∈(e−1+e−2)C(e−1+e−2), which is congruent to −a+ad modulo the Jacobson radical, and for w∈J(C).
Let y=[x−1,τl,1−l(u,u)], then y(e−1+e−2+e0)=e−1+e−2+e0 and y∈GU(P,⌊L⌋) by lemma 18. Is is easy to see that (e−1+e−2)y−1Ed=(e−1+e−2)x−1E(ud−u)Ed and Ey(e1+e2)=Ex−1E(u−ud)Edx(e1+e2), where Ex−1E∈E+J(C)r. Since y∈GU(P,⌊L⌋), then ((e−1+e−2)y−1Ed,Ey(e1+e2)y−1(e1+e2)d)∈I, i. e. (e−1+e−2)y−1Ed−(e−1+e−2)y−1d(e−1+e−2)ydEd∈I0. Since also (e1+e2)y−1(e1+e2)=e1+e2+rJ(C), it follows that r∈CrdJ(C)+J(C)rdC+I0. We a going to prove that [r]=0∈A=(1−e0)C(1−r0)/((1−e0)I0(1−e0)). Note that [r]∈A[rd]J(A)+J(A)[rd]A, hence [r]∈A[r]J(A)+J(A)[r]A. Applying the Nakayama lemma to A/A[r]J(A), it follows that [r]∈A[r]J(A). Then [r]=0 again by the Nakayama lemma.
Now lemma 14 implies that there are f1,f2∈EU(P,L) such that e−1f1gf2=e−1f1gf2e−1=f1gf2e−1. Therefore, g∈GU(P,⌊L⌋) by lemma 13.
∎
Note that we can also apply the rules EX1 – EX4 to [g] in C/J(C) in order to obtain a nontrivial transvection. Here we may assume that S=1. The resulting sequence can be lifted into C∗ in such a way that lifts of all auxiliary transvections are in the required open subgroups.
Lemma 20**.**
Suppose that l⩾4, K is a field, C is a finite-dimensional semi-simple K-algebra, and dimK(e0Ce0)<4l2. Then either g∈GU(P,⌊L⌋), or it is possible to extract a nontrivial transvection.
Proof.
Assume that we cannot extract a nontrivial transvection. Without loss of generality, C is a simple algebra as an algebra with involution (i. e. either C=M(n,D), where D is a division algebra with an involution, or C=M(n,D)×M(n,Dop) and the involution swaps the factors). If C=e0Ce0, then one can finish the proof by using lemma 16. We will prove that g∈GU′(P,⌈L⌉) similarly to the proof of lemma 19. First of all, we will prove that [g,τ−1,2(a)]∈U(P,⌈L⌉) for all a∈I.
Consider the element x=f[g,τ−1,2(a,b)], where f∈EU(P,L), f=e−fe−+e0+e+fe+, and Edfg(e−1+e−2)=0. If there is f such that Edα(fg)(e−1+e−2)=0, then we can apply lemma 17 to x. In the case when (1−e0)I(1−e0)=(1−e0)A(1−e0) this is obvious, hence we may assume that (1−e0)I(1−e0)⩽(1−e0)C(1−e0), i. e. either (1−e0)I(1−e0)=0, or (1−e0)I(1−e0)=(1−e0)C(1−e0).
In the case (1−e0)I(1−e0)=0 consider the element y=[x−1,τl,1−l(u,u)], it is in GU(P,⌊L⌋) by lemma 18. The equations of GU′(P,⌈L⌉) imply that (1−e0)y(1−e0)=1−e0. In other words, E(u−ud)Ed=(1−e0)x−1E(u−ud)Edx(1−e0)τl,1−l(u,u)−1 for all u∈elCe1−l. Hence (1−e0)x−1E has full rank (in every simple factor of C it has the same rank as E), and therefore Edxe+=0. Lemma 18 implies that x∈GU(P,⌊L⌋): indeed, if we apply the lemma to x−1 and recall that Edx−1(1−Ed)=0, then from the beginning of the proof of the lemma it follows that (1−E−e0)x−1E=0, hence Ex−1E is invertible, because (1−e0)x−1E has full rank. Similarly, e−α(f)(e−1+e−2)(a−bd)(e1+e2)α(f)e+=e−α(fg)(e−1+e−2)(a−bd)(e1+e2)α(g−1)τ−1,2(a,b)−1α(f−1)(1−e0). The factor e−α(fg)(e−1+e−2) has full rank, hence (e1+e2)α(g−1)τ−1,2(a,b)−1α(f−1)e−=0 and (e1+e2)α(g)(e−1+e−2)=0. Dividing the long equality by e−α(f)e− from the left and by e+α(f)−1e+ from the right, we obtain (e−1+e−2)(a−bd)(e1+e2)=e−α(g)(e−1+e−2)(a−bd)(e1+e2)α(g)−1e+. Therefore, Edα(g)(e−1+e−2)=0.
In the last case, when (1−e0)I(1−e0)=(1−e0)C(1−e0), we will consider x′=[f′g,τ−1,2(a,a)] instead of x, where f′∈EU(P,L), f′=e−f′e−+e−f′e++e+f′e++e0, and Edf′g(e−1+e−2)=0 (clearly, it is sufficient to prove [g,τ−1,2(a,a)]∈U(P,⌈L⌉)). Moreover, we assume that e−1f′g(e−1+e−2)E=0 and e−2f′g(e−1+e−2)E=0, if (1−e0)g(e−1+e−2)E=0, for all primitive central idempotents E in C (there are only 1 or 2 of them). Since Edx′=Ed, then x′∈GU(P,⌊L⌋) by the beginning of the proof of lemma 18 (applied to x′−1d) and Ed(fg)−1d(e−1+e−2)(a−ad)(fg)d(e1+e2)=0. In the case (1−e0)g(e−1+e−2)E=0 we have Ed(fg)−1d(e−1+e−2)Ed=0. In another case, i. e. (1−e0)g(e−1+e−2)E=0, we may take f such that Ed(fg)−1d(e−1+e−2)Ed=0. Therefore, in any case x′∈U(P,⌈L⌉) by lemma 16.
Now [g±1,τα(a)]∈U(P,⌈L⌉) for all short roots α. Hence there is h∈EU(P,L) such that e−lα(hg)e−l in invertible in e−lAe−l (using the proof of the proposition 2). But [(hg)±1,τα(a)] are also in U(P,⌈L⌉) for all short roots α, hence by lemma 14 there are f1,f2∈EU(P,L) such that e−lα(f1hgf2)=e−lα(f1hgf2)e−l=α(f1hgf2)e−l. The rest of the proof is obvious by lemma 13.
∎
Proposition 3**.**
Suppose that l⩾4, K is a local ring, C is a finite K-algebra, e0Ce0 is generated by less than 4l2 elements as a K-module, and g∈C∗. Then either g∈GU(P,⌊L⌋), or it is possible to extract a nontrivial transvection.
Proof.
By lemma 20 either we can extract a nontrivial transvection modulo J(C) (in this case we can extract a nontrivial transvection in C by lemma 19), or [g]∈GU(P/J(C),⌊L/J(C)⌋). In the second case double application of proposition 2 to g gives us an element g′=fhg for f∈EU(P,L) and h∈EU(P,L) such that lemma 19 can be applied to g′.
∎
13 Application to classical groups
Let us begin from the case of even unitary groups. If P=⨁i=1nH(Pi) is a quadratic module over a ring R with a pseudo-involution, then by paper [24] we can assume that AR=R/Ker(φ). The set Λ=Ker(φ)⩽R is a form parameter in Bak’s sense (it is an additive subgroup closed under the action of R∙ and it is contained between Λmin={r−rdλ∣r∈R} and Λmax={r∈R∣r+rdλ=0}). The map tr is given by the formula tr(r+Λ)=r+rdλ. We also assume that all Pi are fully projective over R (for example, Pi=R) and l⩾4.
The ring C=E=EndR(PR) is Morita equivalent to the ring R, hence the lattices of ideals of C and R are isomorphic. Since quasi-finiteness is Morita invariant, the assumption of the main theorem 2 will be satisfied if R is quasi-finite over its center (and the center can be taken as K). The group H in this case equals C with the addition operations and Λ from lemma 7 is Morita equivalent to the form parameter, i. e. Λi={x∈EndR(Pi)∣B(p,xp)∈Ker(φ) for p∈Pi}.
In the even case levels and augmented levels are the same, they are just pairs L=(I,Γ), where I=\!\;\overline{\!\!\>I\vphantom{d}\!\!\>}\;\!\mathbin{\hbox to0.0pt{\displaystyle\text{\raisebox{1.0pt}{⊲}}\hss}\mspace{-5.0mu}\leqslant}\mathcal{A} and Γ⩽H=C are such that I2⊆I, tr(Γ)⩽I and Γ⋅I∔Γ⋅C∔Λ⋅I∔φ(I)⩽Γ. The main theorem claims that if l⩾4 and R is quasi-finite, then all subgroups G⩽GL(P) normalized by EU(P) are contained between EU(P,L) and GU(P,L) for unique level L. Here GU(P,L)={g∈GL(P)∣[{g,g−1},EU(P,L)]⊆EU(P,L)}.
If we are interested only in subgroups of U(P) normalized by EU(P), then these subgroups can be described in the same way if we take GU(P,L)∩U(P) instead of GU(P,L) and if we consider only the levels contained in the level EU(P). In other words, these subgroups are described by pairs L=(I,Γ), where I=\!\;\overline{\!\!\>I\vphantom{d}\!\!\>}\;\!\mathbin{\hbox to0.0pt{\displaystyle\text{\raisebox{1.0pt}{⊲}}\hss}\mspace{-5.0mu}\leqslant}C and Γ⩽Λ are such that tr(Γ)⩽I and Γ⋅C∔Λ⋅I∔φ(I)⩽Γ. Under Morita equivalence these pairs correspond to form ideals of the ring R, hence we obtain the main result of [26], though in this paper the theorem was proved under weaker assumption l⩾3. In particular, we have the sandwich classification theorem for subgroups of O(2l,K) and Sp(2l,K) normalized by corresponding elementary subgroups, where K is a commutative ring.
Now let us classify overgroups of EU(P) in GL(P). These groups are normalized by EU(P), hence for l⩾4 and quasi-finite R the main theorem can be applied. The groups are described through levels containing the level EU(P), i. e. pairs L=(I,Γ), where I=C⊕J×0⊆A=C×C and Γ⩽H=C are such that J=\!\;\overline{\!\!\>J\vphantom{d}\!\!\>}\;\!\mathbin{\hbox to0.0pt{\displaystyle\text{\raisebox{1.0pt}{⊲}}\hss}\mspace{-5.0mu}\leqslant}C, Λ∔Γ⋅C∔J⩽Γ⩽{x∈C∣x+xd∈J}. Under Morita equivalence these pairs (J,Γ) correspond to the levels from paper [23], hence we obtain the main result from this paper (though in the paper there was another assumption instead of quasi-finiteness of R, but it also was only for l⩾4). In particular, in this way we have the sandwich classification of overgroups of O(2l,K) and Sp(2l,K) in the general linear group over a commutative ring K, i e. the results from papers [37] and [38].
In the case of odd orthogonal group O(2l+1,K) the main theorem cannot be applied directly, because we have assumed nondegeneracy of the hermitian form and for odd orthogonal group the form may be degenerate if two is not invertible in K. Therefore, in order to deal with it properly, we should generalize the classification theorem for the case of P0 with degenerate form, where one cannot freely use proposition 1.
Fortunately, we can circumvent this in the classification of overgroups of EO(2l+1,K) in GL(2l+1,K). Take a nondegenerate symmetric bilinear form B(x,y)=xy on P0=K and take the corresponding quadratic form from the odd form parameter L=0. If we take Pi=K for 1⩽i⩽l, then we will obtain the quadratic space P=K2l+1, its unitary group equals U(P)=O(2l,K). Indeed, if g∈GL(P) preserves the quadratic form, then this element trivially acts on P0. If g also preserves the symmetric bilinear form, then it reduces to an orthogonal operator on P0⊥=⨁i>1H(Pi). In this case our theorem gives among other things the sandwich classification of overgroups of EO(2l,K) in GL(2l+1,K). The overgroups of EO(2l+1,K) will be obtained from the theorem, if we calculate the level of EO(2l+1,K).
For the module P=K2l+1 we have C=E=M(2l+1,K) with the involution (ai,j)i,j=−lld=(a−j,−i)i,j=−ll (it is the reflection across the antidiagonal), A=E×E, and H=e0E×E×Ee0=2k+1K×M(2l+1,K)×K2l+1, where K2l+1 is the column space and 2k+1K is the row space. Also Λi=0, i. e. Λ=φ(E)⋅(1−e0). An augmented level is a pair L=(I,Γ), where I=Id⩽A is a K-submodule and Γ⩽H are such that I2⩽I, I(1−e0)E(1−e0)⩽I, π(Γ)⩽I, tr(Γ)+π(Γ)dπ(Γ)⩽I, Γ⋅(I+K+(1−e0)E(1−e0))∔φ(I)⩽Γ, and e0I(1−e0)=π(Γ⋅(1−e0)). The group EO(2l,K) has the level L0=((1−e0)E(1−e0),Λ) and the group EO(2l+1,K) has the level L1=(I1,Γ1), where (1−e0)I1(1−e0)=(1−e0)E(1−e0) and Γ1⋅(1−e0)={(x,y,z)∈H⋅(1−e0)∣z=2xd,y=xdx}∔φ(E).
This can be used in the classification of overgroups of EO(2l+1,K) for l⩾4. Let L=(I,Γ) be an arbitraty level containing L1. It is uniquely detreminted by the groups (1−e0)I(1−e0)=(1−e0)E(1−e0)⊕0×J, e0I(1−e0)={(x,−2x)∣x∈2lK}⊕0×b2l, and Γ⋅e1, where J\mathbin{\hbox to0.0pt{\displaystyle\text{\raisebox{1.0pt}{⊲}}\hss}\mspace{-5.0mu}\leqslant}(1-e_{0})E(1-e_{0}) as a two-sided ideal is Morita-equivalent to the ideal \mathfrak{a}\mathbin{\hbox to0.0pt{\displaystyle\text{\raisebox{1.0pt}{⊲}}\hss}\mspace{-5.0mu}\leqslant}K, \mathfrak{b}\mathbin{\hbox to0.0pt{\displaystyle\text{\raisebox{1.0pt}{⊲}}\hss}\mspace{-5.0mu}\leqslant}K is also an ideal, and the group Γ⋅e1⩽H⋅e1≅K×K×K is of type {(x,x2,2x)}∔W, where 0×a×0⩽W⩽{(0,y,z)∈K×K×K∣2y∈a,z∈b} is uniquely determined. Moreover, 2a⩽b⩽a, for every z∈b there is y such that (0,y,z)∈W, and for all (0,y,z)∈W, a∈a, and k∈K the elements (0,k2y,kz) and (0,0,az) are also in W. Conversely, if a,b and W satisfy these conditions, then they can be obtained from some level containing L1. In the case 2∈K∗ we have a=b and W=0×a×a, hence we obtain the result from paper [39] (where this was proved under weaker assumption l⩾3).
Finally, we can classify the subgroups of O(2l+1,K) that contain EO(2l,K) for l⩾4. Let L=(I,Γ) be a level between L0 and L1. It is uniquely determined by the groups e0I(1−e0)⩽{(x,−2xd)∣x∈2lK} and Γ⋅e1. Here e0I(1−e0)={(x,−2x)∣x∈2la} for unique ideal \mathfrak{a}\mathbin{\hbox to0.0pt{\displaystyle\text{\raisebox{1.0pt}{⊲}}\hss}\mspace{-5.0mu}\leqslant}K and Γ⋅e1⩽H⋅e1≅K×K×K is of type {(x,x2,2x)∣x∈b} for unique ideal \mathfrak{b}\mathbin{\hbox to0.0pt{\displaystyle\text{\raisebox{1.0pt}{⊲}}\hss}\mspace{-5.0mu}\leqslant}K. The condition π(Γ⋅e1)=e0Ie1 means that a=b. Conversely, every ideal a can be obtained from unique level between L0 and L1. Therefore, the overgroups of EO(2l,K) in O(2l+1,K) are classified by the ideals of K.
Bibliography43
The reference list from the paper itself. Each links out to its DOI / PubMed record.
1[1] E. Artin. Geometric algebra . Inters. Publ., New York, 1957.
2[2] A. Bak. The stable structure of quadratic modules . Thesis Columbia Univ., 1969.
3[3] A. Bak. K 𝐾 {K} -Theory of forms . Ann. of Math. Stud. 98. Princeton Univ. Press, Princeton, 1982.
4[4] A. Bak. Nonabelian K 𝐾 {K} -theory: the nilpotent class of K 1 subscript 𝐾 1 {K}_{1} and general stability. K-Theory , 4:363–397, 1991.
5[5] A. Bak, R. Hazrat, and N. Vavilov. Localisation-completion strikes again: relative K 1 subscript 𝐾 1 {K}_{1} is nilpotent by abelian. J. Pure Appl. Algebra , 213:1075–1085, 2009.
6[6] A. Bak and R. Preusser. The E-normal structure of odd-dimensional unitary groups. J. Pure Appl. Algebra , 222(9):2823–2880, 2018.
7[7] A. Bak and N. Vavilov. Normality for elementary subgroup functors. Math. Proc. Cambridge Philos. Soc. , 118(1):35–47, 1995.
8[8] A. Bak and N. Vavilov. Structure of hyperbolic unitary groups I: elementary subgroups. Algebra Colloq. , 7(2):159–196, 2000.