Central European Journal of Mathematics

DOI: 10.2478/s11533-006-0014-9 Review article CEJM 4(3) 2006 323–357 Left-symmetric algebras, or pre-Lie algebras in ge...

93 downloads 801 Views 3MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!

Report copyright / DMCA form

DOWNLOAD PDF

DOI: 10.2478/s11533-006-0014-9 Review article CEJM 4(3) 2006 323–357

Left-symmetric algebras, or pre-Lie algebras in geometry and physics Dietrich Burde∗ Fakult¨ at f¨ ur Mathematik, Universit¨ at Wien, 1090 Wien, Austria

Received 3 January 2006; accepted 15 March 2006 Abstract: In this survey article we discuss the origin, theory and applications of left-symmetric algebras (LSAs in short) in geometry in physics. Recently Connes, Kreimer and Kontsevich have introduced LSAs in mathematical physics (QFT and renormalization theory), where the name pre-Lie algebras is used quite often. Already Cayley wrote about such algebras more than hundred years ago. Indeed, LSAs arise in many diﬀerent areas of mathematics and physics. We attempt to give a survey of the ﬁelds where LSAs play an important role. Furthermore we study the algebraic theory of LSAs such as structure theory, radical theory, cohomology theory and the classiﬁcation of simple LSAs. We also discuss applications to faithful Lie algebra representations. c Versita Warsaw and Springer-Verlag Berlin Heidelberg. All rights reserved. Keywords: Pre-Lie algebra, rooted tree, vertex algebra, operad, deformation complex, convex homogeneous cone, aﬃne manifold, faithful representation, radical, cohomology of pre-Lie algebras MSC (2000): 17-02, 22-02 53-02

1

Introduction

Left-symmetric algebras, or LSAs in short, arise in many areas of mathematics and physics. They have already been introduced by A. Cayley in 1896, in the context of rooted tree algebras, see [20]. Then they were forgotten for a long time until Vinberg [66] in 1960 (in the original Russian version) and Koszul [44] in 1961 introduced them in the context of convex homogeneous cones and aﬃnely ﬂat manifolds. From this time on many articles related to LSAs, from quite diﬀerent research areas, have been published. As a consequence, perhaps, LSAs are known under many diﬀerent names. LSAs are also called ∗

E-mail: [email protected]

324

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

Vinberg algebras, Koszul algebras or quasi-associative algebras. Right-symmetric algebras, or RSAs, are also called Gerstenhaber algebras, or pre-Lie algebras [24]. The aim of the ﬁrst section is to give a survey on the main topics involving LSAs and to describe the role of LSAs therein. The importance of LSAs for the subject may be quite diﬀerent. In the problems comming from diﬀerential geometry LSAs have been introduced in order to reformulate the geometric problem in terms of algebra. In this case the original problem is equivalent to a certain problem on LSAs. For other problems a combinatorically deﬁned product turns out to be left- or right-symmetric, but the importance of this structure is not always obvious. There exist also many attempts to provide a structure theory for ﬁnite-dimensional LSAs over the real or complex numbers. We will describe known results on the algebraic theory of LSAs and its applications in the second section. We start with some basic deﬁnitions which we already need for the ﬁrst section. Let (A, ·) be an algebra over K, not necessarily associative and not necessarily ﬁnite-dimensional. The associator (x, y, z) of three elements x, y, z ∈ A is deﬁned by (x, y, z) = (x · y) · z − x · (y · z). Definition 1.1. An algebra (A, ·) over K with a bilinear product (x, y) → x · y is called LSA, if the product is left-symmetric, i.e., if the identity (x, y, z) = (y, x, z) is satisﬁed for all x, y, z ∈ A. The algebra is called RSA, if the identity (x, y, z) = (x, z, y) is satisﬁed. The opposite algebra of an LSA is an RSA. Indeed, if x · y is the product in A, then x ◦ y = y · x is the product in Aop . An associative product is right- and leftsymmetric. Note that the converse is not true in general: the algebra A := Kx ⊕ Ky with product x.x = 0, x.y = 0, y.x = −x, y.y = x − y is an RSA and LSA, but we have (y.y).y − y.(y.y) = x. We note that LSAs and RSAs are examples of Lie-admissible algebras, i.e., the commutator [x, y] = x · y − y · x deﬁnes a Lie bracket. This follows from the identity [[a, b], c] + [[b, c], a] + [[c, a], b] = (a, b, c) + (b, c, a) + (c, a, b) − (b, a, c) − (a, c, b) − (c, b, a). valid in any K-algebra. We denote the Lie algebra by gA .

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

2

325

Origins of left-symmetric algebras

2.1 Vector ﬁelds and RSAs Let U be an associative commutative algebra, and D = {∂1 , . . . , ∂n } be a system of commuting derivations of U . If we regard the derivations in the endomorphism algebra we will require them to be linearly independent. For any u ∈ U the endomorphisms u∂i : U → U,

(u∂i )(v) = u∂i (v)

are derivations of U . Denote by U D = Vec(n) the vector space of derivations n ui ∂i | ui ∈ U, ∂i ∈ D . Vec(n) = i=1

We may consider this as a space of vector ﬁelds. We introduce on Vec(n) the following algebra product u∂i ◦ v∂j = v∂j (u) ∂i .

(1)

Proposition 2.1. The algebra (Vec(n), ◦) is an RSA. It is called right-symmetric Witt algebra generated by U and D. Proof. The associator is given by (u∂i , v∂j , w∂k ) = (u∂i ◦ v∂j ) ◦ w∂k − u∂i ◦ (v∂j ◦ w∂k ) = v∂j (u)∂i ◦ w∂k − u∂i ◦ (w∂k (v)∂j ) = w∂k (v∂j (u)) ∂i − w∂k (v)∂j (u) ∂i = w{∂k (v)∂j (u) + v∂k (∂j (u)))} ∂i − w∂k (v)∂j (u) ∂i = wv∂k (∂j (u)) ∂i . Since wv = vw in U and the elements in D commute it follows (u∂i , w∂k , v∂j ) = vw∂j (∂k (u)) ∂i = (u∂i , v∂j , w∂k ). As an example, let M n be a smooth n-dimensional ﬂat manifold, U be the algebra of smooth functions on M and Diﬀ(n) the algebra of n-dimensional diﬀerential operators α∈Zn

α

λ α uα ∂ ,

α

∂ =

n

∂iαi

i=1

where ∂i = ∂/∂xi , α = (α1 , . . . , αn ) ∈ Zn and uα ∈ U . Note that ∂i ∂j = ∂j ∂i for all i, j since M n is ﬂat. The subspace of diﬀerential operators of ﬁrst order is just Vec(n) as

326

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

above. It can be interpretated as as a space of vector ﬁelds on M . The algebra Diﬀ(n) is associative, whereas the algebra Vec(n) under the product (1) is right-symmetric but not associative. Other important special cases of U are the polynomial ring U = K[x1 , . . . , xn ] in n ±1 variables, or the Laurent polynomial algebra U = K[x±1 1 , . . . , xn ]. In the ﬁrst case U D is denoted by Wnr with Lie algebra Wn , the Witt algebra of rank n. There exists a grading and a ﬁltration of U D as an RSA and as a Lie algebra. For n = 1 the algebra W1r satisﬁes an additional identity: x ◦ (y ◦ z) = y ◦ (x ◦ z).

(2)

This means that the left multiplications in this algebra commute. RSAs satisfying this identity are called right Novikov algebras. There exists a large literature on Novikov algebras, see [3, 4, 13, 57, 58, 67] and the references given therein. For details concerning right-symmetric Witt algebras see [30] and [31].

2.2 Rooted tree algebras and RSAs Probably Caley was the ﬁrst one to consider RSAs. In his paper [20] he also described a realization of the right-symmetric Witt algebra as a rooted tree algebra. A rooted tree is a pair (T, v) where T is a non-empty ﬁnite, connected graph without loops and v is a distinguished vertex of T called the root. This root gives an orientation of the graph; edges are oriented towards the root. Denote by |T | the set of vertices of T . Now we will introduce a product on the vector space generated by rooted trees: Denote by T1 ◦v T2 the graph deﬁned by adding to the disjoint union of T1 and T2 an edge joining the root of T2 to the vertex v of T1 , and keeping the root of T1 as the root. In other words, for rooted trees (T1 , v1 ) and (T2 , v2 ) we have the rooted tree (T1 ◦v T2 , v1 ). Then we deﬁne (T1 , v1 ) ◦ (T2 , v2 ) = (T1 ◦v T2 , v1 ). v∈|T1 |

The graph (T1 , v1 ) ◦ (T2 , v2 ) is obtained by the sum over all possible graftings: add a new branch to the root of T2 and plant this graph to each node of T1 and add the resulting trees. Here is an example:

Now this product is right-symmetric. In fact, the right-symmetry for the associator of three elements (T1 , v1 ), (T2 , v2 ), (T3 , v3 ) may be seen from the fact that insertion of

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

327

graphs is a local operation, and that on both sides, the diﬀerence amounts to plugging the subgraphs T2 , T3 into T1 at disjoint places, which is evidently symmetric under the exchange of T2 and T3 . We have the following result [24]: Proposition 2.2. The free RSA on a generator {u} has the rooted trees as a basis. In [30] the algebra of labelled rooted trees is considered. Let S be a set. A labeling of T by S is a map |T | → S. Denote the set of rooted trees labelled by S by T (S). Identify two rooted labeled trees if there is an isomorphism of labelled graphs sending the root to the root. For example, the following two labelled trees belong to the same class: a

b

b

b

b

a b

b

a

a

This is written as T (a, b, T (b, a, b)) = T (a, T (b, b, a), b). Let us write a rooted tree as T (v, x1 , . . . , xn ), where xi are trees, v is the root and n is the number of incomming edges of the root. Deﬁne a non-associative and non-commutative operation on T (S) by T (v, x1 , . . . , xn ) • y = T (v, x1 , . . . , xn , y). This satisﬁes the identity (a • b) • c = (a • c) • b. Let R be a commutative ring. Deﬁne the tree algebra T (S) as the free R-module on T (S). A bilinear multiplication ◦ on T (S) is deﬁned recursively on basis elements as follows. Let v ∈ S and x1 , . . . , xn , y ∈ T (S). Then v ◦ y = v • y = T (v, y), T (v, x1 , . . . , xn ) ◦ y = T (v, x1 , . . . , xn , y) n T (v, x1 , . . . , xi , . . . , xn ) • (xi ◦ y). + i=1

It follows that (T (S), ◦) is a derivation algebra of (T (S), •): (x • y) ◦ z = (x ◦ z) • y + x • (y ◦ z). Moreover, (T (S), ◦) is right-symmetric. We have the following result [30]: Proposition 2.3. As an R-algebra, T (S) is generated by S. Let t(S) be the Lie algebra of T (S) and HS = (U (t(S))∗ be the the dual of the universal enveloping algebra of t(S). The algebra HS is a Hopf algebra. Remark 2.4. Hopf algebras and RSAs of rooted trees play an important role in renormalizable quantum ﬁeld theories. Feynman graphs form an RSA.

328

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

More precisely, for any QFT the combinatorics of Feynman graphs gives rise to an RSA of rooted trees, and hence to a Lie algebra and a Hopf algebra of rooted trees. In fact, the structure of the pertubative expansion of a QFT is in many ways determined by the Hopf and Lie algebra structures of Feynman graphs. This allows a conceptual interpretation of renormalization theory. For example, the Hopf algebra of rooted trees yields the ﬁnite renormalization needed to satisfy the requirements of quantized gauge symmetries. There is an extensive literature available, see [25, 45, 46] and the references given therein. One should note that it is possible to construct on any class C of graphs a right-symmetric product: for each pair of graphs Γ1 , Γ2 ∈ C one deﬁnes a set I(Γ1 , Γ2 ) and a map γ : I(Γ1 , Γ2 ) → C where I(Γ1 , Γ2 ) is, roughly spoken, the set of possible insertions of Γ2 in Γ1 staying in the class C, and γ realizes these insertions. Then the product γ(i) Γ1 Γ2 = i∈I(Γ1 ,Γ2 )

is right-symmetric, i.e., Γ1 (Γ2 Γ3 ) − (Γ1 Γ2 ) Γ3 is symmetric in Γ2 and Γ3 . To see this, one has to prove that this associator corresponds to the insertion of Γ2 and Γ3 at two distinct vertices of Γ1 , which is of course symmetric in Γ2 and Γ3 . This way it is possible to consider also classes of graphs with certain constraints, e.g., with renormalization conditions.

2.3 Words in two letters and RSAs The right-symmetric structure on certain graphs can also be illustrated by words on an alphabet. We want to consider the following nice construction. Let W be the vector space generated by the set of ﬁnite words on the alphabet {A, B}. Let ∅ denote the empty word. If x is such a word, then let x[i] denote the i-th letter of x. For example, if x = AB 2 AB then x[0] = ∅ and x[4] = A. Let (x) be the length of the word x. Deﬁne an algebra product on W by the formula x◦y =

(x)

ε(i)x i y

i=0

where x i y is the insertion of y ⎧ ⎪ ⎪ −1 ⎪ ⎪ ⎪ ⎨+1 ε(i) = ⎪ +1 ⎪ ⎪ ⎪ ⎪ ⎩0

between x[i] and x[i + 1] and if x[i] = A and x[i + 1] = B if x[i] = B and x[i + 1] = A or ∅ if x[i] = ∅ and x[i + 1] = A else

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

329

Note that nothing is inserted between A and ∅, or between ∅ and B. Example 2.5. Let us compute a few examples: A ◦ A = A2 A ◦ AB = ABA AB ◦ A = A2 B − A2 B + ABA = ABA ABA ◦ B = BABA BA ◦ AB = BABA AB ◦ AB = 2ABAB − A2 B 2 BA ◦ ABA = BABA2 ABA ◦ BA = BA2 BA − ABABA + AB 2 A2 The product is neither commutative nor associative. Indeed, (ABA ◦ B) ◦ BA = BABA ◦ BA = B 2 A2 BA − BABABA + BAB 2 A2 , whereas ABA ◦ (B ◦ BA) = ABA ◦ B 2 A = B 2 A2 BA − AB 2 ABA + AB 3 A2 . It follows that ABA ◦ (B ◦ BA) − (ABA ◦ B) ◦ BA = −BABABA + BAB 2 A2 + AB 2 ABA − AB 2 A2 = (ABA ◦ BA) ◦ B − ABA ◦ (BA ◦ B). Hence the associators satisfy (ABA, B, BA) = (ABA, BA, B). This is no coincidence. We have the following result: Proposition 2.6. The algebra (W, ◦) is right-symmetric, i.e., x ◦ (y ◦ z) − (x ◦ y) ◦ z = x ◦ (z ◦ y) − (x ◦ z) ◦ y for all words x, y, z in W .

2.4 Vertex algebras and LSAs Vertex algebras have been studied very intensively over the last years. There is a huge literature on this subject. We can only mention just a few classical references here: [33,

330

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

34, 41, 48]. We will try to explain what a vertex algebra is, and how it is related to LSAs. Vertex algebras were ﬁrst introduced by R. Borcherds in 1986, see [9]. The deﬁnition is given in terms of quite complicated identities, the so called Borcherds identities. In 1996 Kac [41] gave an equivalent deﬁnition of a vertex algebra as a pointed vector space (V, |0 ) together with a local state-ﬁeld correspondence Y . Here any vector space V with a ﬁxed non-zero vector |0 , referred as the vacuum vector, is called a pointed vector space. Kac also introduced conformal algebras. He proved, using ﬁeld algebras, that vertex algebras form a subclass of conformal algebras, see [5]. This allows to give a deﬁnition of vertex algebras via conformal algebras. This goes as follows: Definition 2.7. A Lie conformal algebra is a C[T ]-module V endowed with a C-linear map V ⊗ V → C[λ] ⊗ V denoted by a ⊗ b → [aλ b], called the λ-bracket, satisfying the following axioms for all a, b, c ∈ V :

[(T a)λ b] = −λ[aλ b], [aλ (T b)] = (λ + T )[aλ b], [bλ a] = [a−λ−T b], [[aλ b]λ+μ c] = [aλ [bμ c] − [bμ [aλ c]]. The ﬁrst two axioms are called sesquilinearity. Together they say that T is a derivation of the λ-bracket: T [aλ b] = [(T a)λ b] + [aλ (T b)]. The third axiom is called skewsymmetry, and the last one the Jacobi identity. Now the theorem in [5] is as follows: Theorem 2.8. Giving a vertex algebra structure on a pointed vector space (V, |0 ) is the same as providing V with the structures of a Lie C[T ]-conformal algebra and a leftsymmetric C[T ]-diﬀerential algebra with unit |0 , satisfying

a.b − b.a =

0

−T

dλ [aλ b].

[aλ (b.c)] = [aλ b].c + b.[aλ c] +

0

λ

dμ [[aλ b]μ c].

The ﬁrst axiom is called skewsymmetry and the second is called the non-commutative Wick formula. The theorem says that we can deﬁne a vertex algebra as follows: Definition 2.9. A Vertex algebra is a pair (V, |0 ), where V is a C[T ]-module and |0 is an element of V (the vacuum state), endowed with two operations: a λ-bracket V ⊗ V → C[λ] ⊗ V , a ⊗ b → [aλ b] making it a Lie conformal algebra, and a normally ordered product V ⊗ V → V , a ⊗ b → a.b, which makes it a unital diﬀerential algebra with unit |0 and derivation T . These two operations satisfy the following axioms:

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

(a.b).c − a.(b.c) = a.b − b.a =

0 0

−T

T

dλ a .[bλ c] +

T

0

331

dλ b .[aλ c],

dλ [aλ b],

[aλ (b.c)] = [aλ b].c + b.[aλ c] +

λ

0

dμ [[aλ b]μ c].

The ﬁrst axiom here is called quasi-associativity. It follows that the underlying algebra of a vertex algebra is an LSA: indeed, the right-hand side of the ﬁrst identity is symmetric with respect to a and b. Hence the product a.b is left-symmetric. More details can be found also in [60]. For any Lie conformal algebra R one can construct a so called enveloping vertex algebra U (R). Hence each example of a Lie conformal algebra produces an example of a vertex algebra. Example 2.10. The Virasoro Lie conformal algebra is given by V = C[T ]L ⊕ C |0

with λ-bracket [Lλ L] = (T + 2λ)L +

c 3 λ |0 , 12

where c ∈ C is the central charge. Remark 2.11. Finite-dimensional simple Lie conformal algebras have been classiﬁed, see [1] and the references cited therein. For inﬁnite-dimensional algebras this classiﬁcation is far from being solved.

2.5 Operad theory and RSAs Let Sn denote the symmetric group on n letters and K[Sn ] the group ring. For us an operad P is a sequence of K[Sn ]-modules (P(n))n≥1 equipped with a unit 1 ∈ P (1), together with composition products, for n, m1 , . . . , mm ∈ N, γ : P(n) ⊗ P(m1 ) ⊗ · · · ⊗ P(mn ) → P (n + m1 + · · · + mn ), satisfying natural associativity, unitarity and equivariance conditions. For details see [49]. There is a natural grading on the total space ⊕n P(n), deﬁned by m P(n) = P(m + 1). n≥1

Example 2.12. Let V be a K-vector space and deﬁne P(n) = HomK (V ⊗n , V ).

332

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

Then P = End(V ) = (P(n))n≥1 forms an operad. The Sn -actions are given by permutations of tensors on V ⊗n . The compositions are the usual ones for multilinear maps. For p ∈ P(n) and q ∈ P(m) denote p ◦i q = γ(p, id⊗i−1 , q, id⊗n−i ). Then we have the following result: Proposition 2.13. Let P be an operad of vector spaces. Then the graded vector space n≥1 P(n) forms an RSA under the product p◦q =

n

p ◦i q

i=1

where p ∈ P(n) and q ∈ P(m). Recall the notation T (n) for the free Z-module of rooted trees labelled by S = {1, . . . , n}. We can endow PR = (T (n))n≥1 with an operad structure by deﬁning compositions T ◦i S using substitutions and graftings in a certain way. For more details see [24]. On the other hand we have the quadratic binary operad PL deﬁning RSAs. One constructs this operad as follows. Let F be the free operad generated by the regular representation of S2 . A basis of F(n), as a vector space, is given by products (xi1 xi2 . . . xin ) indexed by {1, . . . , n} with arbitrary bracketing. For instance, a basis of F(2) is given by the products (x1 x2 ) and (x2 x1 ); and a basis of F(3) is given by the products ((x1 x2 )x3 ), (x1 (x2 x3 )) and all their permutations. Then PL = F/I where I denotes the ideal of F generated by the S3 -submodule of F(3) given by the relation ((x1 x2 )x3 ) − (x1 (x2 x3 )) − ((x1 x3 )x2 ) + (x1 (x3 x2 )). The following result was proved in [24]: Proposition 2.14. The operad PL deﬁning RSAs is isomorphic to the operad PR of rooted trees.

2.6 Deformation complexes of algebras and RSAs Let V be an R-module and denote by C n (V, V ) the space of all n-multilinear maps from V to V . For f ∈ C p (V, V ) and g ∈ C q (V, V ) consider the product ◦:

C p (V, V ) × C q (V, V ) → C p+q−1 (V, V ), (f, g) → f ◦ g

given by (f ◦ g)(x1 , . . . , xp+q−1 ) =

p i=1

f (x1 , . . . , xi−1 , g(xi , . . . , xi+q−1 ), xi+q , . . . , xp+q−1 ).

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

Let us denote the i-th summand by f right-symmetric:

◦i

333

g. One can show that this product is indeed

Proposition 2.15. The algebra (C • (V, V ), ◦) is an RSA. Gerstenhaber [35] already noted this fact in a graded version which arises in the Hochschild cohomology setting. Let A be an associative algebra and C n (A, A) = HomK (A⊗n , A) be the space of Hochschild n-cochains. Then the main tool in studying the deformation theory of A is the Hochschild complex 0 → C 0 (A, A) − → ··· − → C n (A, A) − → C n+1 (A, A) − → ··· d

d

d

d

denoted by C • (A, A). Gerstenhaber deﬁned a product on this complex as follows: (f ◦ g)(x1 , . . . , xp+q−1 ) =

p

(−1)(q−1)(i−1) (f

◦i

g)(x1 , . . . , xp+q−1 ).

i=1

This is a graded version of the product given above. It is also not associative in general, but satisﬁes a graded right-symmetric identity. Definition 2.16. Let V be a graded vector space and |x| denote the degree of x ∈ V . Then V together with a K-bilinear product (x, y) → x · y is called a graded RSA, if (x · y) · z − x · (y · z) = (−1)|y||z| ((x · z) · y − x · (z · y)). We have the following result, see [35, 54]: Proposition 2.17. The algebra (C • (A, A), ◦) is a graded RSA. The composition bracket x, y = x ◦ y − (−1)|x||y| y ◦ x is a graded Lie bracket, called the Gerstenhaber bracket. It is graded skew-symmetric, i.e., x, y = −(−1)|x||y| y, x, and satisﬁes the graded Jacobi identity (−1)|x||z| x, y, z + (−1)|y||x| y, z, x + (−1)|z||y| z, x, y = 0. Note that the Hochschild coboundary map d satisﬁes d(f ) = −μ, f , where μ ∈ HomK (A⊗ A, A) is the multiplication map of A.

334

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

2.7 Convex homogeneous cones and LSAs Convex homogeneous cones arose in the theory of automorphic functions on bounded homogeneous domains in Cn . If V is a convex homogeneous cone in Rn then the domain D = {x + iy | y ∈ V } ⊆ Cn is analytically equivalent to a bounded homogeneous domain. This is the so-called generalized upper half-plane, or Siegel domain of the ﬁrst kind. It is homogeneous with respect to the group of complex aﬃne transformations of D. Definition 2.18. A convex cone in Rn is a non-empty set V having the following properties: (1) if x ∈ V and λ > 0 then λx ∈ V ; (2) if x, y ∈ V then x + y ∈ V ; (3) the closure of V does not contain a subspace of positive dimension; (4) the set V is open in Rn . Condition (3) says that V does not completely contain any straight line. The subgroup G(V ) of GL(Rn ) consisting of the automorphisms A satisfying AV = V is called the automorphism group of V . A convex cone V is called homogeneous if G(V ) acts transitively on it. As an example consider the cone of positive-deﬁnite symmetric matrices in Mn (R), or the cone of positive-deﬁnite Hermitian matrices in Mn (C). Definition 2.19. A convex domain in an aﬃne space P is any nonempty open convex set U ⊂ P not completely containing any straight line. Clearly, a convex cone is a special case of a convex domain. The vertex of the cone deﬁnes an origin in the aﬃne space and converts it into a linear space. The group of aﬃne transformations leaving U invariant is denoted by G(U ). It is an algebraic group. Let g(U ) be its Lie algebra. We have G(U ) = K(U )T (U ) and K(U ) ∩ T (U ) = {e}, where K(U ) is the stability subgroup of some point x0 ∈ U and T (U ) is a maximal connected triangular subgroup of G(U ). The group T (U ) acts simply transitively on U by aﬃne transformations. Let t(U ) denote its Lie algebra. Let D ∈ t(U ), x0 ∈ U . Then the mapping D → D(x0 ) is an isomorphism of the linear space T (U ) onto the linear space RP of free vectors of P . Let Da be the inverse image of the vector a ∈ Rp under this mapping, i.e., Da (x0 ) = a. Let La denote the linear part of Da and deﬁne a bilinear product on RP by a · b = La (b). This algebra (RP , ·) is called the algebra of U with respect to the point x0 and the group T (U ). Diﬀerent choices of x0 and T (U ) would lead to isomorphic algebras, so we may speak of the algebra of U . We have the following result, see [66]: Theorem 2.20. The algebra (RP , ·) of any convex homogeneous domain is a left-symmetric

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

335

algebra over R satisfying the following properties: (1) there exists a linear form s on RP such that s(a · b) = s(b · a) and s(a · a) > 0 for each a = 0. (2) the eigenvalues of the operators La : x → a · x are real. It follows from the commutation rule of elements in g(U ) that [Da , Db ](x0 ) = La (b) − Lb (a) = a · b − b · a, [Da , Db ] = Da·b−b·a , [La , Lb ] = La·b−b·a This implies that we have (a, b, c) = (b, a, c). The linear form is given by s(a) = tr(La ). Since 0 = tr([La , Lb ]) = tr(La·b−b·a ) we have s(a · b) = s(b · a). Since the group T (U ) is triangular, the linear translations La are simultaneously reducible to triangular form and have real eigenvalues. In the special case that U is a convex homogeneous cone we obtain the following result: Corollary 2.21. If U is a convex homogeneous cone then the algebra RP has in addition a two-sided unit element, i.e., (3) there exists an element e such that e · a = a · e = a for all a ∈ RP . Vinberg [66] called LSAs satisfying the conditions (1) and (2) clans. He described how to construct a convex homogeneous domain from a clan. This leads to the following result: Theorem 2.22. There is a one-to-one correspondence of n-dimensional convex homogeneous cones and n-dimensional LSAs satisfying (1), (2), (3). There exists a certain classiﬁcation of this special class of LSAs, i.e., of clans with unity. According to Vinberg this classiﬁcation does not have the deﬁnite nature of, say, the classiﬁcation of semisimple Lie algebras. More details are to be found in [29, 66] and the references given therein.

2.8 Aﬃne manifolds and LSAs Let G be a Lie group acting smoothly and transitively on a smooth manifold X. Let U ⊂ X be an open set and let f : U → X be a smooth map. The map f is called locally–(X, G) if for each component Ui ⊂ U , there exists gi ∈ G such that the restriction of gi to Ui ⊂ X equals the restriction of f to Ui ⊂ U . Definition 2.23. Let M be a smooth manifold of the same dimension as X. An (X, G)– atlas on M is a pair (U, Φ) where U is an open covering of M and Φ = {ϕα : Uα → X}Uα ∈U is a collection of coordinate charts such that for each pair (Uα , Uβ ) ∈ U ×U the restriction

336

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

of ϕα ◦ ϕ−1 β to ϕβ (Uα ∩ Uβ ) is locally–(X, G). An (X, G)–structure on M is a maximal (X, G)–atlas and M together with an (X, G)–structure is called an (X, G)–manifold. Let Aﬀ(Rn ) be the group of aﬃne transformations which is given by ⎧⎛ ⎫ ⎞ ⎪ ⎪ ⎨ A b ⎬ ⎜ ⎟ n . (R), b ∈ R | A ∈ GL ⎝ ⎠ n ⎪ ⎪ ⎩ 0 1 ⎭ It acts on the real aﬃne space {(v, 1)t | v ∈ Rn } by ⎛ ⎞⎛ ⎞ ⎛ ⎞ ⎜ A b ⎟ ⎜v ⎟ ⎜Av + b⎟ ⎝ ⎠⎝ ⎠ = ⎝ ⎠ 0 1 1 1 Definition 2.24. Let M be an n-dimensional manifold. An (X, G)–structure on M , where X is the real n–dimensional aﬃne space, also denoted by Rn here, and G = Aﬀ(Rn ) is called an aﬃne structure on M and M is called an aﬃne manifold. Aﬃne structures on a smooth manifold M are in correspondence with a certain class of connections on the tangent bundle of M . The following result can be found in [43]: Proposition 2.25. There is a bijective correspondence of aﬃne structures on a manifold M and ﬂat torsionfree aﬃne connections ∇ on M . Let X denote the Lie algebra of all diﬀerentiable vector ﬁelds on M . The aﬃne connection ∇ is called torsionfree, or symmetric if ∇X (Y ) − ∇Y (X) − [X, Y ] = 0,

(3)

and ﬂat or of curvature zero, if ∇X ∇Y − ∇Y ∇X − ∇[X,Y ] = 0.

(4)

Such a connection determines a covariant diﬀerentiation ∇X : X → X, ∇X : Y → ∇X (Y ) for vector ﬁelds X, Y ∈ X. If we put X · Y = ∇X (Y ) then we obtain an R-bilinear product on X. The vanishing of curvature and torsion, i.e. (3) and (4) is equivalent to the following identities: [X, Y ] = X · Y − Y · X [X, Y ] · Z = X · (Y · Z) − Y · (X · Z)

(5) (6)

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

337

Thus the given product makes X into an LSA. When do aﬃne structures exist on a manifold M ? A ﬂat Euclidean structure on a manifold automatically gives an aﬃne structure. It is well known that the torus and the Klein bottle are the only compact two-dimensional manifolds that can be given Euclidean structures [65]. Let M be a closed 2–manifold, i.e., compact and without boundary. If M is a 2–torus, then there exist many aﬃne structures, among them non-Euclidean ones. A classiﬁcation of all aﬃne structures on the 2–torus is given in [47, 53]. If M is a closed 2– manifold diﬀerent from a 2–torus or the Klein bottle, then there exist no aﬃne structures. This follows from Benzecri’s result [8] of 1955: Theorem 2.26. A closed surface admits aﬃne structures if and only if its Euler characteristic vanishes. In higher dimensions there is no such criterion for the existence of an aﬃne structure. However, Smillie [64] proved that a closed manifold does not admit an aﬃne structure if its fundamental group is built up out of ﬁnite groups by taking free products, direct products and ﬁnite extensions. In particular, a connected sum of closed manifolds with ﬁnite fundamental groups admits no aﬃne structure. It is also known [21] that certain Seifert ﬁber spaces admit no aﬃne structure. Let M be a Seifert ﬁber space with vanishing ﬁrst Betti number. Then M does not admit any aﬃne structure.

2.9 Left-invariant aﬃne structures on Lie groups and LSAs Let G be a connected and simply connected Lie group, with Lie algebra g. Definition 2.27. An aﬃne structure on G is called left-invariant, if each left-multiplication map L(g) : G → G is an aﬃne diﬀeomorphism. If Γ is a discrete subgroup of G then the coset space G/Γ inherits an an aﬃne structure from G by the left-invariance of the structure. Many examples of aﬃne manifolds can be constructed via left-invariant aﬃne structures on Lie groups. Remark 2.28. It is well known that G admits a complete left-invariant aﬃne structure if and only if G acts simply transitively by aﬃne tranformations on Rn . Auslander proved that in this case G is solvable [2]. Milnor had posed in connection with Auslander’s conjecture on aﬃne crystallographic groups, the following question in [50]: Milnor’s Question 2.29. Does every solvable Lie group G admit a complete left–inva operate simply riant aﬃne structure, or equivalently, does the universal covering group G transitively by aﬃne transformations of Rk ?

338

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

It is possible to formulate Milnor’s problem in purely algebraic terms. Definition 2.30. An aﬃne, or left-symmetric structure on a Lie algebra g is a K–bilinear product g × g → g which is left-symmetric and satisﬁes [x, y] = x · y − y · x.

(7)

Denote the left-multiplication in the LSA by L(x)y = x·y, and the right multiplication by R(x)y = y · x. Proposition 2.31. There are canonical one-to-one correspondences between the following classes of objects, up to suitable equivalence: (a) {Left-invariant aﬃne structures on G} (b) {Aﬃne structures on the Lie algebra g} Under the bijection, bi-invariant aﬃne structures correspond to associative LSA–structures. Proof. The details of the correspondence are given in [11] and [27]. Suppose G admits a left-invariant aﬃne structure. Then there exists a left-invariant ﬂat torsionfree aﬃne connection ∇ on G. Since ∇ is left-invariant, for any two left-invariant vector ﬁelds X, Y ∈ g, the covariant derivative ∇X (Y ) ∈ g is also left-invariant. Hence covariant diﬀerentiation deﬁnes a bilinear multiplication on g : g × g → g, (X, Y ) → XY = ∇X (Y ). The conditions that ∇ has zero torsion and zero curvature amounts as before to XY − Y X = [X, Y ], X(Y Z) − Y (XZ) = [X, Y ]Z = (XY )Z − (Y X)Z. This multiplication is an aﬃne structure on g by deﬁnition.

Hence the algebraic version of Milnor’s question is given as folllows: Milnor’s Question 2.32. Does every solvable Lie algebra admit a complete aﬃne structure ? Milnor’s question has a very remarkable history. When he asked this question in 1977, there was some evidence for the existence of such structures. After that many articles appeared proving some special cases, see for example [2, 42, 61]. However, the general question was still open and it was rather a conjecture than a question by the time. Many mathematicians believed that Milnor’s question should have a positive answer. In fact, around 1990 there appeared articles in the literature which claimed to prove the conjecture, e.g., [10] and [56]. However, in 1992 Yves Benoist constructed a counterexample in dimension 11 consisting of a ﬁliform nilpotent Lie group without any left-invariant aﬃne

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

339

structure. Shortly after this counterexample of Benoist we were able to give a shorter proof and to produce a whole family of counterexamples [11, 14, 16] for the dimensions 10 ≤ n ≤ 13, out of which Benoist’s example ermerges as just one in a series: Theorem 2.33. There are ﬁliform nilpotent Lie groups of dimension 10 ≤ n ≤ 13 which do not admit any left-invariant aﬃne structure. Any ﬁliform nilpotent Lie group of dimension n ≤ 9 admits a left-invariant aﬃne structure. For the proof see [16]. An important role is played by the following observation, see Proposition 3.8: if g admits an aﬃne structure then g possesses a faithful Lie algebra module of dimension dim g + 1. Remark 2.34. It seems that there exist counterexamples in all dimensions n ≥ 10. This is not proved yet. Moreover no good criteria are known to decide the existence question for a given Lie group. We have suggested in [15] that the existence of aﬃne structures on g in some cases depends on the cohomology group H 2 (g, K).

3

Algebraic theory of LSAs

3.1 Faithful representations and aﬃne structures Let A be a left-symmetric algebra over a ﬁeld K of characteristic zero with underlying Lie algebra g. By deﬁnition the product x · y in A satisﬁes the two conditions x · (y · z) − (x · y) · z = y · (x · z) − (y · x) · z [x, y] = x · y − y · x for all x, y, z ∈ A. The left-multiplication L in A is given by L(x)(y) = x · y. The two conditions are equivalent to L : g → gl(g) is a Lie algebra homomorphism 1

1 : g → gL is a 1–cocycle in Z (g, gL )

(8) (9)

where gL denotes the g–module with action given by L, and 1 is the identity map. Z 1 (g, gL ) is the space of 1–cocycles with respect to gL . Note that the right-multiplication R is in general not a Lie algebra representation of g. Recall that, for a g-module M , the space of 1-cocycles and the space of 1-coboundaries is given by Z 1 (g, M ) = {ω ∈ Hom(g, M ) | ω([x, y]) = x • ω(y) − y • ω(x)} , B 1 (g, M ) = {ω ∈ Hom(g, M ) | ω(x) = x • m for some m ∈ M }. Let g be of dimension n and identify g with K n by choosing a K–basis. Then gl(g) gets identiﬁed with gln (K).

340

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

Definition 3.1. The Lie algebra of the Lie group Aﬀ(G) is called the Lie algebra of aﬃne transformations and is denoted by aff(g). It can be identiﬁed as a vector space with gln (K) ⊕ K n . Given an aﬃne structure on g, deﬁne a map α : g → aff(K n ) by α(x) = (L(x), x). That is a Lie algebra homomorphism: Lemma 3.2. The linear map L : g → gl(g) satisﬁes (8) and (9) if and only if α : g → aff(K n ) is a Lie algebra homomorphism. Proof. More generally, let α(x) = (L(x), t(x)) ∈ gln (K) ⊕ K n with a bijective linear map t : g → g. We have L([x, y]) = [L(x), L(y)] α([x, y]) = [α(x), α(y)] ⇐⇒ (10) L(x)(t(y)) − L(y)(t(x)) = t([x, y]) To see this, use the identiﬁcation of α(x) with ⎛ ⎞ ⎜L(x) t(x)⎟ α(x) = ⎝ ⎠. 0 0 Hence the Lie bracket in aff(K n ) is given by [α(x), α(y)] = [(L(x), t(x)), (L(y), t(y))] = (L(x)L(y) − L(y)L(x), L(x)(t(y)) − L(y)(t(x)). It follows that α is a Lie algebra homomorphism if and only if L is and t is a bijective 1–cocycle in Z 1 (g, gL ). The lemma follows with t = 1, the identity map on g. What can we say about the existence of aﬃne structures on Lie algebras ? Proposition 3.3. A ﬁnite-dimensional Lie algebra g admits an aﬃne structure if and only if there is a g–module M of dimension dim g such that the vector space Z 1 (g, M ) contains a nonsingular 1–cocycle. Proof. Let ϕ ∈ Z 1 (g, M ) be a nonsingular 1-cocycle with inverse transformation ϕ−1 . The module M corresponds to a linear representation θ : g → gl(g). Then L(x) := ϕ−1 ◦ θ(x) ◦ ϕ deﬁnes a g–module N such that ϕ−1 ◦ ϕ = 1 ∈ Z 1 (g, N ). It follows that L : g → gl(g) is a Lie algebra representation and 1([x, y]) = 1(x)y − 1(y)x is a bijective 1–cocycle in Z 1 (g, gL ). Hence L(x)y = x · y deﬁnes a left-symmetric structure on g. Conversely, 1 is a nonsingular 1–cocycle if g admits a left-symmetric structure.

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

341

Corollary 3.4. If the Lie algebra g admits a nonsingular derivation, then there exists an aﬃne structure on g. Proof. Let D be a nonsingular derivation and g the adjoint module of g. Since Z 1 (g, g) equals the space Der(g) of derivations of g, D is a nonsingular 1-cocycle. Corollary 3.5. If the Lie algebra g is graded by positive integers, then there exists an aﬃne structure on g. Proof. Suppose that g = ⊕i∈N gi is a graduation, i.e., [gi , gj ] ⊆ gi+j . Then there is a nonsingular derivation deﬁned by D(xi ) = ixi for xi ∈ gi . Corollary 3.6. Let g be a 2-step nilpotent Lie algebra or a nilpotent Lie algebra of dimension n ≤ 6. Then g admits an aﬃne structure. Proof. It is well known that in both cases g can be graded by positive integers.

The existence of a nonsingular derivation is a strong condition on the Lie algebra. In fact, such a Lie algebra is necessarily nilpotent [39]. But not every nilpotent Lie algebra admits a nonsingular derivation, see [28]. The class of characteristically nilpotent Lie algebras consists of nilpotent Lie algebras possessing only nilpotent derivations. The example of a characteristically nilpotent Lie algebra, given in [28], is 3-step nilpotent. Although there is no nonsingular derivation, there exists an aﬃne structure. That follows from a theorem of Scheuneman [61]: Proposition 3.7. Let g be a 3-step nilpotent Lie algebra. Then g admits an aﬃne structure. For a new proof see [13]. There have been attempts to generalize this result to 4-step nilpotent Lie algebras. However, only in special cases a positive result could be proved, see [13, 26]. The general case is still open. An aﬃne structure on a Lie algebra implies the existence of a faithful representation of relatively small degree: Proposition 3.8. Let g be an n-dimensional Lie algebra over K. If g admits an aﬃne structure then g possesses a faithful Lie algebra module of dimension n + 1. Proof. For any g-module V and any ω ∈ Z 1 (g, V ) we can deﬁne the g-module Vω := K × V by the action

x ◦ (t, v) = (0, x.v + tω(x))

342

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

where x ∈ g, t ∈ K and v ∈ V . It is easy to see that x ◦ (y ◦ (t, v)) − y ◦ (x ◦ (t, v)) = [x, y] ◦ (t, v). We obtain a g-module of dimension dim V +1 which is faithful if dim V = n and det ω = 0. Hence if we just take V = gL and ω = 1, then 1 ∈ Z 1 (g, gL ) because g admits a LSAstructure. It follows that Vω is a faithful g-module of dimension n + 1. This proposition suggests a review Ado’s Theorem, which states that any ﬁnitedimensional Lie algebra has a faithful ﬁnite-dimensional representation:

3.2 A reﬁnement of Ado’s theorem Definition 3.9. Let g be an n-dimensional Lie algebra over a ﬁeld K of characteristic zero. Deﬁne an invariant of g by μ(g) := min{dimK M | M is a faithful g–module}. We consider K as given by g, so that we need not refer to K in the notation μ(g). By Ado’s theorem, μ(g) is ﬁnite. What can we say about the size of this integer-valued invariant ? Following the details of the proof in Ado’s theorem one obtains an exponential bound on μ(g), given by Reed [59]: Proposition 3.10. Let g be a solvable Lie algebra of dimension n over an algebraically closed ﬁeld of characteristic zero. Then μ(g) ≤ nn + n + 1. For semisimple Lie algebras we have μ(g) ≤ n: Lemma 3.11. Let dim g = n. If g has trivial center then μ(g) ≤ n. If g admits an aﬃne structure then μ(g) ≤ n + 1. Proof. The adjoint representation ad : g → gln (K) is faithful if and only if ker ad = Z(g) = 0. This yields a faithful g-module of dimension n. The second claim follows from Proposition 3.8. For nilpotent Lie algebras the adjoint representation is not faithful. On the other hand we know that all nilpotent Lie algebras g of class 2 and 3 admit an aﬃne structure, so that μ(g) ≤ n + 1. The following general bound for nilpotent Lie algebras has been given by Reed in 1968, see [59]: Proposition 3.12. Let g be a nilpotent Lie algebra of dimension n and nilpotency class k. Then μ(g) ≤ nk + 1. This bound is not very good. For ﬁliform nilpotent Lie algebras we have k = n − 1

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

343

and hence μ(g) ≤ nn−1 + 1. We have proved the following bound in 1997, see [17], which is always better, for all n ≥ 2 and all 2 ≤ k ≤ n: Theorem 3.13. Let g be a nilpotent Lie algebra of dimension n and nilpotency class k. Denote by p(j) the number of partitions of j and let

k n−j p(j). p(n, k) = k−j j=0 Then μ(g) ≤ p(n, k). Independently de Graaf [37] proved the following bound, which is better than Reed’s bound but worse than ours: Theorem 3.14. Let g be a nilpotent Lie algebra of dimension n and nilpotency class k. Then μ(g) ≤ n+k . k For ﬁxed k, i.e., for Lie algebras of constant nilpotency class k these bounds are polynomial in n. For k = 1, . . . , 5 we have p(n, 1) = n + 1 1 p(n, 2) = (n2 + n + 2) 2 1 p(n, 3) = (n3 + 5n) 6 1 p(n, 4) = (n4 − 2n3 + 11n2 − 10n + 24) 24 1 p(n, 5) = (n5 − 5n4 + 25n3 − 55n2 + 154n − 240) 120 On the other hand we have, for b(n, k) =

n+k k

,

b(n, 1) = n + 1 1 b(n, 2) = (n2 + 3n + 2) 2 1 b(n, 3) = (n3 + 6n2 + 11n + 6) 6 1 b(n, 4) = (n4 + 10n3 + 35n2 + 50n + 24) 24 1 b(n, 5) = (n5 + 15n4 + 85n3 + 225n2 + 274n + 120) 120

344

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

Note that the p(n, k) satisfy the following recursion p(n + 1, k) = p(n, k) + p(n, k − 1),

1≤k≤n

where we set p(n, 0) = 1. Indeed, we have

k n−j

k−1 n−j p(n, k) + p(n, k − 1) = p(j) + p(j) k−j k−j−1 j=0 j=0

k−1 n−j n−j n−k = + p(j) + p(k) k−j k−j−1 0 j=0

k−1 n+1−j = p(j) + p(k) k − j j=0 = p(n + 1, k). The numbers b(n, k) satisfy b(n, 1) < b(n, 2) < . . . < b(n, n). The behaviour of the numbers p(n, k) is quite diﬀerent. We have proved the following in [18]: Theorem 3.15. The function p(n, k) is unimodal for ﬁxed n ≥ 4. More precisely we have with k(n) = n+3 2 p(n, 1) < p(n, 2) < · · · < p(n, k(n) − 1) < p(n, k(n)), p(n, k(n)) > p(n, k(n) + 1) > · · · > p(n, n − 1) > p(n, n). ∞ j=1 (1

− q j )−1 . For 2 ≤ k ≤ n − 1 it holds n p(n, k) < F ( nk ). k

Lemma 3.16. Let F (q) =

Proof. Denote by pk (j) the number of those partitions of j in which each term in the partition does not exceed k. We have k

j

p(j)q <

j=0

∞

j

pk (j)q =

j=0

k

1 1 − qj j=1

for |q| < 1. Hence p(n, k) =

k n−j j=0

k−j

p(j) <

k n j=0

k n 1 q p(j) < k k j=1 1 − q j j

with q = k/n. By estimating p(n, k(n)) we obtain the following result:

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

Theorem 3.17. Let α =

113 . 40

345

Then

α p(n, k) < √ 2n for ﬁxed n ≥ 1 and all 1 ≤ k ≤ n. n If k is depending on n, then the general bounds for μ(g) are exponential in n. In this case it is harder to compare the bounds since we may have to consider how k depends on n. For ﬁliform Lie algebras this dependence is easy: k = n − 1. In that case our estimate for μ(g) can be improved. In fact it holds μ(g) ≤ 1 + p(n − 2, n − 2) which was the motivation to prove the following propositions: Proposition 3.18. Let α =

!

2/3π. Then

p(n − 1, n − 1) < eα Proposition 3.19. Let α =

!

√

n

for all n ≥ 1.

n

for all n ≥ 1.

2/3π. Then

p(n, n − 1) <

√

√

neα

We obtain the following corollary: Corollary 3.20. Let g be a ﬁliform nilpotent Lie algebra of dimension n and α = Then √ μ(g) < 1 + eα n−1 .

!

2/3π.

Example 3.21. Let g = span{x1 , . . . , x6 } with Lie brackets deﬁned by [x1 , xi ] = xi+1 ,

2≤i≤5

Then g is a 6-dimensional Lie algebra of nilpotency class 5. For n = 4 and k = 5 the and p(n, k) are 7777, 462 and 45 respectively. However the true values of nk + 1, n+k k size is known to be μ(g) = 6. In some cases we can determine μ(g) by an explicit formula in the dimension of g. The ﬁrst case is that g is abelian. Then g is a vector space and any faithful representation ϕ : g → gl(V ), where V is a d–dimensional vector space, turns ϕ(g) into an n–dimensional commutative subalgebra of the matrix algebra Md (K). There is an upper bound of n in terms of d. Since ϕ is a monomorphism, n ≤ d2 . A sharp bound was proved by Schur [62] over C and by Jacobson [40] over any ﬁeld K: Proposition 3.22. Let M be a commutative subalgebra of Md (K) over an arbitrary ﬁeld K. Then dim M ≤ [d2 /4] + 1, where [x] denotes the integral part of x. This bound is sharp. Denote by x the ceiling of x, i.e., the least integer greater or equal than x.

346

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

Proposition 3.23. Let g be an abelian Lie algebra of dimension n over an arbitrary ﬁeld √ K. Then μ(g) = 2 n − 1. Proof. By Proposition 3.22, a faithful g–module has dimension d with n ≤ [d2 /4] + 1. √ This implies d ≥ 2 n − 1 . It is easy to construct commutative subalgebras of Md (K) √ of dimension exactly equal to [d2 /4] + 1. Hence μ(g) = 2 n − 1 . Definition 3.24. Let hm (K) be a (2m + 1)–dimensional vector space over K with basis (x1 , . . . , xm , y1 , . . . , ym , z). Denote by hm (K) the 2–step nilpotent Lie algebra deﬁned by the brackets [xi , yi ] = z for i = 1, . . . , m. It is called Heisenberg Lie algebra of dimension 2m + 1. We have proved [17]: Proposition 3.25. The Heisenberg Lie algebras satisfy μ(hm (K)) = m + 2. Proposition 3.26. Let g be a 2–step nilpotent Lie algebra of dimension n with 1– dimensional center. Then n is odd and μ(g) = (n + 3)/2. Proof. The commutator subalgebra [g, g] ⊆ z(g) is 1–dimensional. Hence the Lie algebra structure on g is deﬁned by a skew-symmetric bilinear form V ∧ V → K where V is the subspace of g complementary to K = [g, g]. It follows from the classiﬁcation of such forms that g is isomorphic to the Heisenberg Lie algebra hm (K) with n = 2m + 1. It follows μ(g) = m + 2 = (n + 3)/2. Another important result about μ(g) concerns the lower bounds for μ(g). Recall that any solvable Lie algebra g of dimension n satisfying μ(g) ≥ n+2 will be a counterexample to the Milnor conjecture, because of proposition 3.8. Unfortunately it turns out that it is non-trivial to ﬁnd such Lie algebras. It was known that ﬁliform Lie algebras may be good candidates [7]: Theorem 3.27. Let g be a ﬁliform Lie algebra of dimension n ≥ 3. Then μ(g) ≥ n. Studying these algebras in low dimensions yields [16]: Proposition 3.28. Let g be a ﬁliform Lie algebra of dimension n ≤ 9. Then μ(g) = n. Our main result in chapter 5 of [16] is: Theorem 3.29. There are families of ﬁliform Lie algebras g of dimension 10 ≤ n ≤ 13 such that μ(g) ≥ n + 2. Hence these Lie algebras do not admit any aﬃne structure.

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

347

3.3 The radical of an LSA If G is a connected and simply connected Lie group acting simply transitively as aﬃne transformations on Rn then G admits a complete left-invariant aﬃne structure. This means that the associated locally ﬂat connection Δ on G is complete. As a consequence the Lie algebra g of G is solvable [2] and the left-symmetric structure on g is complete. Definition 3.30. The LSA A is complete if for every a ∈ A the linear transformation 1A + R(a) : A → A is bijective. We have the following result, see [63]: Theorem 3.31. Let A be a ﬁnite-dimensional LSA over a ﬁeld K of characteristic zero. Then the following conditions are equivalent: (1) A is complete. (2) A is right nil, i.e., R(x) is a nilpotent linear transformation, for all x ∈ A. (3) R(x) has no eigenvalue in K \ {0}, for all x ∈ A. (4) tr(R(x)) = 0 for all x ∈ A. (5) Id + R(x) is bijective for all x ∈ A. The following deﬁnition is due to Koszul, see [38]: Definition 3.32. Let A be an LSA and T (A) = {x ∈ A | tr R(x) = 0}. The largest left ideal of A contained in T (A) is called the radical of A and is denoted by rad(A). Note that A is complete if and only if A = rad(A). It is not clear whether this is a good deﬁnition of the radical of an LSA. Usually the radical should be a 2-sided ideal in the algebra. Helmstetter [38] has constructed an LSA B where rad(B) is not a 2-sided ideal in general. Let (A, ·) be an LSA and set B = End(A) ⊕ A We may equip this vector space with a left-symmetric product by (f, a).(g, b) = (f g + [L(a), g], a · b + f (b) + g(a)) for a, b ∈ A and f, g ∈ End(A). Proposition 3.33. The algebra B is an LSA. If A is not complete then rad(B) = 0. If A is complete and the product in A is not identically zero then rad(B) is not a 2-sided ideal in A. However Mizuhara published results in [51],[52] claiming that rad(A) is in fact a 2sided ideal in A if the associated Lie algebra gA is solvable or nilpotent (over the complex numbers). We have a counterexample for a 4-dimensional LSA with solvable Lie algebra.

348

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

Example 3.34. Deﬁne a 4-dimensional left-symmetric algebra A by the following product: e1 · e3

=

e1 · e4

= −e4

e3

e2 · e2

=

2e2

e3 · e4

= e2

e2 · e3

=

e3

e4 · e3

= e2

e2 · e4

=

e4

and the other products equal to zero. Then rad(A) = span{e1 }. This is not a right ideal in A. The right multiplications are given by ⎛ 0 ⎜ ⎜ ⎜ 0 ⎜ R(e1 ) = ⎜ ⎜ ⎜ 0 ⎝ 0

⎞ 0

0

0

0

0

0

0

0

0

⎟ ⎟ 0 ⎟ ⎟ ⎟, ⎟ 0 ⎟ ⎠ 0

⎛ 0 ⎜ ⎜ ⎜ 0 ⎜ R(e3 ) = ⎜ ⎜ ⎜ 1 ⎝ 0

⎞ 0

0

0

0

1

0

0

0

0

⎟ ⎟ 1 ⎟ ⎟ ⎟, ⎟ 0 ⎟ ⎠ 0

⎛ 0 ⎜ ⎜ ⎜ 0 ⎜ R(e1 ) = ⎜ ⎜ ⎜ 0 ⎝ 0

⎞ 0

0

2

0

0

0

0

0

0

⎟ ⎟ 0 ⎟ ⎟ ⎟, ⎟ 0 ⎟ ⎠ 0

⎛ 0 ⎜ ⎜ ⎜ 0 ⎜ R(e4 ) = ⎜ ⎜ ⎜ 0 ⎝ −1

⎞ 0

0

0

1

0

0

1

0

0

⎟ ⎟ 0 ⎟ ⎟ ⎟. ⎟ 0 ⎟ ⎠ 0

We see that T (A) = ker tr R = span{e1 , e3 , e4 }. The largest left ideal in T (A) is given by span{e1 }. The solvable, non-nilpotent Lie algebra is given by [e1 , e3 ] = e3 ,

[e2 , e3 ] = e3 ,

[e1 , e4 ] = −e4 ,

[e2 , e4 ] = e4 .

Remark 3.35. The above counterexample can be generalized to all dimensions n ≥ 4. We do not know of a counterexample for an LSA A if the Lie algebra gA is nilpotent. In [51] it is claimed that rad(A) is a 2-sided ideal in A containing [A, A] if gA is nilpotent over the real numbers. There are several other possibilities for radicals of an LSA. Definition 3.36. Let A be an arbitrary ﬁnite-dimensional algebra and I an ideal in A. Deﬁne sets I (k) inductively by I (0) = I and I (i+1) = I (i) I (i) . Denote by k I the linear

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

349

span of all elements L(a1 )L(a2 ) · · · L(ak−1 )ak for all a1 , . . . , ak ∈ I. An ideal I is called solvable, if I (k) = 0 for some k ≥ 0. It is called left-nilpotent if k I = 0 for some k ≥ 1. Note that any left-nilpotent ideal is solvable. If I and J are solvable ideals in A then I + J is again a solvable ideal in A. Hence there exists a unique maximal solvable ideal of A. In particular the following deﬁnition makes sense. Definition 3.37. Let A be a ﬁnite-dimensional LSA. Then the solvable radical sol(A) of A is the unique maximal solvable ideal of A. Unlike the solvable case there is in general no guarantee for the existence of a unique maximal left-nilpotent ideal in A. For LSAs however we have the following result [22]: Lemma 3.38. Let A be a ﬁnite-dimensional LSA. If I and J are left-nilpotent ideals of A, then so is I + J. Corollary 3.39. If A is a ﬁnite-dimensional LSA, then A has a unique maximal leftnilpotent ideal nil(A) containing all left-nilpotent ideals of A. It is called the left-nilpotent radical of A and satisﬁes nil(A) ⊆ sol(A). The last claim follows from the fact that left-nilpotent ideals in A are solvable. Let us consider now the symmetric bilinear form s on A deﬁned by s(x, y) = tr R(x)R(y). Its radical is given by A⊥ = {a ∈ A | s(a, b) = 0 ∀ b ∈ A}. Unfortunately, this need not be an ideal for an LSA A. Also it need not coincide with the Koszul radical rad(A) of A. But we have the following result [22]: Theorem 3.40. Let A be a ﬁnite-dimensional LSA over R. Then we have the relations nil(A) ⊆ rad(A) ⊆ A⊥ ⊆ T (A). Corollary 3.41. The LSAs nil(A), rad(A) and A⊥ are complete, and rad(A) is the maximal complete left ideal of A. Example 3.42. Let A be the LSA of example (3.34). Then nil(A) = 0, rad(A) = A⊥ = span{e1 } and T (A) = span{e1 , e3 , e4 }. Indeed, since rad(A) is 1-dimensional and not an ideal in A, the ideal nil(A) must be zero. The diﬀerent radicals of A need not be equal in general. However the following result is known [42], [38]:

350

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

Lemma 3.43. Let A be a ﬁnite-dimensional LSA over R. Then the following conditions are equivalent: (1) A is left-nilpotent. (2) A is complete and gA is nilpotent. (3) L(x) is a nilpotent transformation, for all x ∈ A. Suppose that the Lie algebra gA is nilpotent. Since rad(A) is complete the Lemma implies that rad(A) is left-nilpotent. If we believe that rad(A) is an ideal in this case, it follows that rad(A) ⊆ nil(A) and hence rad(A) = nil(A). More generally the following result is proved in [23]: Theorem 3.44. Let A be a ﬁnite-dimensional LSA over R or C. Let S = {a ∈ A | R(a) is nilpotent }. If gA is nilpotent then nil(A) = rad(A) = A⊥ = S.

3.4 Simple LSAs Let A be an LSA over K of dimension n ≥ 2 and assume that the product is non-trivial. Denote by gA the Lie algebra of A. Definition 3.45. The algebra A is called simple if every two-sided ideal in A is equal to A or equal to 0. Recall that the map L : g → gl(A) with x → L(x) is a Lie algebra representation, i.e., L([x, y]) = [L(x), L(y)]. It is easy to see that ker(L) is a two-sided ideal in A. If A is simple then ker(L) = 0, since we assume that the product of A is non-trivial. This yields the following result. Lemma 3.46. Let A be a simple, non-trivial LSA of dimension n. Then we have μ(gA ) ≤ n for the Lie algebra gA of A. Indeed, the left multiplication L is a faithful representation of dimension n since ker(L) is zero. Of course we have many examples of simple LSAs. Just consider simple associative algebras. The following lemma yields also diﬀerent examples of simple LSAs: Lemma 3.47. Let A be an LSA with reductive Lie algebra of 1-dimensional center. Then A is simple. Proof. Let g = gA = s ⊕ z be the Lie algebra with center z K. Suppose I is a proper two–sided ideal in A. Then it is also a proper Lie ideal in g and both I and g/I inherit an LSA–structure from A. Since a semisimple Lie algebra does not admit any LSA-structures, we conclude that I must be equal to s1 ⊕ K, where s1 is a semisimple

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

351

ideal of s. Hence g/I is semisimple and admits an LSA–structure. This is a contradiction. Now there exist inﬁnitely many non-isomorphic LSA-structures on gl(n, K), which have been classiﬁed in [6, 19]. They are simple as LSAs, not necessarily associative, and they all arise by deformations of the associative matrix algebra structure. The question is whether all simple LSAs must have a reductive Lie algebra. This is not the case: Example 3.48. Deﬁne an n-dimensional LSA A with basis (e1 , . . . en ) by the following product: e1 · e1

=

2e1

e1 · ej

=

ej

ej · ej

= e1 ,

j = 2, . . . , n

and the other products equal to zero. Then A is a simple, incomplete LSA with two-step solvable Lie algebra. Let I be a non-zero ideal in A and x ∈ I. Then ej · x is a multiple of e1 for each j ≥ 2. It follows that e1 ∈ I and hence I = A. Hence A is simple. It is not complete since tr R(e1 ) = 2. The Lie algebra gA is two-step solvable with brackets [e1 , ej ] = ej for j ≥ 2. What can we say about the Lie algebra of a simple LSA ? Lemma 3.49. Let A be an LSA with Lie algebra gA . Then gA is abelian if and only if A is associative and commutative. Proof. If A is commutative then g is abelian by deﬁnition. Assume that g is abelian. Then x.y = y.x for all x, y ∈ A and using left–symmetry, 0 = [xz].y = x.(z.y) − z.(x.y) = x.(y.z) − (x.y).z = (x, y, z). In particular the Lie algebra of a simple LSA cannot be abelian since A is not onedimensional. This result can be generalized as follows [12]: Proposition 3.50. If A is a simple LSA then gA cannot be nilpotent. The classiﬁcation of simple LSAs is only known in low dimensions. Up to LSAisomorphism there is only one 2-dimensional simple complex LSA. It is given by A = Cx ⊕ Cy with product x.x = 2x, x.y = y, y.x = 0, y.y = x. In dimension 3 the classiﬁcation is as follows, see [12]: Proposition 3.51. Let A be a simple 3-dimensional LSA over C. Then its Lie algebra

352

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

g is isomorphic to r3,λ =< e1 , e2 , e3 | [e1 , e2 ] = e2 , [e1 , e3 ] = λe3 > with |λ| ≤ 1, λ = 0 and A is isomorphic to exactly one of the following algebras A1,λ and A2 (if |λ| = 1 then let λ = eiθ with θ ∈ [0, π]): e1 · e1

=

(λ + 1)e1

e1 · e3

= λe3

e1 · e2

=

e2

e2 · e3

=

e3 · e2

= e1

e1

and e1 · e1

=

3 e 2 1

e1 · e3

=

1 e 2 3

e3 · e2

=

e1 · e2

=

e2

e2 · e3

=

e1

e3 · e3

= −e2

e1

Corollary 3.52. Let A be a complete simple LSA of dimension 3 over C. Then A is isomorphic to A1,−1 with Lie algebra r3,−1 (C). The classiﬁcation of simple LSAs in dimension 4 is quite complicated. It is much easier to consider the complete ones here: any 4-dimensional complete simple LSA over C is isomorphic to the following LSA, see [12]: e1 · e2

= e4

e3 · e2

=

e1

e2 · e1

= e4

e4 · e1

=

e1

e2 · e3

= e4

e4 · e2

= −e2

e4 · e3

=

2e3

It is possible to associate certain weights and graphs for so called “special” complete LSAs, see [12],[36]. The above algebra has weights Λ = {−1, 0, 1, 2}, and the graph is given by

-1

0

1

2

This gives some idea how to classify special complete simple LSAs in general.

3.5 Cohomology of LSAs Let A be an LSA and denote by C n (A, A) = {f : A × · · · × A → A | f is multilinear} be the space of n-cochains, where A is the regular module for A. Deﬁne the coboundary operator δ n : C n (A, A) → C n+1 (A, A) by

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

n

(δ f )(x1 , . . . , xn+1 ) =

n

353

(−1)i+1 xi .f (x1 , . . . , xi , . . . , xn+1 )

i=1

n + (−1)i+1 f (x1 , . . . , xi , . . . , xn , xi ).xn+1 i=1

−

n

(−1)i+1 f (x1 , . . . , xi , . . . , xn , xi .xn+1 )

i=1

+

(−1)i+j f ([xi , xj ], x2 , . . . , xi , . . . , xj , . . . , xn+1 ).

i<j≤n

In particular we have (δ 1 f )(x1 , x2 ) = x1 .f (x2 ) + f (x1 ).x2 − f (x1 .x2 ) (δ 2 f )(x1 , x2 , x3 ) = f (x1 , x2 .x3 ) − f (x1 .x2 , x3 ) + f (x2 .x1 , x3 ) − f (x2 , x1 .x3 ) + x1 .f (x2 , x3 ) − f (x1 , x2 ).x3 + f (x2 , x1 ).x3 − x2 .f (x1 , x3 ). n Recall that [xi , xj ] = xi .xj −xj .xi . Since δ 2 = 0 we obtain cohomology groups HLSA (A, A). 1 2 Note that Z (A, A) = Der(A), and that Z (A, A) describes inﬁnitesimal left-symmetric deformations of A, in the sense of Gerstenhaber. Nijenhuis showed in [55], that many properties of this LSA-cohomology can be deduced from Lie algebra cohomology. In fact, we have n HLSA (A, A) ∼ = H n−1 (gA , End(A)),

where gA denotes the underlying Lie algebra of A. Dzhumadil’daev [32] more generally n has deﬁned cohomology groups HRSA (A, M ) for arbitrary right-symmetric modules M . He proves, among other things that n HRSA (A, M ) ∼ = H n−1 (gA , C 1 (A, M )).

Example 3.53. Let A be an RSA with Lie algebra gA = gln (K) over a ﬁeld K of characteristic zero (see [6], [19], [32]). Then, for k ≥ 1, 1 (A, A) ∼ ZRSA = Z 1 (sln (K), sln (K)) ∼ = sln (K), H k (A, A) ∼ = Z 1 (A, A) ⊗ H k−1 (gln (K), K). RSA

RSA

References [1] A. d’Andrea and V.G. Kac: “Structure theory of ﬁnite conformal algebras”, Selecta Math., Vol. 4, (1998), pp. 377–418.

354

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

[2] L. Auslander: “Simply transitive groups of aﬃne motions”, Am. J. Math., Vol. 99, (1977), pp. 809–826. [3] C. Bai and D. Meng: “A Lie algebraic approach to Novikov algebras”, J. Geom. Phys. Vol. 45(1–2), (2003), pp. 218–230. [4] A.A. Balinskii and S.P. Novikov: “Poisson brackets of hydrodynamic type, Frobenius algebras and Lie algebras”, Sov. Math. Dokl., Vol. 32, (1985), pp. 228–231. [5] B. Bakalov and V. Kac: “Field algebras”, Int. Math. Res. Not., Vol. 3, 2003, pp. 123–159. [6] O. Baues: “Left-symmetric algebras for gl(n)”, Trans. Amer. Math. Soc., Vol. 351(7), (1999), pp. 2979–2996. [7] Y. Benoist: “Une nilvari´et´e non aﬃne”, J. Diﬀerential Geom., Vol. 41, (1995), pp. 21–52. [8] J.P. Benz´ecri: Vari´et´es localement aﬃnes, Th`ese, Princeton Univ. , Princeton, N. J., 1955. [9] R.E. Borcherds: “Vertex algebras, Kac-Moody algebras, and the Monster”, Proc. Nat. Acad. Sci. , Vol. 83(10), (1986), pp. 3068–3071. [10] N. Boyom: “Sur les structures aﬃnes homotopes `a z´ero des groupes de Lie”, J. Diﬀ. Geom., Vol. 31, (1990), pp. 859–911. [11] D. Burde: “Aﬃne structures on nilmanifolds”, Int. J. Math., Vol. 7, (1996), pp. 599–616. [12] D. Burde: “Simple left-symmetric algebras with solvable Lie algebra”, Manuscripta Math., Vol. 95, (1998), pp. 397–411. [13] D. Burde and K. Dekimpe: “Novikov structures on solvable Lie algebras”, J. Geom. Phys., (2006), to appear. [14] D. Burde and F. Grunewald: “Modules for certain Lie algebras of maximal class”, J. Pure Appl. Algebra, Vol. 99, (1995), pp. 239–254. [15] D. Burde: “Aﬃne cohomology classes for ﬁliform Lie algebras”, Contemporary Math., Vol. 262, (2000), pp. 159–170. [16] D. Burde: Left-invariant aﬃne structures on nilpotent Lie groups, Habilitation thesis, D¨ usseldorf, 1999. [17] D. Burde: “A reﬁnement of Ado’s Theorem”, Archiv Math., Vol. 70, (1998), pp. 118–127. [18] D. Burde: “Estimates on binomial sums of partition functions”, Manuscripta Math., Vol. 103, (2000), pp. 435–446. [19] D. Burde: “Left-invariant aﬃne structures on reductive Lie groups”, J. Algebra, Vol. 181, (1996), pp. 884–902. [20] A. Cayley: On the Theory of Analytic Forms Called Trees, Collected Mathematical Papers of Arthur Cayley, Vol. 3, Cambridge Univ. Press, Cambridge, 1890, 1890, pp. 242–246. [21] Y. Carri´ere, F. Dal’bo and G. Meigniez: “Inexistence de structures aﬃnes sur les ﬁbres de Seifert”, Math. Ann., Vol. 296, (1993), pp. 743–753. [22] K.S. Chang, H. Kim and H. Lee: “On radicals of a left-symmetric algebra”, Commun.

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

[23] [24] [25] [26] [27] [28] [29] [30]

[31] [32] [33]

[34]

[35] [36] [37]

[38] [39] [40] [41]

355

Algebra, Vol. 27(7), (1999), pp. 3161–3175. K.S. Chang, H. Kim and H. Lee: “Radicals of a left-symmetric algebra on a nilpotent Lie group”, Bull. Korean Math. Soc. Vol. 41(2), (2004), pp. 359–369. F. Chapoton and M. Livernet: “Pre-Lie algebras and the rooted trees operad”, Intern. Math. Research Notices, Vol. 8, (2001), pp. 395–408. A. Connes and D. Kreimer: “Hopf algebras, renormalization and noncommutative geometry”, Comm. Math. Phys., Vol. 199(1), (1998), pp. 203–242. K. Dekimpe and M. Hartl: “Aﬃne structures on 4–step nilpotent Lie algebras” J. Pure Appl. Math., Vol. 129, (1998), pp. 123–134. K. Dekimpe and W. Malfait: “Aﬃne structures on a class of virtually nilpotent groups”, Topology Appl., Vol. 73, (1996), pp. 97–119. J. Dixmier and W.G. Lister: “Derivations of nilpotent Lie algebras”, Proc. Amer. Math. Soc., Vol. 8, (1957), pp. 155–158. J. Dorfmeister: “Quasi-clans”, Abh. Math. Semin. Univ. Hamburg, Vol. 50, (1980), pp. 178–187. A. Dzhumaldil’daev and C. L¨ofwall: “Trees, free right-symmetric algebras, free Novikov algebras and identities”, Homology Homotopy Appl., Vol. 4(2), (2002), pp. 165–190. A. Dzhumaldil’daev: “N -commutators”, Comment. Math. Helv., Vol. 79(3), (2004), pp. 516–553. A. Dzhumaldil’daev: “Cohomologies and deformations of right-symmetric algebras”, J. Math. Sci., Vol. 93(6), (1999), pp. 836–876. I.B. Frenkel, Y. Huang and J. Lepowsky: “On axiomatic approaches to vertex operator algebras and modules”, Mem. Amer. Math. Soc., Vol. 104(494), (1993), pp. 1–64. I.B. Frenkel, J. Lepowsky and A. Meurman: Vertex operator algebras and the Monster. Pure and Applied Mathematics, Vol. 134, Academic Press, Boston, MA, 1988, pp. 1–508. M. Gerstenhaber: “The cohomology structure of an associative ring”, Ann. Math., Vol. 78, (1963), pp. 267–288. V. Gichev: “On complete aﬃne structures in Lie groups”, Preprint ArXiv. W.A. de Graaf: “Constructing faithful matrix representations of Lie algebras”, In: Proceedings of the 1997 International Symposium on Symbolic and Algebraic Computation, ACM, New York, pp. 54–59 (electronic). J. Helmstetter: “Radical d’une alg`ebre sym´etrique a gauche”, Ann. Inst. Fourier, Vol. 29, (1979), pp. 17–35. N. Jacobson: “A note on automorphisms and derivations of Lie algebras”, Proc. Amer. Math. Soc., Vol. 6, (1955), pp. 281–283. N. Jacobson: “Schur’s theorem on commutative matrices”, Bull. Amer. Math. Soc., Vol. 50, (1944), pp. 431–436. V. Kac: Vertex algebras for beginners, University Lecture Series, Vol. 10, American Mathematical Society, Providence, 1998, pp. 1–201.

356

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

[42] H. Kim: “Complete left-invariant aﬃne structures on nilpotent Lie groups”, J. Diﬀ. Geom., Vol. 24, (1986), pp. 373–394. [43] S. Kobayashi and K. Nomizu: Foundations of Diﬀerential Geometry, Vols. I and II, Wiley-Interscience Publishers, New York and London, 1969. [44] J.-L. Koszul: “Domaines born´es homog`enes et orbites de groupes de transformations aﬃnes”, Bull. Soc. Math. France, Vol. 89, (1961), pp. 515–533 [45] D. Kreimer: “New mathematical structures in renormalizable quantum ﬁeld theories”, Ann. Phys., Vol. 303(1), (2003), pp. 179–202. [46] D. Kreimer: “Structures in Feynman Graphs - Hopf Algebras and Symmetries”, Proc. Symp.. Pure Math., Vol. 73, (2005), pp. 43–78. [47] N.H. Kuiper: Sur les surfaces localement aﬃnes, Colloque de G´eometrie diﬀ´erentielle, Strasbourg, 1953, pp. 79–86. [48] J. Lepowsky and H. Li: “Introduction to Vertex Operator Algebras and Their Representations”, Progr. Math. Vol. 227, (2003), pp. 1–316. [49] J.P. May: “Geometry of Iterated Moduli Spaces”, Lecture Notes in Math., Vol. 271, 1972. [50] J. Milnor: “On fundamental groups of complete aﬃnely ﬂat manifolds”, Advances Math., Vol. 25, (1977), pp. 178–187. [51] A. Mizuhara: “On the radical of a left-symmetric algebra”, Tensor N. S., Vol. 36, (1982), pp. 300–302. [52] A. Mizuhara: “On the radical of a left-symmetric algebra II”, Tensor N. S., Vol. 40, (1983), pp. 221–232. [53] T. Nagano and K. Yagi: “The aﬃne structures on the real two torus”, Osaka J. Math., Vol. 11, (1974), pp. 181–210. [54] A. Nijenhuis: “The graded Lie algebras of an algebra”, Indag. Math., Vol. 29, (1967), pp. 475–486. [55] A. Nijenhuis: “On a class of common properties of some diﬀerent types of algebras. II”, Nieuw Arch. Wisk. 3, Vol. 17, (1969), pp. 87–108. [56] M. Nisse: “Structure aﬃne des infranilvari´et´es et infrasolvari´et´es”, C. R. Acad. Sci. Paris, Vol. 310, (1990), pp. 667–670. [57] J.M. Osborn: “Novikov algebras”, Nova J. Algebra Geom., Vol. 1(1), (1992), pp. 1–13. [58] J.M. Osborn: “Inﬁnite dimensional Novikov algebras of characteristic 0”, J. Algebra, Vol. 167(1), (1994), pp. 146–167. [59] B.E. Reed: “Representations of solvable Lie algebras”, Michigan Math. J., Vol. 16, (1969), pp. 227–233. [60] M. Rosellen: “A course in vertex algebra”, Preprint, (2005). [61] J. Scheuneman: “Aﬃne structures on three-step nilpotent Lie algebras”, Proc. Amer. Math. Soc., Vol. 46, (1974), pp. 451–454. [62] I. Schur: “Zur Theorie vertauschbarer Matrizen”, J. Reine Angew. Mathematik, Vol. 130, (1905), pp. 66–76. [63] D. Segal: “The structure of complete left-symmetric algebras”, Math. Ann., Vol. 293,

D. Burde / Central European Journal of Mathematics 4(3) 2006 323–357

[64] [65] [66] [67]

357

(1992), pp. 569–578. J. Smillie: “An obstruction to the existence of aﬃne structures”, Invent. Math., Vol. 64, (1981), pp. 411–415. W.P. Thurston: Three-dimensional Geometry and Topology, Vol. 1, Princeton Mathematical Series, Vol. 35, Princeton University Press, 1997. E.B. Vinberg: “Convex homogeneous cones”, Transl. Moscow Math. Soc., Vol. 12, (1963), pp. 340–403. E. Zelmanov: “On a class of local translation invariant Lie algebras”, Soviet Math. Dokl., Vol. 35, (1987), pp. 216–218.

DOI: 10.2478/s11533-006-0013-x Research article CEJM 4(3) 2006 358–370

The set of toric minimal log discrepancies∗ Florin Ambro† Research Institute for Mathematical Sciences, Kyoto University Kyoto 606-8502, Japan

Received 23 August 2005; accepted 28 February 2006 Abstract: We describe the set of minimal log discrepancies of toric log varieties, and study its accumulation points. c Versita Warsaw and Springer-Verlag Berlin Heidelberg. All rights reserved. Keywords: Minimal log discrepancy, toric variety MSC (2000): 14B05, 14M25

1

Introduction

Minimal log discrepancies are invariants of singularities of log varieties. A log variety (X, B) is a normal variety X endowed with an eﬀective Weil R-divisor B, having at most log canonical singularities. For any Grothendieck point η ∈ X, the minimal log discrepancy of (X, B) at η is a non-negative real number denoted a(η; X, B). For example, a(η; X, B) = 1 − multη (B) for every codimension one point η ∈ X. For higher codimensional points, minimal log discrepancies can be computed on a suitable resolution of X. Let A ⊂ [0, 1] be a set containing 1 and let d be a positive integer. Denote by Mldd (A) the set of minimal log discrepancies a(η; X, B), where η ∈ X is a Grothendieck point of codimension d, and (X, B) is a log variety whose minimal log discrepancies in codimension one belong to A. For example, Mld1 (A) = A. In connection to the termination of a sequence of log ﬂips (see [8, 10]), Shokurov conjectured that if A satisﬁes the ascending chain condition, so does Mldd (A). Furthermore, under certain assumptions, ∗

This work is supported by a 21st Century COE Kyoto Mathematics Fellowship, and a JSPS Grantin-Aid No 17740011. † E-mail: [email protected]

F. Ambro / Central European Journal of Mathematics 4(3) 2006 358–370

359

the accumulation points of Mldd (A) should correspond to minimal log discrepancies of smaller codimensional points. This is known to hold for d = 2 (Shokurov [9], Alexeev [1]) and for any d in the case of toric varieties without boundary (Borisov [3]). The purpose of this note is to extend Borisov’s result to the case of toric log varieties. Given the explicit nature of the toric case, we hope this will provide the reader with some interesting examples. In order to state the main result, deﬁne Mldtor d (A) ⊂ Mldd (A) as above, except that we further require that X is a toric variety and B is torus invariant. Note that Mldtor 1 (A) = A. Theorem 1.1. The following properties hold for d ≥ 2: (1) We have ⎫ ⎪ 2 ≤ s ≤ d ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ s s s s ⎬ , . . . , x ) ∈ Q ∩ (0, 1] , (a , . . . , a ) ∈ A (x 1 s 1 s tor Mldd (A) = { x i ai , ⎪ ⎪ i=1 index(x )| index(x , . . . , x ˆ , . . . , x ), ∀1 ≤ i ≤ s ⎪ i 1 i s ⎪ ⎪ ⎪ s ⎪ i=1 (1 + (m − 1)xi − mxi )ai ≥ 0 ∀m ∈ Z ⎭ where for a rational point x ∈ Qn , we denote by index(x) the smallest positive integer q such that qx ∈ Zn . (2) If A satisfies the ascending chain condition, then so does Mldtor d (A). (3) Assume that A has no nonzero accumulation points. Then the set of accumulation points of Mldtor d (A) is included in 1 Mldtor {0} ∪ d ({ ; n ≥ 1} · A). n 1≤d ≤d−1 Equality holds if d = 2, or if { n1 ; n ≥ 1} · A ⊆ A. We use the same methods as Borisov [3, 4]. The explicit description in (1) is straightforward, whereas the accumulation behaviour in (2) and (3) relies on a result of Lawrence [6] stating that the set of closed subgroups of a real torus, which do not intersect a given open subset, has ﬁnitely many maximal elements with respect to inclusion. Finally, we should point out that Mldtor d (A) is strictly smaller than Mldd (A) in general. For example, even the set of accumulation points of Mld2 (A) (see Shokurov [9] for an explicit description) is larger than {0} ∪ { n1 ; n ≥ 1} · A, the set of accumulation points of Mldtor 2 (A).

2

Toric log varieties

In this section we recall the deﬁnition of minimal log discrepancies and their explicit description in the toric case. The reader may consult [2] for more details. A log variety (X, B) consists of a normal algebraic variety X, deﬁned over an algebraically closed ﬁeld k of characteristic zero, endowed with a ﬁnite combination B =

360

F. Ambro / Central European Journal of Mathematics 4(3) 2006 358–370

i bi Bi

of Weil prime divisors Bi with non-negative real coeﬃcients bi , such that KX + B is R-Cartier. Here KX is the canonical divisor of X, computed as the Weil divisor of zeros dim(X) and poles (ω)X of a top rational form ω ∈ Ωk(X)/k ; it is uniquely deﬁned up to linear equivalence. The R-Cartier property of KX + B means that locally on X, there exist ﬁnitely many non-zero rational functions aα ∈ k(X)× and rα ∈ R such that KX +B = α rα (aα ). Let μ : X → X be a proper birational morphism from a normal variety X and let E ⊂ X be a prime divisor. Let ω be a top rational form on X, deﬁning KX , and let KX be the canonical divisor deﬁned by μ∗ ω. The real number a(E; X, B) = 1 + multE (KX − μ∗ (KX + B)) is called the log discrepancy of (X, B) at E. For a Grothendieck point η ∈ X, the minimal log discrepancy of (X, B) at η is deﬁned as a(η; X, B) = inf a(E; X, B), μ(E)=¯ η

where the inﬁmum is taken over all prime divisors E on proper birational maps μ : X → X. This inﬁmum is either −∞, or a non-negative real number. In the latter case, (X, B) is said to have log canonical singularities at η and the invariant is computed as follows: By Hironaka, there exists a proper birational morphism μ : X → X such that X is nonsingular, μ−1 (¯ η ) is a divisor on X , and there exists a simple normal crossings divisor −1 η ) and KX − μ∗ (KX + B). Then i Ei on X which supports both μ (¯ a(η; X, B) = min a(Ei ; X, B). μ(Ei )=¯ η

Next we specialize these notions to the toric case. We employ standard terminology on toric varieties, cf. Oda [7]. A toric log variety is a log variety (X, B) such that X is a toric variety and B is torus invariant. Thus there exists a fan Δ in a lattice N such that X = TN emb(Δ) and B = i bi V (ei ), where {ei }i is the set of primitive lattice points on the one-dimensional cones of Δ and V (ei ) ⊂ X is the torus invariant prime Weil divisor corresponding to ei . The canonical divisor is KX = i −V (ei ), and the R-Cartier property of KX + B means that there exists a function ψ : |Δ| → R such that ψ(ei ) = 1 − bi for every i, and ψ|σ is linear for every cone σ ∈ Δ. We may assume that (X, B) has log canonical singularities, which is equivalent to ψ ≥ 0 or bi ∈ [0, 1] for all i. Let e ∈ N prim ∩ |Δ| be a non-zero primitive vector. The barycentric subdivision with respect to e deﬁnes a subdivision Δe ≺ Δ and the exceptional locus of the birational morphism TN emb(Δe ) → TN emb(Δ) is a prime divisor denoted Ee . It is easy to see that a(Ee ; X, B) = ψ(e). Due to this property, ψ is called the log discrepancy function of (X, B). Minimal log discrepancies of toric log varieties are computed as follows: Since these are local invariants, we only consider aﬃne varieties and thus Δ consists of the faces of some strongly convex rational polyhedral cone σ ⊂ NR . We denote X = TN emb(σ).

F. Ambro / Central European Journal of Mathematics 4(3) 2006 358–370

361

Assume ﬁrst that 0 ∈ X is a torus invariant closed point (it is unique since X is aﬃne). Using the existence of good resolutions in the toric category, it is easy to see that a(0; X, B) = min(ψ|N ∩relint(σ) ). For the general case, let η ∈ X be a Grothendieck point. There exists a unique face τ ≺ σ such that η ∈ orb(τ ). Let c and d be the codimension of orb(τ ) and η in X, respectively. The induced aﬃne toric log variety (X , B ) = (TN ∩(τ −τ ) emb(τ ), multV (e) (B)V (e)) e∈τ (1)

has a unique torus invariant closed point 0 , and we obtain a(η; X, B) = mld(0 ; X , B ) + d − c.

3

The set of toric minimal log discrepancies

Let A ⊆ [0, 1] be a set containing 1. Definition 3.1. For an integer d ≥ 1, let Mldtor d (A) be the set of minimal log discrepancies a(η; X, B), where η ∈ X is a Grothendieck point of codimension d and (X, B) is a toric log variety whose minimal log discrepancies in codimension one belong to A. It is easy to see that Mldtor 1 (A) = A. Definition 3.2. For an integer d ≥ 2, deﬁne Vd (A) to be the set of pairs (x, a) ∈ (0, 1]d × Ad satisfying the following properties: (i) x ∈ Qd . (ii) index(xi )| index(x1 , . . . , xˆi , . . . , xd ) for 1 ≤ i ≤ d. d (iii) i=1 (1 + (m − 1)xi − mxi )ai ≥ 0 for all m ∈ Z. For x ∈ Qn , index(x) denotes the smallest positive integer q such that qx ∈ Zn . Note that property (ii) means that (1, 0, . . . , 0), . . . , (0, . . . , 0, 1) are primitive vectors in the lattice Zd + Zx. Also, it is enough to verify property (iii) for the ﬁnitely many integers 1 ≤ m ≤ index(x) − 1. For (x, a) ∈ Vd (A) we denote x, a =

d

x i ai .

i=1

Proposition 3.3. For d ≥ 2, we have Mldtor (A) = {x, a; (x, a) ∈ Vs (A)}. d 2≤s≤d

Proof. (1) We ﬁrst show that the right hand side is a subset of the left hand side. Fix (x, a) ∈ Vs (A) for some 2 ≤ s ≤ d.

362

F. Ambro / Central European Journal of Mathematics 4(3) 2006 358–370

If s = d, let N = Zd + Zx and let σ be the standard positive cone in Rd , spanned by the standard basis e1 , . . . , ed of Zd . Let 0 ∈ TN emb(σ) be the invariant closed point corresponding to σ. Then the aﬃne toric log variety d (TN emb(σ), (1 − ai )V (ei )) i=1

has minimal log discrepancy at 0 equal to x, a. Indeed, the log discrepancy function ψ = di=1 ai e∗i attains its minimum at x, and ψ(x) = x, a. Therefore x, a ∈ Mldtor d (A). Assume now that 2 ≤ s ≤ d − 1. Let e1 , . . . , ed be the standard basis of Zd , let ed+1 = (d − s)e1 + e2 − di=s+1 ei , let v = si=1 xi ei and let N = Zd + Zv. Let σ be the cone in Rd generated by e1 , . . . , ed+1 and set ai = a1 for s + 1 ≤ i ≤ d and ad+1 = a2 . Then d+1 0 ∈ (TN emb(σ), (1 − ai )V (ei )) i=1

is a d-dimensional germ of a toric log variety with minimal log discrepancy equal to x, a. Indeed, note ﬁrst that KX + B is R-Cartier since a2 = (d − s)a1 + a2 − di=s+1 a1 . Also, the log discrepancy function is ψ = di=1 ai e∗i and there exists e = d+1 i=1 yi ei ∈ N ∩ relint(σ) where the log discrepancy function ψ attains its minimum. We may assume yi ∈ [0, 1] for every i. If yd+1 ∈ / Z, then ys+1 = · · · = yd = yd+1 , hence e = si=1 yi ei . s Therefore ψ(e) ≥ ψ(v). If yd+1 ∈ Z, then i=1 yi ei ∈ N ∩ relint(σ), hence ψ(e) ≥ s ψ( i=1 yi ei ) ≥ ψ(v). We conclude that ψ attains its minimum at v and therefore x, a = ψ(v) ∈ Mldtor d (A). (2) Let (X, B) be a toric log variety with codimension one log discrepancies in A and let η ∈ X be a Grothendieck point of codimension d. We shall show that a(η; X, B) belongs to the set on the right hand side. There exists a unique cone σ in the fan deﬁning X such that η ∈ orb(σ). Let c be the codimension of orb(σ) in X. Then a(η; X, B) coincides with the minimal log discrepancy of the toric log variety (TN ∩(σ−σ) emb(σ), multV (e) (B)V (e)) × Ad−c k . e∈σ(1)

in the invariant closed point 0. Therefore we may assume that X is aﬃne and η is a torus invariant closed point 0. We have X = TN emb(σ), with dim σ = dim N = d, B = i∈I (1 − ai )V (ei ) with ai ∈ A for every i. The log discrepancy function ψ ∈ σ ∨ of (X, B) satisﬁes ψ(ei ) = ai , and we have mld(0; X, B) = min(ψ|N ∩relint(σ) ). There exists e ∈ N ∩relint(σ) such that mld(0; X, B) = ψ(e). By Carath´eodory’s Theorem (see [7], Theorem A.15), there exists a subset {1, . . . , s} ⊆ I, with 2 ≤ s ≤ d, such that e1 , . . . , es are linearly independent and e belongs to the relative interior of the cone s d spanned by e1 , . . . , es . Let e = i=1 xi ei , and denote x = (x1 , . . . , xs ) ∈ (0, 1] , a = (a1 , . . . , as ) ∈ Ad . It is clear that mld(0; X, B) = x, a, and we claim that (x, a) ∈ Vs (A).

F. Ambro / Central European Journal of Mathematics 4(3) 2006 358–370

363

Indeed, it is clear that x ∈ Qs . Since ei is a primitive lattice point of N , it is also primitive in the sublattice si=1 Zei +Ze, which is equivalent to index(xi )| index(x1 , . . . , xˆi , . . . , xs ) for every 1 ≤ i ≤ s. Finally, let m ∈ Z. We have si=1 (1 + mxi − mxi )ei ∈ N ∩ relint(σ), hence ψ( si=1 (1 + mxi − mxi )ei ) ≥ ψ(e). Equivalently, si=1 (1 + (m − 1)xi − mxi )ai ≥ 0 and therefore (x, a) ∈ Vs (A).

4

The set V˜d (A)

By Proposition 3.3, the limiting behaviour of toric minimal log discrepancies is controlled by the limiting behaviour of the sets Vd (A). The rationality properties (i) and (ii) deﬁning Vd (A) do not behave well with respect to limits, and for this reason we enlarge Vd (A) to a new set V˜d (A), deﬁned only by property (iii), which turns out to have good inductive properties and limiting behaviour. Definition 4.1. Let A ⊆ [0, 1] be a subset containing 1. Deﬁne V˜d (A) = {(x, a) ∈ (0, 1]d × Ad ;

d

(1 + (m − 1)xi − mxi )ai ≥ 0, ∀m ∈ Z}.

i=1

Equivalently, V˜d (A) is the set of pairs (x, a) ∈ (0, 1]d × Ad such that the group Zd + Zx does not intersect the set {y ∈ (0, 1]d ; y − x, a < 0}. As before, we denote x, a = d i=1 xi ai . Lemma 4.2. The following equality holds 1 V˜1 (A) = ((0, 1] × {0}) ∪ ({ ; n ≥ 1} × A), n where the first term is missing if 0 ∈ / A. In particular, {x, a; (x, a) ∈ V˜1 (A)} =

∞ 1 · A. n n=1

Proof. Let x ∈ (0, 1] such that 1+(m−1)x− mx ≥ 0 for every integer m. Equivalently, we have sup ( mx − mx) ≤ 1 − x. m∈Z

Assume by contradiction that x ∈ / Q. Then the set { mx − mx}m≥1 is dense in [0, 1] (cf. [5], Chapter IV), hence supm∈Z ( mx − mx) = 1. We obtain 1 ≤ 1 − x, hence x = 0, a contradiction. Therefore x = pq , for integers 1 ≤ p ≤ q with gcd(p, q) = 1. The above inequality becomes 1 p 1 − = max( mx − mx) ≤ 1 − , m∈Z q q hence p = 1. Therefore x = 1q .

364

F. Ambro / Central European Journal of Mathematics 4(3) 2006 358–370

We will need the following result of Lawrence. Note that property (ii) is a consequence of (i). Theorem 4.3 ([6]). Let T = Rd /Zd be a real torus. (i) Let U ⊂ T be an open subset. The set of closed subgroups of T which do not intersect U has only finitely many maximal elements with respect to inclusion. (ii) The set of finite unions of closed subgroups of T satisfies the descending chain condition. Theorem 4.4. Assume that A satisfies the ascending chain condition. Then the set {x, a; (x, a) ∈ V˜d (A)} satisfies the ascending chain condition. Proof. Assume ﬁrst that d = 1. By Lemma 4.2, 1 {x, a; (x, a) ∈ V˜1 (A)} = { ; n ≥ 1} · A. n Both sets { n1 ; n ≥ 1} and A are nonnegative and satisfy the ascending chain condition, hence their product satisﬁes the ascending chain condition. Now suppose d ≥ 2 and assume by induction the result holds for smaller values of d. Assume by contradiction that {(xn , an )}n≥1 is a sequence in V˜d (A) such that xn , an < xn+1 , an+1 for n ≥ 1. Since A satisﬁes the ascending chain condition, we may assume after passing to a subsequence that ani ≥ an+1 , ∀n ≥ 1, ∀1 ≤ i ≤ d. i Assume ﬁrst that xn ∈ / (0, 1)d for inﬁnitely many n’s. After passing to a subsequence, we may assume xn1 = 1 for every n. Write xn = (1, x¯n ) and an = (an1 , a ¯n ). Then ¯ xn , a ¯n < ¯ xn+1 , a ¯n+1 for every n ≥ 1, which contradicts the ACC property of the set {¯ x, a ¯; (¯ x, a ¯) ∈ V˜d−1 (A)}. Assume now that xn ∈ (0, 1)d for every n. We set U n = {x ∈ (0, 1)d ; x − xn , an < 0} and regard U n as an open subset of the torus T d = Rd /Zd . Let X n be the union of the subgroups of T d which do not intersect U n . By Theorem 4.3.(i), X n is a ﬁnite union of closed subgroups of T d . It is easy to see that U n ⊆ U n+1 , hence X n ⊇ X n+1 for n ≥ 1. Since (xn , an ) ∈ V˜d (A), we have U n ∩ (Zd + Zxn ) = ∅. Therefore xn ∈ X n for every n. We have xn , an+1 ≤ xn , an < xn+1 , an+1 . Then xn ∈ U n+1 , hence xn ∈ / X n+1 . Therefore X n X n+1 for every n ≥ 1, contradicting Theorem 4.3.(ii).

F. Ambro / Central European Journal of Mathematics 4(3) 2006 358–370

365

Lemma 4.5. The following properties hold: (1) If A is a closed set, then V˜d (A) is a closed subset of (0, 1]d × Ad . (2) Identify (0, 1]s with the face xs+1 = · · · = xd = 1 of (0, 1]d . Then V˜d (A) ∩ (0, 1]s = V˜s (A). (3) Identify [0, 1]s with the face xs+1 = · · · = xd = 0 of [0, 1]d and assume that A is a closed set. Then V˜d (A) ∩ (0, 1]s = V˜s (A). Proof. (1) Let (x, a) ∈ (0, 1]d × Ad such that there exists a sequence {(xn , an )}n≥1 in V˜d (A) with x = limn→∞ xn and a = limn→∞ an . Fix m ∈ Z. By assumption, we have d

(1 + (m − 1)xni − mxni )ani ≥ 0, ∀n ≥ 1.

i=1

There exists a positive integer n(m) such that mxni ≥ mxi for every 1 ≤ i ≤ d and every n ≥ n(m). Therefore d

(1 + (m − 1)xni − mxi )ani ≥ 0, ∀n ≥ n(m),

i=1

Letting n tend to inﬁnity, we obtain d (1 + (m − 1)xi − mxi )ai ≥ 0. i=1

Since m was arbitrary, we conclude that (x, a) ∈ V˜d (A). (2) This is clear. (3) Assume that we have a sequence {(xn , an )}n≥1 ⊂ V˜d (A) such that limn→∞ xn = (x, 0, . . . , 0) ∈ (0, 1]s and limn→∞ an = (a, as+1 , . . . , ad ). Let m be a positive integer. Note that for s + 1 ≤ i ≤ d we have mxni ∈ (0, 1] for n ≥ n(m), hence lim (1 + (m − 1)xni − mxni ) = 0 for s + 1 ≤ i ≤ d.

n→∞

Therefore si=1 (1 + (m − 1)xi − mxi )ai ≥ 0 for every m ≥ 1. Since Zs + Zx is included in the closure of Zd + Z≥0 x, we obtain si=1 (1 + (m − 1)xi − mxi )ai ≥ 0 for m ≤ −1 as well. Therefore (x, a) ∈ V˜s (A), proving the direct inclusion. For the converse, just note that (x, a) ∈ V˜s (A) is the limit of the sequence ((x, n1 , . . . , n1 ), (a, 1, . . . , 1)) ∈ V˜d (A). Definition 4.6. For x ∈ R and m ∈ Z, deﬁne x(m) = 1 + mx − mx. Note that this operation induces a selfmap of the half-open interval (0, 1]. For x ∈ Rd and m ∈ Z, deﬁne x(m) ∈ Rd componentwise.

366

F. Ambro / Central European Journal of Mathematics 4(3) 2006 358–370

Since (0, 1]d ∩ (Zd + Zx) = {x(m) ; m ∈ Z}, we have the equivalent description V˜d (A) = {(x, a) ∈ (0, 1]d × Ad ; x(m) − x, a ≥ 0, ∀m ∈ Z}. Lemma 4.7. Let x ∈ (0, 1]d and let a ∈ Ad such that ai > 0 for 1 ≤ i ≤ d. Then there exists a relatively open neighborhood x ∈ U ⊆ (0, 1]d such that if y ∈ U and y (m) − x, a ≥ 0 for every m ∈ Z, then y − x, a = 0. Proof. (1) Assume ﬁrst that x ∈ (0, 1)d . By Theorem 4.3.(ii), the set of closed subgroups of Rd which contain Zd and do not intersect the nonempty open set {y ∈ (0, 1)d ; y − x, a < 0} has ﬁnitely many maximal elements with respect to inclusion, say H1 , . . . , Hl . If x ∈ H1 , then H1 is a rational aﬃne subspace of Rd in an open neighborhood x ∈ U1 ⊂ (0, 1)d . Let v ∈ H1 − x. Since x ∈ (0, 1)d , there exists > 0 such that x + tv ∈ H1 ∩ (0, 1)d for |t| < . In particular, x + tv − x, a ≥ 0, that is tv, a ≥ 0 for |t| < . We infer that v, a = 0. Therefore H1 ∩ U1 is contained in {y ∈ (0, 1)d ; y − x, a = 0}. If x ∈ / H1 , then U1 = (0, 1)d \ H1 is an open neighborhood of x. Repeating this procedure, we obtain a neighborhood Ui of x, for each closed subgroup Hi . The intersection U = U1 ∩ · · · ∩ Ul is the desired neighborhood. (2) We may assume after a reordering that xi = 1 for 1 ≤ i ≤ s and xi ∈ (0, 1) for s < i ≤ n. If s = n, we may take U = (0, 1]d . Assume now that s < n. By [5], Chapter IV, there exists a negative integer m0 such that s

x(m0 ) − x, a < min ai . i=1

Let y ∈ (0, 1]s × di=s+1 ( mm00xi , m0mxi0 −1 ) such that y (m) − x, a ≥ 0 for every m ∈ Z. We claim that y1 = · · · = ys = 1. Indeed, assume by contradiction that yj < 1 for some 1 ≤ j ≤ s. A straightforward computation gives y (m0 ) − x, a − m0 y − x, a = x(m0 ) − x, a +

d ( m0 xi − m0 yi )ai . i=1

By the choice of y, we obtain d s ( m0 xi − m0 yi )ai = (m0 − m0 yi )ai ≤ −aj , i=1

i=1

hence 0 ≤ x(m0 ) − x, a − aj . This contradicts our choice of m0 . Let x¯ = (xs+1 , . . . , xd ), y¯ = (ys+1 , . . . , yd ), a ¯ = (as+1 , . . . , ad ). We have (¯ x, a ¯) ∈ (m) ˜ Vd−s (A) and ¯ y − x¯, a ¯ ≥ 0 for every m ∈ Z. From Step 1, there exists an open ¯ neighborhood x¯ ∈ U ⊂ (0, 1)s such that if y¯ ∈ U¯ then ¯ y − x¯, a ¯ = 0. Then U = (0, 1] × (U¯ ∩ s

satisﬁes the required properties.

d

m0 xi m0 xi − 1 ( , )). m m 0 0 i=s+1

F. Ambro / Central European Journal of Mathematics 4(3) 2006 358–370

367

Lemma 4.8. The following equality holds for d ≥ 1 and a ∈ Ad : {x, a; (x, a) ∈ V˜d (A), x ∈ Qd } = {x, a; (x, a) ∈ V˜d (A)}. Proof. Let (x, a) ∈ V˜d (A). We have x1 , . . . , xs < 1 and xs+1 = . . . = xd = 1, where 0 ≤ s ≤ d. If s = 0, then x ∈ Qd and we are done. Assume s ≥ 1 and set x¯ = (x1 , . . . , xs ) x, a ¯) ∈ V˜s (A). Since x¯ ∈ (0, 1)s , there exists a closed and a ¯ = (a1 , . . . , as ). Then (¯ ¯ ⊆ Rs such that x¯ ∈ H ¯ ∩ Ux¯ ⊂ {¯ subgroup Zs ⊆ H z ; ¯ z − x¯, a ¯ = 0}, by Step 1 of the proof s ¯ of Lemma 4.7. Since H is rational, there exists z¯ ∈ Q ∩ H ∩ Ux¯ . Set x = (¯ z , 1, . . . , 1). d ˜ Then (x , a) ∈ Vd (A), x, a = x , a and x ∈ Q . Proposition 4.9. Assume that A has no positive accumulation points. Then the set of accumulation points of {x, a; (x, a) ∈ V˜d (A)} is {x, a; (x, a) ∈ V˜d (A)}. {0} ∪ 1≤d ≤d−1

Proof. Let r > 0 be an accumulation point, that is there exists a sequence (xn , an ) ∈ V˜d (A) with r = limn→∞ xn , an and r = xn , an for every n ≥ 1. By compactness, we may assume after passing to a subsequence that limn→∞ xn = x ∈ [0, 1]d and limn→∞ an = a ∈ [0, 1]d exist. We have r = x, a. We claim that ai xi = 0 for some i. Indeed, assume by contradiction that ai xi > 0 for every 1 ≤ i ≤ d. Since A has no nonzero accumulation points, we obtain an = a for n ≥ 1. Let Ux ⊂ (0, 1]d be the relative neighborhood of x associated to (x, a) in Lemma 4.7. Then xn ∈ Ux for n ≥ n0 . If xn − x, a ≥ 0, then (xn , a) ∈ V˜d (A) implies that z − x, a ≥ 0 for every z ∈ (Zd + Zxn ) ∩ (0, 1]d . Therefore xn − x, a = 0. This means xn , a = r, a contradiction. Therefore xn , a < r for every n. Since A has no positive accumulation points, it satisﬁes the ascending chain condition. Therefore the sequence (xn , a)n≥1 satisﬁes the ascending chain condition as well, by Theorem 4.4. This is a contradiction. We may assume ai xi > 0 for 1 ≤ i ≤ d and ai xi = 0 for d + 1 ≤ i ≤ d. We have d ≥ 1, since x, a > 0. Denote x¯ = (x1 , . . . , xd ) and a ¯ = (a1 , . . . , ad ). We have ˜ r = ¯ x, a ¯ and (¯ x, a ¯) ∈ Vd (A) by Theorem 4.5. For the converse, note that 1 1 (( , . . . , ), (1, . . . , 1)) ∈ V˜d (A) k k and ( k1 , . . . , k1 ), (1, . . . , 1) = kd accumulates to 0. Let now (x, a) ∈ V˜d (A) for 1 ≤ d ≤ d − 1. Deﬁne xk = (x , k1 , . . . , k1 ) and a = (a , 1, . . . , 1). Then (xk , a) ∈ V˜d (A) and xk , a = x , a + d−d accumulates to x , a . k Remark 4.10. Proposition 4.9 is false if A has a positive accumulation point. For example, let a > 0 be an accumulation point of a sequence of elements ak ∈ A. Then ((1, . . . , 1), (ak , 1, . . . , 1)) ∈ Vd (A) and (1, . . . , 1), (ak , 1, . . . , 1) accumulates to d − 1 + a, which clearly does not correspond to any element of V˜d (A), for d ≤ d − 1.

368

F. Ambro / Central European Journal of Mathematics 4(3) 2006 358–370

The set V˜d (A) is strictly larger that Vd (A). For example, ( 12 , 1) and ( l−1 , 1 ) (l ≥ 2) 2l l are rational points of V˜d ({1}) \ Vd ({1}). However, the following property holds. Lemma 4.11. The following inclusion holds: 1 {x, a; (x, a) ∈ V˜d (A)} ⊆ {x, a; (x, a) ∈ Vd ({ ; n ≥ 1} · A)}. n Proof. Let r = x, a for some (x, a) ∈ V˜d (A). By Lemma 4.8, we may assume that x ∈ Qd . We may assume ai > 0 for every i. Let e1 , . . . , ed be the standard basis of Rd , spanning the standard cone σ, let e = di=1 xi ei and let N = di=1 Zei + Ze. If we set ψ = di=1 ai e∗i , then we have min(ψ|N ∩relint(σ) ) = ψ(e) = r. There exists positive integers ni ≥ 1 such that ei = n1i ei are primitive elements of the lattice N . In the new coordinates, we have ψ = di=1 naii ei ∗ and e = i=1 ni xi ei . Since ψ attains its minimum at e and all ai ’s are positive, we infer that ni xi < 1 for every i. Set ai = naii and xi = ni xi . Then (x , a ) ∈ Vd ({ n1 ; n ≥ 1} · A) and x , a = r. Corollary 4.12. Assume that A = { n1 ; n ≥ 1} · A. Then {x, a; (x, a) ∈ Vd (A)} = {x, a; (x, a) ∈ V˜d (A)}.

5

Accumulation points of Mldtor d (A)

Theorem 5.1. The following properties hold: (1) If A satisfies the ascending chain condition, then so does Mldtor d (A). (2) Assume that A has no positive accumulation points. Then the set of accumulation points of Mldtor d (A) is included in {0} ∪

1 Mldtor d ({ ; n ≥ 1} · A). n 1≤d ≤d−1

The inclusion is an equality if { n1 ; n ≥ 1} · A ⊂ A. (3) Assume that A has no positive accumulation points and { n1 ; n ≥ 1} · A ⊂ A. Then Mldtor d (A) is a closed set if and only if 0 ∈ A. Proof. The inclusion Vd (A) ⊂ V˜d (A) and Proposition 3.3 give (A) ⊆ {x, a; (x, a) ∈ V˜d (A)}. Mldtor d 2≤d ≤d

(1) The set Mldtor d (A) is a subset of a ﬁnite union of sets satisfying the ascending chain condition, by Theorem 4.4. Therefore Mldtor d (A) satisﬁes the ascending chain condition.

F. Ambro / Central European Journal of Mathematics 4(3) 2006 358–370

369

(2) Assume that A has no nonzero accumulation points. By Proposition 4.9 and Lemma 4.11, the accumulation points of Mldtor d (A) belong to the set {0} ∪

1 Mldtor d ({ ; n ≥ 1} · A). n 1≤d ≤d−1

Assuming moreover that { n1 ; n ≥ 1} · A ⊆ A, we will show that all points of the above set are accumulation points of Mldtor d (A). If (x, a) ∈ Vd (A), then 1 1 ((x, 1, . . . , 1), (a, , . . . , )) ∈ Vd (A), n n and (x, 1, . . . , 1), (a, n1 , . . . , n1 ) = x, a +

d−d n

accumulates to x, a. Similarly,

1 1 1 ((1, 1, . . . , 1), ( , , . . . , )) ∈ Vd (A) n n n and (1, 1, . . . , 1), ( n1 , n1 , . . . , n1 ) = nd accumulates to 0. This proves the claim. (3) Assume that Mldtor d (A) is a closed set. Since 1 1 (( , . . . , ), (a, . . . , a)) ∈ Vd (A), k k ∈ Mldtor we infer that 0 = limk→∞ da d (A), which implies 0 ∈ A. k Conversely, assume 0 ∈ A. If (x, a) ∈ Vd (A) then ((x, 1, . . . , 1), (a, 0, . . . , 0)) ∈ Vd (A) and (x, 1, . . . , 1), (a, 0, . . . , 0) = x, a. We infer from (3) that Mldtor d (A) is a closed set. Lemma 5.2. Assume that A has no positive accumulation points. Then the following properties hold: 1 (1) The set of accumulation points of Mldtor 2 (A) is {0} ∪ k≥1 k A. (2) The set Mldtor 2 (A) is closed if and only if 0 ∈ A. Proof. (1) From Theorem 5.1, all accumulation points are of this form. Conversely, ﬁx 1 n a ∈ A and k ∈ Z≥1 . Then (( kn+1 , nk+1 ), (a, a)) ∈ V2 (A) is a sequence converging to 1 a ((0, k ), (a, a)), hence k is an accumulation point of Mldtor 2 (A). Since k is arbitrary, we infer that 0 is an accumulation point as well. tor (2) Assume that Mldtor 2 (A) is a closed set. Then 0 ∈ Mld2 (A), which implies 0 ∈ A. Assume now that 0 ∈ A. Then for a ∈ A and k ∈ Z≥1 we have a 1 1 = ( , ), (0, a) ∈ Mldtor 2 (A). k k k From (1), these are all possible accumulation points, hence Mldtor 2 (A) is a closed set.

370

F. Ambro / Central European Journal of Mathematics 4(3) 2006 358–370

References [1] V. Alexeev: “Two two-dimensional terminations”, Duke Math. J., Vol. 69(3), (1993), pp. 527–545. [2] F. Ambro: “On minimal log discrepancies”, Math. Res. Lett., Vol. 6(5-6), (1999), pp. 573–580. [3] A. Borisov: “Minimal discrepancies of toric singularities”, Manuscripta Math., Vol. 92(1), (1997), pp. 33–45. [4] A. Borisov: “On classiﬁcation of toric singularities”, Algebraic geom., Vol. 9; J. Math. Sci. (New York), Vol. 94(1), (1999), pp. 1111–1113. [5] J.W.S. Cassels: An introduction to Diophantine approximation, Cambridge Tracts in Mathematics and Mathematical Physics, Vol. 45, Cambridge University Press, New York, 1957. [6] J. Lawrence: Finite unions of closed subgroups of the n-dimensional torus, Applied geometry and discrete mathematics, 433–441, DIMACS Ser. Discrete Math. Theoret. Comput. Sci., 4, Amer. Math. Soc., Providence, RI, 1991. [7] T. Oda: Convex bodies and algebraic geometry. An introduction to the theory of toric varieties, Springer-Verlag, Berlin, 1988. [8] V.V. Shokurov: Problems about Fano varieties, Birational Geometry of Algebraic Varieties, Open Problems-Katata, 1988, pp. 30–32. [9] V.V. Shokurov: A.c.c. in codimension 2, preprint 1993. [10] V.V. Shokurov: “Letters of a bi-rationalist. V. Minimal log discrepancies and termination of log ﬂips”, Tr. Mat. Inst. Steklova (Russian), Vol. 246 (2004); Algebr. Geom. Metody, Svyazi i Prilozh., pp. 328–351; translation in: Proc. Steklov Inst. Math., Vol. 3(246), 2004, pp. 315–336.

DOI: 10.2478/s11533-006-0018-5 Research article CEJM 4(3) 2006 371–375

Generalized Alexandroﬀ Duplicates and CD0(K) spaces ∗ Mert Caglar ¸ , Zafer Ercan, Faruk Polat MiddleEast Technical University, Department of Mathematics, 06531 Ankara, Turkey

Received 26 October 2005; accepted 15 April 2006 Abstract: We deﬁne and investigateCDΣ,Γ (K, E)-type spaces, which generalizeCD0 -type Banach lattices introduced in [1]. We state that the space CDΣ,Γ (K, E) can be represented as the space of E-valued continuous functions on the generalized Alexandroﬀ Duplicate of K. As a corollary we obtain the main result of [6, 8]. c Versita Warsaw and Springer-Verlag Berlin Heidelberg. All rights reserved. Keywords: Alexandroﬀ Duplicate, homeomorphism, Banach lattices, CD0 (K, E)-spaces MSC (2000): 46D80, 54C35

1

Introduction

Throughout this paper E denotes a Banach lattice and Ω, Σ and Γ stand for topologies on K, where Σ is compact Hausdorﬀ space,Γ is a locally compact Hausdorﬀ space with Σ ⊂ Γ. These spaces are denoted by KΩ , KΣ and KΓ . The closure of a subset A of KΩ is denoted by clΩ (A). As usual, the space of E-valued KΩ -continuous functions on K is denoted by C(KΩ , E), or by C(K, E) if there is no possibility of confusion. C0 (KΓ , E) denotes the space of E-valued KΓ -continuous functions d on K such that for each > 0 there exists a compact set M with ||d(k)|| ≤ for each k ∈ K \ M . We write C(KΩ , R) = C(KΩ ) and C0 (KΓ , R) = C0 (KΓ ). If KΣ has no isolated points and KΓ is discrete then C(KΣ , E) ∩ C0 (KΓ , E) = {0}, and CD0 (KΣ , E) = C(KΣ , E) ⊕ C0 (KΓ , E) is a Banach lattice under point wise order and supremum norm. We refer to [1, 3] and [6] for more detail on these spaces. CDΣ,Γ (K, E) denotes the vector space C(KΣ , E) × ∗

E-mail:[email protected]

372

M. Caglar ¸ et al. / Central European Journal of Mathematics 4(3) 2006 371–375

C0 (KΓ , E) which is equipped with coordinate wise algebraic operations. It is easy to see that CDΣ,Γ (K, E) is a Banach lattice with respect to the order 0 ≤ (f, d) ⇐⇒ 0 ≤ f (k) and 0 ≤ f (k) + d(k) for each k ∈ K and under the norm ||(f, d)|| = max{||f ||, ||f + d||}, where ||.|| is the supremum norm. If KΣ has no isolated points and KΓ is discrete then CD0 (KΣ , E) and CDΣ,Γ (K, E) are isometrically Riesz isomorphic spaces. Let K × {0, 1} be topologized by the open base A = A1 ∪ A2 , where A1 = {H × {1} : H

is Γ − open}

and A2 = {G × {0, 1} − M × {1} : G is Σ − open,

M

is Γ − compact}.

Let us denote this topological space by KΣ,Γ ⊗ {0, 1}, which is called generalized Alexandroﬀ Duplicate (in the case that Γ is a discrete topology we denote this space by A(K)). The space A(K) has been constructed by R. Engelking [5] and it is generalized for an arbitrary locally compact Hausdorﬀ space in [4]. It is known that KΣ,Γ ⊗ {0, 1} is a compact Hausdorﬀ space (see [5] and [7]). The space A([0, 1]) (where [0, 1] is topologized under the usual metric) has been constructed by P. S. Aleksandrov and P. S. Uryson [2] as an example of a compact Hausdorﬀ space containing a discrete dense subspace. This space is called the Alexandroﬀ Duplicate [7, p. 1010]. The following deﬁnition is similar to the one which was introduced in [6]. Deﬁnition 1.1. Let ((kα , rα )) be a net in K × {0, 1} and(k, r) ∈ K × {0, 1}. We say that the net ((kα , rα )) converges to (k, r) (denoted by (kα , rα ) −→ (k, r)) if f (kα ) + rα d(kα ) −→ f (k) + rd(k) for each f ∈ C(KΣ ) and d ∈ C0 (KΓ ). KΣ,Γ {0, 1} denotes K × {0, 1} equipped with this convergence. The proof of the following theorem is a simple consequence of the above deﬁnition. Theorem 1.2. Under the convergence in the previous deﬁnition KΣ,Γ {0, 1} is a Hausdorﬀ topological space (not necessarily Σ ⊂ Γ).

2

The main result

In [6] it was proven that KΣ,Γ {0, 1} is a compact Hausdorﬀ space under the convergence given by the above deﬁnition if KΣ has no isolated points and KΓ is discrete. Some representations of certain Banach lattices have been constructed in [6] with the topology

M. Caglar ¸ et al. / Central European Journal of Mathematics 4(3) 2006 371–375

373

induced by this. We can identify the space of continuous functions on KΣ,Γ ⊗ {0, 1} as follows. Theorem 2.1. C(KΣ,Γ ⊗ {0, 1}, E) and CDΣ,Γ (K, E) are isometrically Riesz isomorphic spaces. Proof. Let f : K × {0, 1} → E be a map. Then f ∈ C(KΣ,Γ ⊗ {0, 1}, E) if and only if (i) k → f (k, 0) is Σ− continuous, and(ii) the function k → f (k, 1) − f (k, 0) belongs to the space C0 (KΓ , E). Suppose that (i) and (ii) are satisﬁed. Then k → f (k, 1) is Γ− continuous (As it is the sum of f (k, 1) − f (k, 0)− the ﬁrst part is Γ− continuous by (ii),the second one is Σ− continuous by (i), and hence also Γ− continuous as Σ ⊂ Γ). It follows that f is continuous at each point of K × {1}. Let k ∈ K. Let us show that f is continuous at (k, 0). Let > 0. Then H = {k ∈ K : ||f (k, 1) − f (k, 0)|| ≥ /2} is Γ− compact by (ii). Further, by (i) there is a Σ− open set G containing k such that||f (k, 0) − f (l, 0)|| < /2 for l ∈ G. Let U = (G × {0, 1})\H × {1}. Then U is a neighbourhood of (k, 0) in KΣ,Γ ⊗ {0, 1}. Further, if (l, i) ∈ U , then: either i = 0 -then ||f (k, 0) − f (l, 0)|| < /2 < ,or i = 1 -then l ∈ H and hence ||f (l, 1) − f (k, 0)|| ≤ ||f (l, 1) − f (l, 0)|| + ||f (l, 0) − f (k, 0)|| < . Conversely, suppose f is continuous. Then clearly(i) holds. Further, k → f (k, 1) is Γ−continuous, and hence k → f (k, 1) − f (k, 0) is Γ−continuous, too. It remains to show that {k ∈ K : ||f (k, 1) − f (k, 0)|| ≥ } is Γ− compact for each > 0. Suppose that for some > 0 {k ∈ K : ||f (k, 1) − f (k, 0)|| ≥ } is not Γ− compact and denote this set by H. Now, by compactness of (K, Σ) there is k ∈ K such that clΓ (G) ∩ H is not Γ− compact for any Σ− neighborhood G of k (otherwise H would be covered by ﬁnitely many Γ−compact subsets and hence itself would be Γ− compact ). Let U = (G × {0, 1}) \ M × {1} be a basic openset in KΣ,Γ ⊗ {0, 1} containing (k, 0) such that for each (l, i) ∈ U we have ||f (l, i)−f (k, 0)|| < /2. As clΓ (G)∩H is not Γ− compact, then there is l ∈ H∩(G\M ). Then both (l, i) and (l, 0) belong to U , hence||f (l, 1)−f (l, 0)|| < . However, ||f (l, 1) − f (l, 0)|| ≥ (as l ∈ H), a contradiction. From this we have the map π : CDΣ,Γ (K, E) −→ C(KΣ,Γ ⊗ {0, 1}, E) be deﬁned by π(f, d)(k, r) = f (k) + rd(k) for each (k, r) ∈ K × {0, 1}. It is clear that π is a bipostive, one-to-one linear operator. Let f ∈ C(KΣ,Γ ⊗ {0, 1}, E) be given. Deﬁne g, d : K → E,

g(k) = f (k, 0) and d(k) = f (k, 1) − d(k, 0).

Then from the above observation (g, d) ∈ CΣ,Γ (K, E) and π(g, d) = h, that is π is also onto. It is also clear that ||π(f, d)|| = ||f + d||. This completes the proof.

374

M. Caglar ¸ et al. / Central European Journal of Mathematics 4(3) 2006 371–375

Note that a characterization similar to the proof of Theorem 2.1 holds for functions with values in any metric space: Let (M, d) be a metric space and f : K × {0, 1} → M be a mapping. Then f ∈ C(KΣ,Γ ⊗ {0, 1}, M ) if and only if the following conditions are satisﬁed: (i) k → f (k, 0) is Σ-continuous. (ii) k → f (k, 1) is Γ-continuous. (iii) For each > 0 the set {k ∈ K : d(f (k, 0), f (k, 1)) ≥ } is Γ-compact. The following theorem is a surprising and interesting consequence of Theorem 2.1. Theorem 2.2. KΣ,Γ ⊗ {0, 1} andKΣ,Γ {0, 1} are homeomorphic spaces. Proof. From Theorem 2.1 and from the fact that any compact Hausdorﬀ space X is homeomorphic to a subspace of (C(X)∗ , w∗ ), i.e the topology on X is the weak topology generated by all continuous functions on X, it follows that KΣ,Γ ⊗{0, 1} and KΣ,Γ {0, 1} are homeomorphic spaces. Corollary 2.3. C(KΣ,Σ ⊗ {0, 1}) and CDΣ,Σ (K) are isomorphic Riesz spaces. The proof follows immediately from the above theorem, which is the main result of [6]. Corollary 2.4. If KΣ has no isolated points, then the spaces CD0 (K, E) and C(A(K), E) are isometrically Riesz isomorphic spaces. From Corollary 2.4 and from the Banach-Stone theorem it follows that the KakutaniKrein compact space of CD0 (K) space is the Aleaxandroﬀ Duplicate A(K) of KΣ .

References [1] Y. Abramovich and A.W. Wickstead: “Remarkable classes ofunitial AM-spaces”, J. Math. Anal. Appl., Vol. 180, (1993), pp, 398–411. [2] P.S. Alexandroﬀ and P.S. Urysohn: Memoire sur les espaces topologiques compacts, Verh. Kon. Akad.Wetensch. Naturkunde. 14, Amsterdam, 1929. [3] S. Alpay and Z. Ercan: “CD0 (K, E) and CDw (K, E) spaces asBanach lattices”, Positivity, Vol. 3, (2000), pp. 213–225. [4] R.E. Chandler, G.D. Faulkner, J.P. Guglielmi and M.C. Memory: “Generalizing the Alexandroﬀ-Urysohn double circumference construction”, Proc. Amer. Math. Soc., Vol. 83(3), (1981), pp. 606–608. [5] R. Engelking: “On the double circumference of Alexandroﬀ”, Bull. Acad. Pol. Sci. Ser. Math., Vol. 16, (1968), pp. 629–634. [6] Z. Ercan: “A concrete description of CD0 (K)-spaces as C(X)-spaces and its applications”, Proc. Amer. Math. Soc., Vol. 132(6), (2004), pp. 1761–1763. [7] K. Kunen and J.E. Vaughan: Handbook of Set-TheoreticTopology, North-Holland, 1984.

M. Caglar ¸ et al. / Central European Journal of Mathematics 4(3) 2006 371–375

375

[8] V. Troitsky: “On CD0 (K)-spaces”, Vladikavkaz. Mat. Zh., Vol. 6(1), (2004), pp. 71–73.

DOI: 10.2478/s11533-006-0021-x Research article CEJM 4(3) 2006 376–394

Non functorial cylinders in a model category J.M. Garc´ıa-Calcines∗, P.R. Garc´ıa-D´ıaz, S. Rodr´ıguez-Mach´ın Department of Fundamental Mathematics, University of La Laguna, Canary Islands, Spain

Received 2 November 2005; accepted 18 April 2006 Abstract: Taking cylinder objects, as defined in a model category, we consider a cylinder construction in a cofibration category, which provides a reformulation of relative homotopy in the sense of Baues. Although this cylinder is not a functor we show that it verifies a list of properties which are very closed to those of an I-category (or category with a natural cylinder functor). Considering these new properties, we also give an alternative description of Baues’ relative homotopy groupoids. c Versita Warsaw and Springer-Verlag Berlin Heidelberg. All rights reserved. Keywords: Model category, cofibration category, cylinder object, homotopy groupoid MSC (2000): 55U35, 55P05

1

Introduction

In 1967, D. Quillen introduced the notion of model category. It consists of a category M together with three classes of morphisms, called ﬁbrations, coﬁbrations and weak equivalences, satisfying certain axioms that give suﬃcient conditions in the category to develop a homotopy theory (see [8]). It is one of the most well-known and extended approaches in axiomatic homotopy theory. The notion of model category is autodual; this means that if M is a model category, then so is the opposite category, where the roles of ﬁbrations and coﬁbrations are interchanged. Therefore, a model category has two faces which are dual one to each other. In 1989, H.J. Baues presented in [2] the notion of coﬁbration category, based on a set of axioms which is weaker than the one given by Quillen. This set of axioms is given on two classes of morphisms: coﬁbrations and weak equivalences and it takes the essential properties from Quillen’s. As ﬁrst examples of coﬁbration categories we have the categories with a natural cylinder. These are categories ∗

E-mail: [email protected]

J. Garc´ıa-Calcines et al. / Central European Journal of Mathematics 4(3) 2006 376–394

377

with an initial object, in whose structure there exist a cylinder functor I : C → C, natural transformations ı0 , ı1 : 1C → I, ρ : I → 1C and a class of morphisms, called coﬁbrations, verifying a certain set of axioms (to be found on pp. 18 and 19 of [2]). However, there are many examples of coﬁbration categories which do not come from any natural cylinder (Baues gives a large number of examples arising naturally within algebraic topology). The framework in which we will be immersed throughout this paper is a coﬁbration category with initial object ∅. However, the reader can also think that we are considering a (proper) model category. In such context the cylinders are not, in general, functors. They are determined from a not necessarily unique factorization of a morphism into a coﬁbration followed by a weak equivalence and such factorization need not be canonical. There are interesting examples of coﬁbration categories without functorial factorizations. W.G. Dwyer proved recently that the category of bounded chain complexes of ﬁnitely generated Abelian groups with the standard (Quillen) model structure does not have functorial factorizations (unpublished). It seems that there is no functorial cylinder in this category. On the other hand D.C. Isaksen has given a strict model structure in the category of pro-objects of a proper model category [7]. In such structure the factorizations are not functorial. In this article we will consider what we call the cylinder construction, which is nothing else but the cylinder object as deﬁned by Quillen in a model category [8]. That is, a cylinder of an object A consists of any factorization of the morphism {1, 1} : A A → A into a coﬁbration followed by a weak equivalence: {1,1}

A AI$

/ A = zz z z zzρ zz ∼

II II {ı0 ,ı1 } III $

ZA

Furthermore, for any morphism f : A → B we can also establish a cylinder Zf : ZA → ZB (see §2 and §4 for notation and more details). That will permit us to obtain an equivalent and more manageable notion of Baues’ relative homotopy. Our main objective in this paper is to give a survey of several new interesting properties that are veriﬁed by this cylinder construction. Despite the lack of functors, these properties are very close to the ones given by a cylinder functor and they will permit us to develop new techniques of proofs in the homotopy theory of such categories. Among others, the following properties stand out: • Taking certain choices of cylinders they preserve, in some sense, the composition of morphisms and the identities. Explicitly, if f : A → B and g : B → C are composable morphisms, we can choose composable cylinders Zf and Zg such that their composition is a cylinder of gf : ZAFF

Z(gf )

FF FF Zf FF "

/ ZC x< x xx xxZg x x

ZB

378

J. Garc´ıa-Calcines et al. / Central European Journal of Mathematics 4(3) 2006 376–394

On the other hand, the identity of a cylinder, 1ZA : ZA → ZA, is a cylinder of the identity of A. • Any cylinder of f : A → B gives rise to the identities

ιε

/B

f

A

ZA

ρ

ιε

/ ZB

Zf

Zf

ZA

/ ZB

A

f

ρ

/B

• Given any commutative square, we can choose cylinders of the corresponding morphisms such that they conserve this commutativity. In addition, if the square is a pushout along a coﬁbration, then it is possible to obtain cylinders such that the resulting square is another pushout: A r

D

f

s

/B

ZA

g

=⇒

Zr

/C

ZD

Zf

Zs

/ ZB

Zg

/ ZC

• Every coﬁbration i : B −→ A veriﬁes the homotopy extension property (H.E.P.), that is, given Zi : ZB −→ ZA any cylinder of i, ε ∈ {0, 1} and any commutative diagram ıε B ∼ / ZB i

G

A

α

/X ∼

ıε

c

Zi E

2 ZA

then the dotted arrow E : ZA −→ X exists. We have organized this paper as follows: in section 2, we give some preliminaries about the notation that will be used. In addition, the most important properties in a coﬁbration category will be recalled. In section 3 we give examples of coﬁbration categories in which the factorizations are not functorial; we do explicit factorizations in the category of bounded below chain complexes of ﬁnitely generated Abelian groups and in the category of pro-objects of a given proper model category. In section 4 we present the cylinder ZA of an object A and the cylinder Zf : ZA → ZB of a morphism f : A → B and we expose their more important properties. In section 5 an equivalent deﬁnition of relative homotopy is introduced by means of this cylinder construction. In the last section we give an alternative description of Baues’ homotopy groupoids using these techniques. Dedicated to Sergio, wherever he is.

J. Garc´ıa-Calcines et al. / Central European Journal of Mathematics 4(3) 2006 376–394

2

379

Preliminaries

2.1 Pushouts The most important categorical concept that will be used with relative frequency in this paper is the one of pushout. Our notation diﬀers a bit from the traditional one, but we think that it can improve the understanding of this manuscript: Given f : X → Y and g : X → Z in a category C, the pushout of f and g (if it exists) will be denoted by P {f, g} and {u, v} will denote the morphism induced by the usual universal property of pushout:

f

/Z

g

X

Y

g

f

v

/ P {f, g} {u,v} u

# 1A

f and g denote the corresponding cobase change morphisms. These morphisms could be labeled using similar symbols such as f , f, f, et cetera. On the other hand, it is possible to consider morphisms of the form {u, w} = {{u0 , u1 }, w}. If there is no possibility of confusion we will write {u0 , u1 , w}. In general, the reader can ﬁnd morphisms written as {h0 , h1 , ..., hn } in this paper. Given a commutative diagram of solid arrows: g

X

Y α

f

g

f

Y

/ P {f, g} β

γ g

X g

/Z f

/ Z f

/ P {f , g }

where the top and bottom faces are pushouts, the pushout morphism {g α, f β} : P {f, g} → P {f , g } will be denoted by α ∪γ β or simply by α ∪ β. If the pushouts are coproducts, we will write instead of ∪.

2.2 Coﬁbration categories A cofibration category (C,cof,we) consists of a category C together with two distinguished classes of morphisms, called cofibrations and weak equivalences. This structure should verify the C1, C2, C3 and C4 axioms. Before recalling these axioms we must clarify some / ” and the weak equivalences by “ ∼ / ”. points: The coﬁbrations are denoted by “ / A morphism in C which is both a coﬁbration and a weak equivalence is called trivial

380

J. Garc´ıa-Calcines et al. / Central European Journal of Mathematics 4(3) 2006 376–394

cofibration and is represented as “ / ∼ / ”. On the other hand, an object X is fibrant if every trivial coﬁbration with X as domain admits a retraction. • C1: Composition axiom · The composition of coﬁbrations is a coﬁbration. · The isomorphisms are trivial coﬁbrations. · Given f : A −→ B and g : B −→ C, morphisms in C, if two of the morphisms f , g and gf are weak equivalences, then so is the third. • C2: Pushout axiom Let i : B / / A be a coﬁbration and f : B −→ X a morphism. Then there exists the pushout P {i, f }, and i is a coﬁbration. Furthermore, the left properness condition is also satisﬁed, that is, when f is a weak equivalence then so is f . • C3: Factorization axiom Every morphism f : X −→ Y of C admits a factorization f = qj, where j : X / / Z is a coﬁbration and q : Z ∼ / Y is a weak equivalence. For simplicity, such factorization will be written as (j, Z, q). • C4: Fibrant model axiom Every object X of C admits a trivial coﬁbration rX : X / ∼ / RX, where RX is a ﬁbrant object. The trivial coﬁbration rX : X / ∼ / RX is called fibrant model of X. We will also consider model categories. We refer to [8] as a basic and standard reference on this subject. Other references include [4], [5] and [6]. We do not assume that model categories have functorial factorizations. Following [5], a model category is called left proper, when it satisﬁes the left properness condition. The obvious dual notion is right proper. Then, a model category is called proper whenever it is left and right proper. It is important to remark that every left proper model category is a coﬁbration category ([2]). Among the most important technical results given in any coﬁbration category we are particularly interested in the following ones (see [2]). Also any model category satisﬁes (a), (b)(i) and (c). Proposition 2.1. (a) Given any trivial cofibration i : B / ∼ / A and given any morphism f : B → X with X fibrant, there is an extension of f relative to i, that is, a morphism f˜ : A → X verifying f˜i = f. (b) Consider the commutative diagram: Bo α

f

γ

B o

A

f

g

A

/C

g

β

/ C

(i) Suppose that α, γ and β are cofibrations and there exist the pushouts P {f, g}, P {f , g }. If {f , α} : P {γ, f } → B is a cofibration or {g , β} : P {γ, g} → C is a cofibration then α ∪ β : P {f, g} → P {f , g } is a cofibration.

J. Garc´ıa-Calcines et al. / Central European Journal of Mathematics 4(3) 2006 376–394

381

(ii) If α, γ and β are weak equivalences and one of f, g and one of f , g is a cofibration, then α ∪ β : P {f, g} → P {f , g } is a weak equivalence. (c) Given any commutative square f j = gi, where i is a cofibration, there exist an object E and morphisms α, α and β such that the following diagram is commutative: j

B

~

>E

i

α

~

f ∼

α

A

/C

β g

/D

Furthermore: • If f is a weak equivalence then α is a trivial cofibration. • If g is a weak equivalence then α is also a weak equivalence. • If j is a cofibration then α is also a cofibration. 2.2.1 Relative homotopy in the sense of Baues Now we recall the notion of relative homotopy in a coﬁbration category. Given a coﬁbration i : B / / A, a relative cylinder of i is a triple ({j0 , j1 }, Z i , σ) coming from the factorization of the morphism {1, 1} : P {i, i} −→ A, by the C3 axiom. When X is ﬁbrant and f0 , f1 : A → X are morphisms verifying f0 i = f1 i, it is said that f0 is homotopic to f1 relative to i if there is relative cylinder of i, ({j0 , j1 }, Z i , σ), and a morphism H : Z i −→ X such that H{j0 , j1 } = {f0 , f1 }. H is called a homotopy between f0 and f1 relative to i (in symbols H : f0 f1 rel. i). This deﬁnition is independent on the choice of the relative cylinder and it is an equivalent relation. The homotopy bracket is deﬁned as the corresponding quotient set: [A, X]u{i} := Hom(A, X)u{i} ∼ where Hom(A, X)u{i} denotes the set of extensions relative to i of the morphism u : B → X, that is, the set {f : A → X | f i = u}. The relative homotopy is compatible with composition of morphisms. This is a consequence of the following results: Proposition 2.2. Let g : X → Y be a morphism between fibrant objects, i : B / / A a cofibration and u : B → X a morphism. Then there is a function g∗ : [A, X]u{i} → [A, Y ]gu{i} , [α] → [gα] In addition, if g is a weak equivalence, then g∗ is a bijection. Proposition 2.3. Suppose given a commutative square of the form

i

f

/B

B A

f

i

/A

382

J. Garc´ıa-Calcines et al. / Central European Journal of Mathematics 4(3) 2006 376–394

and let u : B → X be a morphism, where X is a fibrant object. Then f induces a function

f ∗ : [A, X]u{i} → [A , X]uf {i } , [α] → [αf ]. In addition, if f and f are weak equivalences, or the square is a pushout then f ∗ is a bijection.

3

Examples of cofibration categories without functorial factorizations

Now we give two examples of coﬁbration categories in which the factorizations are not functorial. Namely, the proper model category of bounded chain complexes of ﬁnitely generated Abelian groups and the proper model category pro-C of a given proper model category C.

3.1 The proper model structure of bounded below chain complexes of ﬁnitely generated Abelian groups Let Ch+ (Abf.g. ) denote the category of bounded below chain complexes of ﬁnitely generated Abelian groups. Then this category with the usual Quillen’s (proper) model structure has not functorial factorizations. Bill Dwyer has shown us a proof of this fact but it is too lengthy to reproduce here. Recall that in such structure the coﬁbrations are the monomorphisms with projective cokernel (in each level), the ﬁbrations are the epimorphisms and the weak equivalences are the morphisms which induce isomorphisms in homology. Now we exhibit a factorization of any morphism f : X −→ Y in this category as f = qj, where j : X / / W is a coﬁbration and q : W ∼ / Y is a weak equivalence (and a ﬁbration). Proceeding by induction over the dimension, suppose constructed the homomorphisms jr : Xr → Wr and qr : Wr → Yr for r ≤ k, such that fr = qr jr , jr is a monomorphism with projective cokernel and qr is an epimorphism which induces isomorphisms in homology for r < k. Then consider the following commutative diagram of ﬁnitely generated Abelian groups which we explain below: Pk βk+1

Pk+1

pk+1

/ ker(∂ Y

k+1 ) sk+1

pk

nk

k

αk

/ Yk+1

/ B

Y ∂k+1

/ ker(∂ W ) sk / Wk

PULLBACK

/ Im(∂ Y

∂kW

k

k+1 )

nk

αk

/ ker(∂ Y ) k

sk

qk

/ Yk

/ Wk−1

∂kY

qk−1

/ Yk−1

The homomorphism αk is induced by qk in the obvious way and Bk is obtained as the pullback shown in the diagram. We consider βk+1 : Pk → Yk+1 a lifting of αk pk where pk : Pk → Bk is an epimorphism and Pk is a ﬁnitely generated free group. By taking Y another epimorphism pk+1 : Pk+1 → ker(∂k+1 ) with Pk+1 a ﬁnitely generated free group we set W X = {jk ∂k+1 , 0, sk nk pk } : Wk+1 → Wk Wk+1 = Xk+1 ⊕ Pk+1 ⊕ Pk and ∂k+1

J. Garc´ıa-Calcines et al. / Central European Journal of Mathematics 4(3) 2006 376–394

383

It is not diﬃcult to see that qk+1 = {fk+1 , sk+1 pk+1 , βk+1 } : Wk+1 → Yk+1 induces isomorphism in homology and jk+1 : Xk+1 → Wk+1 is a monomorphism with a projective cokernel.

3.2 Strict model structures for pro-categories Recall that the category pro-C of a given category C has as objects all coﬁltered diagrams in C and has morphisms deﬁned by Hompro-C (X, Y ) = lims colimt HomC (Xt , Ys ) We refer the reader to [1] or [3] for some background on pro-categories. Suppose that C is a model category and let f : X → Y be a morphism in pro-C. Then it is said that f is a strict weak equivalence (resp. strict cofibration) if f admits a level representation s → Ys is a weak equivalence (resp. coﬁbration) in C. → Y such that each f˜s : X f˜ : X By a level representation we mean a commutative diagram in pro-C of the form X ∼ =

f

X

/Y

f˜

∼ =

/ Y

where f˜ is a natural transformation between I-diagrams, being I a coﬁltered indexing category. The following result is proved in [7]: Theorem 3.1. If C is a proper model category then pro-C has a proper model structure in which the weak equivalences (resp. cofibrations) are the strict weak equivalences (resp. strict cofibrations) and the fibrations are defined by the right lifting property. If C is a simplicial model category then so is pro-C. A map in pro-C is a special trivial fibration (or special acyclic fibration, as in [7]) if it admits a coﬁnite directed level representation p : X → Y for which every t the relative matching map, deﬁned by Xt

Mt p

/ lims
is a weak equivalence and a ﬁbration. It is shown in [7] that every special trivial ﬁbration is a strict trivial ﬁbration, that is, a strict ﬁbration and a strict weak equivalence. Any morphism f : X → Y in pro-C factors as a strict coﬁbration i : X → Z followed by a special trivial ﬁbration p : Z → Y. Indeed, as every map has a level representation ([1]) and every pro-object is isomorphic to a coﬁnite directed pro-object (in fact, for every coﬁltered category I there exists a coﬁnite directed set J and a coﬁnal functor J → I, see [3]), we may suppose that f is a level representation indexed by a coﬁnite directed set. By induction suppose that the maps it : Xt → Zt and pt : Zt → Yt have already deﬁned

384

J. Garc´ıa-Calcines et al. / Central European Journal of Mathematics 4(3) 2006 376–394

for t < s. Consider the map Xs → Ys × limt<s Yt limt<s Zt and factor it into a coﬁbration is : Xs → Zs followed by a weak equivalence and ﬁbration ps : Zs → Ys × limt<s Yt limt<s Zt . This extends the factorization to level s. Remark 3.2. Observe that this construction does not give a functorial factorization, even in the case that C has functorial factorizations. The key fact is that the reindexing into a coﬁnite directed level representation is not functorial.

4

The cylinder construction: Main properties

We recall the notion of cylinder of any object in a model category. Such deﬁnition will be fundamental in our work. In general, this cylinder is not functorial. However we will see that it has new interesting properties which are very similar to that of a functor; in particular, to the axioms of a category with a natural cylinder functor in the sense of Baues. We remark that our constructions are also valid in any coﬁbration category with binary coproducts (or just considering coﬁbrant objects in any coﬁbration category ). Definition 4.1. Let A be an object in a model category. A cylinder of A is a factorization: {1,1}

II II {ı0 ,ı1 } III $

/ A z= z zz zzρ z z ∼

A AI$

ZA

Example 4.2. Consider the category Ch+ (Abf.g. ) of bounded below chain complexes of ﬁnitely generated Abelian groups. Then, maintaining the same notation as the one introduced in the above section, for any object A a cylinder ({ı0 , ı1 }, ZA, ρ) of A is given by (ZA)k+1 = Ak+1 ⊕ Ak+1 ⊕ Pk+1 ⊕ Pk , ı0 k+1 (a) = (a, 0, 0, 0),

ZA A A ∂k+1 = {ı0k ∂k+1 , ı1k ∂k+1 , 0, sk nk pk }

ı1 k+1 (a) = (0, a, 0, 0),

ρk+1 = {idAk+1 , idAk+1 , sk+1 pk+1 , βk+1 } : (ZA)k+1 → Ak+1 . Remark 4.3. Observe that, in general, Z( ) is not a functor since it depends on the factorization of {1, 1}. Z n A will denote a cylinder of Z n−1 A, n ≥ 1 (Z 0 A = A) and the ρ ρ ρ ρ composite Z n A −→ Z n−1 A −→ ... −→ ZA −→ A by ρn . On the other hand, sometimes we will write ıε A , ε ∈ {0, 1} and ρA to indicate that such morphisms are associated to the object A. Proposition 4.4. The set of cylinders of A has the structure of a directed set considering ({ı0 , ı1 }, ZA, ρ) ≤ ({ı0 , ı1 }, Z A, ρ ) if and only if there is a trivial cofibration α : ZA / ∼ / Z A satisfying α{ı0 , ı1 } = {ı0 , ı1 } and ρ α = ρ.

J. Garc´ıa-Calcines et al. / Central European Journal of Mathematics 4(3) 2006 376–394

385

Proof. The reﬂexive and transitive properties are straightforward. On the other hand, given ({ı0 , ı1 }, ZA, ρ) and ({ı0 , ı1 }, Z A, ρ ) cylinders of A, we obtain from proposition 2.1 (c) an upper bound: {ı0 ı1 }

A A% /

/ Z A { w {ı ww 0 ,ı1 } % {www α {ı0 ,ı1 } ρ Z 9 AGG t GG tt G tt t9 α ∼ ρ GG# /A ZA ∼

∼

∼

∼

ρ

Let f : A → B be any morphism. From any cylinder of A and any factorization of the pushout morphism {f ρ, 1, 1} : Zf1 → B, we obtain a cylinder of B : AA {ı0 ,ı1 }

f f

/B B

ZA

{1,1}

{ı0 ,ı1 }

f f

/ Z1 f

f

1

ZB

{f ρ,1,1} ρ

∼

/8 B "

fρ

Definition 4.5. The morphism Zf := f 1 (f f ) : ZA → ZB is called cylinder of f. Remark 4.6. We can easily iterate this construction. We denote by Z n f for a cylinder of Z n−1 f , n ≥ 1, (Z 0 f = f ). Similarly, we can also consider f n = (f n−1 )1 , n ≥ 1, (f 0 = f ) and Zfn := Zf1n−1 the pushout of {ı0 , ı1 } and f n−1 f n−1 . From the deﬁnition of ZA, the inclusions in the cylinder ι0 , ι1 : A → ZA and the projection of the cylinder ρ : ZA → A arise naturally. Although they are not natural transformations the following assertion is straightforward to check: Proposition 4.7. Any cylinder of f gives rise to the identities ρZf = f ρ and ιε f = (Zf )ιε Now we will see the behavior of the cylinder construction with respect to the identity and the composition of morphisms. Proposition 4.8. If ({ı0 , ı1 }, ZA, ρ) is any cylinder of A then 1ZA : ZA → ZA is a cylinder of 1A . Proof. Since 1A 1A = 1AA and the cobase change of the identity is the identity, we can choose the triple (1ZA , ZA, ρ) as a factorization of ρ.

386

J. Garc´ıa-Calcines et al. / Central European Journal of Mathematics 4(3) 2006 376–394

Proposition 4.9. If Zf and Zg are cylinders of f and g such that the composition (Zg)(Zf ) exists, then such composition is a cylinder of gf. Proof. Consider f : A → B and g : B → C and the following diagram, where all the squares are pushouts: f f / gg / AA BB C C {ı0 ,ı1 }

{ı0 ,ı1 }

ZA

/ Z1 f

f f

f1

{ı0 ,ı1 }

gg

/ Z1 gf

f1

ZB

/ Z1

gg

EE EE{gρ ,1,1} EE EE E" ∼ /C ZC

g

1

g

ρ

Taking (g 1 , ZC, ρ ), any factorization of {gρ , 1, 1}, the composite g 1 g g is a cylinder of g. Since ρ g 1 f 1 = {gf ρ, 1, 1} then we have that (gf )1 = g 1 f 1 . The cylinder construction also veriﬁes the preservation of the commutative squares as well as the pushouts along coﬁbrations: Proposition 4.10. Let gf = sr be a commutative square. If Zf and Zr are cylinders of f and r from the same cylinder of A then there exist Zs and Zg cylinders of s and g such that the corresponding square is commutative: A r

f

/B

Zf

ZA

g

=⇒

Zr

/ ZB

Zg

s / Zs / D C ZD ZC In addition, if the original square is a pushout and f (or r) is a cofibration, then the second square can be obtained as a pushout.

Proof. Consider the following commutative diagram where all the squares are pushouts, with the exception of the top and bottom faces of the cube: f f

/B B gg rr ss / DD C C {ı0 ,ı1 }

AA

{ı 0 ,ı1 }

/ Z1 / f gg /Q / f1 1 r

ZA f f

rr

Zr1

r1

ZD

{ı0 ,ı1 }

ss

ss

/ Z1 s

{ı0 ,ı1 }

f1

/ ZB gg

/ Z1 g

J. Garc´ıa-Calcines et al. / Central European Journal of Mathematics 4(3) 2006 376–394

387

1 1 We point out that Q is both the pushout object Zgf and the pushout object Zsr . 1 1 Considering (g , Z C, ρ ) and (s , Z C, ρ ), factorisations of {gρ, 1, 1} and {sρ, 1, 1} respectively, we obtain by proposition 2.1 (c) a new cylinder of C :

g1 f 1

Q /

/ Z C { w ww {ww α ρ s1 r 1 HH : vZC H v HH vv :v α ∼ ρ HH# /C Z C ρ ∼

∼

∼

∼

Then, taking Zs := α s1 (s s) we have Zg = αg 1 (g g) and the ﬁrst part of the proof is completed. Now suppose that gf = sr is a pushout where f or r is a coﬁbration. In this case it is easy to construct the same diagram but where all the squares are pushouts. Considering the pushout of r1 and f 1 a new cylinder of C is obtained where {ı0 C , ı1 C } := r1 f 1 {ı0 , ı1 } and ρC := ρD ∪ ρB . Observe that (r1 , ZC, ρC ) and (f 1 , ZC, ρC ) are factorizations of {gρ, 1, 1} and {sρ, 1, 1}, respectively. Remark 4.11. From the above proposition we observe that, considering the same source cylinder ZA for two cylinder constructions of f : A → B (say Zf : ZA → ZB and Z f : ZA → Z B) we can ﬁnd cylinders of the identity of B such that the following diagram is commutative: ZA ZA

Zf

Zf

/ ZB

/ Z B

/ Z B

/ Z B

Lemma 4.12. Consider the left commutative diagram: D α

D

f

/C o g γ

f

/ C o

B

g

β

B

ZD =⇒

Zα

Zf

Zγ

ZD

/ ZC o Zg

Zf

/ ZC o

ZB

Zg

Zβ

ZB

Given cylinders of f , g, α and β as shown in the diagram we can find cylinders of f , g and γ such that the new diagram is commutative. Proof. Using the coproduct functor and taking {ı0 , ı1 } from each cylinder of D, C, B, D and B we construct the pushouts Zα1 , Zγ1 , Zβ1 , Zf1 and Zg1 . We have three coﬁbrations with common source C C in which we can construct the pushouts (1) and (2). From the new morphisms we can also take the pushout (3), which allows us to consider the pushout morphism R → C (as shown in the diagram) and a factorization of it by a

388

J. Garc´ıa-Calcines et al. / Central European Journal of Mathematics 4(3) 2006 376–394

coﬁbration followed by weak equivalence:

∼

u C C R( RR {ı0 ,ı1l }llll R{ı RR0R,ı1 } l {ı RRR 0 ,ı1 } lll l RR( l 1 ull 1 (2) (1) Zf Zg1 Z γ (QQ llv QQQ l l QQdQ clll a QQQ b lll Q l Q l QQ( vlll (3) P S) SSS u Q lll SSS d cllll SSS ll SSS SSS u lllll ) l R ty tt t t ytt λ ZC JJ JJ {{γρ,1,1},{g ρ,1,1}} {{f ρ,1,1},{γρ,1,1}} J ρC JJJ $ p . C

We take {ı0 C , ı1 C } := λdc{ı 0 , ı1 } = λcd{ı0 , ı1 }. It is not diﬃcult to check that ({ı0 C , ı1 C }, ZC , ρC ) is a cylinder of C . Taking γ 1 := λdc = λcd, (f )1 = λda and (g )1 = λcb we conclude the proof. Using the previous results the proofs of the following propositions become straightforward and are left to the reader. Proposition 4.13. Given a pushout along a cofibration in which a morphism of the form {uv, wz} is defined, we can take cylinders of u, v, w and z verifying that {ZuZv, ZwZz} is a cylinder of {uv, wz}. Proposition 4.14. Given a morphism of the form α ∪ β, there exist cylinders of α and β, Zα, Zβ, such that Zα ∪ Zβ is a cylinder of α ∪ β.

5

The relative homotopy relation induced by the cylinder construction and the homotopy extension property

In this section we will compare the relative homotopy given in a coﬁbration category to the relative homotopy induced by the cylinder construction. By this reason, from now on we will consider a coﬁbration category (with binary coproducts or within coﬁbrant objects) as a framework. The reader can also think that we are in a left proper model category.

Cylindrical homotopy Inspired on ideas coming from a category with a natural cylinder a new reformulation of relative homotopy arises in a coﬁbration category. Due to its origin we will call it cylindrical homotopy. Recall that the object Zi1 is the pushout of {ı0 , ı1 } : B B → ZB and i i : B B → A A (see deﬁnition 4.5).

J. Garc´ıa-Calcines et al. / Central European Journal of Mathematics 4(3) 2006 376–394

389

Definition 5.1. Let i : B / / A be a coﬁbration and consider f0 , f1 ∈ Hom(A, X)u{i} . We will say that f0 is homotopic to f1 relative to i (f0 f1 rel. i) if there exist a cylinder Zi, and a morphism H : ZA → X such that Hi1 = {uρ, f0 , f1 }: {uρ,f0 ,f1 }

Zi1

/X |> | || ||H | ||

C! C CC C i1 CC !

ZA

Theorem 5.2. Relative homotopy and cylindrical homotopy are equivalent.

∼

B / ı0

/A

∼

Proof. First of all we observe that Zi1 can also be obtained in two steps, by pushouts:

ZB /

i

i

B /

ı0

iı1

/ P {ı0 , i}

/A

i

P {ı0 , i} /

i

iı1

/ Z1 i

Now consider the pushout: {iρ,1}∪1

/ P {i, i}

Zi1 i1

i1

/ Zi ZA Applying proposition 2.1.(b) we have that {iρ, 1} ∪ 1 is a weak equivalence and therefore, by the C2 axiom w is also a weak equivalence. Taking {j0 , j1 } := i1 and σ := {ρ, 1, 1}, it is plain to see that ({j0 , j1 }, Z i , σ) is a relative cylinder of i in the sense of Baues. The rest of the proof is plain and left to the reader. w

Obviously, we will not make any distinction between the cylindrical homotopy and the Baues homotopy and we will just write relative homotopy.

5.1 Homotopy extension property There is a notion of homotopy extension property which is analogous to that given in categories with a natural cylinder. The homotopy extension property will be an important tool in the next section. Definition 5.3. A morphism i : B −→ A is said to verify the homotopy extension property (H.E.P.) if given Zi : ZB −→ ZA any cylinder of i, and morphisms α : A −→ X, G : ZB −→ X with X ﬁbrant, satisfying Gıε = αi, then there exists a morphism E : ZA −→ X such that Eıε = α and EZi = G (ε ∈ {0, 1}): ıε ∼

B i

A

α

/ ZB

G

/X ∼

ıε

c

Zi E

2 ZA

390

J. Garc´ıa-Calcines et al. / Central European Journal of Mathematics 4(3) 2006 376–394

For the sake of simplicity the morphism E will be called an extension of the square Gıε = αi. Proposition 5.4. Every cofibration i : B /

/ A verifies the H.E.P.

Proof. Consider a cylinder Zi : ZB −→ ZA of i and a commutative square Gıε = αi. By proposition 2.1(a) we can take E : ZA −→ X as an extension of {α, G} : P {i, ıε } −→ X relative to {ıε , Zi} : P {i, ıε } / ∼ / ZA. In the next result we will denote by EαG any extension of the square Gı0 = αi.

Proposition 5.5. If α α rel. i and G G rel. {ı0 , ı1 }, then EαG ı1 EαG ı1 rel. i. Proof. Consider homotopies H : α α rel. i and K : G G rel. {ı0 , ı1 }. Applying := Kλ as indicated: proposition 2.1 (a) and (c), we can obtain a morphism K {ı0 ,ı1 ,Zı0 ,Zı1 } 1 / / Z 2B Z{ı 0 ,ı1 }

K

/X <

∼

∼

∼

∼

{} {{ { {} { {ı0 ,ı1 }1 ;v • EE K EE vv 2 v ρ E v EE v; λ ∼ " /B 2 Z B 2 ρ

It is plain to see that Eı1 is the desired homotopy, where E is an extension of the following square: ı Zi1 / ∼0 / Z(Zi1 ) i1

ZA

G ,E G } {K,E α α

H

/X

Corollary 5.6. Let i : B / / A be a cofibration, X a fibrant object and G : ZB → X a morphism verifying Gı0 = u and Gı1 = v, then G : [A, X]u{i} → [A, X]v{i} ,

[α] → [EαG ı1 ]

is a bijection. Furthermore, if G G rel. {ı0 , ı1 }, then G = G . −1

Proof. The good deﬁnition of G is a consequence of the above proposition. (G ) obtained considering similar squares in which ı0 is replaced by ı1 .

6

is

Reformulation of Baues’ homotopy groupoids

In this section we are dealing with giving another description of the relative homotopy groupoid (of a ﬁbrant object). As we will see, the properties of the cylinder construction as well as the homotopy extension property will be fundamental in our description.

J. Garc´ıa-Calcines et al. / Central European Journal of Mathematics 4(3) 2006 376–394

391

First of all we will give a brief review of Baues’ homotopy groupoids: Given a ﬁbrant object X and ({j0 , j1 }, Z i , σ) a relative cylinder of a coﬁbration i : B / / A, the homotopy groupoid Hi (X) is the category whose objects are the elements of Hom(A, X) and whose morphisms are the elements of the homotopy bracket: Hi (X)(f0 , f1 ) = [Z i , X]{f0 ,f1 }{j0 ,j1 } The inverse and composition operations are given by: ( )−1 : Hi (X)(f0 , f1 ) → Hi (X)(f1 , f0 ) ;

[F ]−1 := [F]

: Hi (X)(f0 , f1 ) × Hi (X)(f1 , f2 ) → Hi (X)(f0 , f2 ) ;

[F ] [G] := [F G]

where F = F α and F G = {F, G} α are constructed by means of proposition 2.1 (a) and (c) applied to the diagrams: {j1 ,j0 }

/ Zi

F

∼

∼

∼

∼

j0 ∪j1

∼

∼

/ P {j1 , j0 }{F,G} / X < z tt t t tt ztt β {F,G} {j0 ,j1 } :v E JJJ JJ vv {σ,σ} J vv v JJ vα γ JJ : v %/ ∼ A Zi σ

P {i, i} /

∼

~ ~~ ~ ~~ β F {j0 ,j1 } ;v E AA v AA vv σ vvα γ AAA v ; v ∼ /A Zi σ

/X B

∼

P {i, i} /

The identity in f is [f σ]. From now on, X will denote a ﬁbrant object. Definition 6.1. Let i : B / / A be a coﬁbration and let Zi : ZB −→ ZA be any cylinder construction. We consider the category Hi (X), whose objects are the elements of Hom(A, X) and whose morphisms are the elements of 1}

Hi (X)(f0 , f1 ) := [ZA, X]{uρ,f0 ,f1 }{i

We note that, if Z i is another cylinder of i then, applying proposition 2.1(c) to the square ρ i 1 = ρi1 , we can obtain trivial coﬁbrations α and α . By proposition 2.3 it is easy to check that the above deﬁnition does not depend on the choice of the cylinder construction of i. Our goal is to prove that Hi (X) is a groupoid which is isomorphic to Hi (X). In order to complete the description of Hi (X) we will give the corresponding operations. Applying the second part of proposition 4.10 to the pushout of {ı0 , ı1 } and i i, we obtain a new pushout: ZiZi / ZA ZA ZB ZB Z{ı0 ,ı1 }

Z 2B Now

Z(ii)

Z{ı0 ,ı1 }

/ Z(Z 1 ) i

392

J. Garc´ıa-Calcines et al. / Central European Journal of Mathematics 4(3) 2006 376–394

• Let F be a homotopy F : f0 f1 rel. i, where f0 , f1 ∈ Hom(A, X)u{i} . Applying the H.E.P. we can take EF , an extension of the diagram: Zi1 /

ı0 ∼

/ Z(Z 1 ) i

{uρ2 ,F,f0 ρ}

i1

f0 ρ

ZA

Zi1

/X d

EF

1 Z 2A

∼

ı0

Then F := EF ı1 is a homotopy F : f1 f0 rel. i. • Consider homotopies F : f0 f1 rel. i and G : f1 f2 rel. i. Applying again the H.E.P. we can take, E(F,G) , an extension of the diagram: Zi1 / i1

/ Z(Z 1 ) i

Zi1

{uρ2 ,F ,G}

ZA

ı0 ∼

f1 ρ

/X d ∼

ı0

E(F,G)

1 Z 2A

Obviously F ∗ G := E(F,G) ı1 is a homotopy F ∗ G : f0 f2 rel. i. Proposition 6.2. The composition and the inverse operations in Hi (X), given by ( )−1 : Hi (X)(f0 , f1 ) → Hi (X)(f1 , f0 ) ;

[F ]−1 := [F]

∗ : Hi (X)(f0 , f1 ) × Hi (X)(f1 , f2 ) → Hi (X)(f0 , f2 ) ;

[F ] ∗ [G] := [F ∗ G]

are well defined. Proof. If H : F0 F1 rel. i1 then we can take the homotopy {uρ3 , H, f0 ρ2 } : {uρ2 , F0 , f0 ρ} {uρ2 , F1 , f0 ρ} rel. {ι0 , ι1 }. 1 rel. i1 . On the other hand if F0 F1 0 F Hence, by proposition 5.5 we have that F rel. i1 and G : G0 G1 rel. i1 then F0 ∗ G0 F1 ∗ G1 rel. i1 , taking into ac0 , G0 } count the same proposition and considering the homotopy {uρ3 , F, G} : {uρ2 , F 1 , G1 } rel. {ι0 , ι1 }. Here F denotes a homotopy F : F 0 F 1 rel. i1 . {uρ2 , F Now we consider w∗ : Hi (X) −→ Hi (X) given by w∗ (f ) = f , for all f ∈ Hom(A, X) and w∗ ([F ]) = [F w], for all [F ] ∈ Hi (X)(f0 , f1 ). Here w is the morphism obtained in the pushout of theorem 5.2. Observe that, by proposition 2.3, w∗ is a bijection.

J. Garc´ıa-Calcines et al. / Central European Journal of Mathematics 4(3) 2006 376–394

393

Theorem 6.3. If [F ] ∈ Hi (X)(f0 , f1 ) and [G] ∈ Hi (X)(f1 , f2 ), then: (a) w∗ ([F ]−1 ) = (w∗ ([F ]))−1 (b) w∗ ([F G]) = w∗ ([F ]) ∗ w∗ ([G]) Therefore, Hi (X) has a groupoid structure which is isomorphic to Hi (X). Proof. Consider the diagram which gives the morphism [F ]−1 = [F] = [F α], that is, the inverse of [F ] in the groupoid Hi (X), as shown at the beginning of this section. Taking ({ı0 , ı1 }, ZE, ρ) a cylinder of E we can consider a similar diagram where the exterior square is the same but we replace E, α, β and γ by ZE, ı1 α, ı1 β and γρ respectively. In addition, F will denote the extension of F relative to ı1 β. On the other hand, let F : Z(Z i ) −→ X be the morphism obtained by applying proposition 2.1 (a) and (c) to the square: {w,j0 ρ,j0 σ,1} 1 / Zi Z{j 0 ,j1 }

∼

{j1 ,j0 }1

∼

Z(Z i )

σρ

σ

/A

It is easy to check that any extension of {F , f0 γ, F } relative to β 1 is also an extension of F relative to ı1 β, so we can take F as such an extension and therefore w∗ ([F]) = [F ı1 αw] = [F ZαZw ı1 ]. The proof of (a) comes from the fact that F ZαZw is an extension of the square: ı Zi1 / ∼0 / Z(Zi1 ) i1

ZA

f0 ρ

{uρ2 ,F w,f0 ρ}Z1

/X

Now consider the composition of [F ] and [G] in Hi (X) given by the following diagram: / P {βj1 , j0 } {F ,G} y ss ssλ s s yss H {j0 ,j1 } v: E LLLL v v LL {γ,σ} vv μ LLLL vv ν : v L %/ ∼ A Zi σ

P {i, i} /

βj0 ∪j1

/8 X

∼

∼

∼

∼

Although this is not the usual diagram of deﬁnition of the composition it is equivalent to that; therefore [F G] = [H ν]. As it was done in the proof of (a) we can take a similar diagram where the exterior square is the same and E , ν, λ and μ are replaced by Z(E ), ı1 ν, ı1 λ and μρ, respectively. Let H be the corresponding extension of {F , G} and let H : ZP {βj1 , j0 } → X be the morphism obtained after applying proposition 2.1.(a) and (c) to the square: 1 Z(βj 0 ∪j1 )

/ P {βj1 , j0 }

∼

(βj0 ∪j1 )1

{αw∪w,αj0 γ∪j0 σ,1∪1}

ZP {βj1 , j0 }

∼

{γ,σ}ρ

{γ,σ}

/A

394

J. Garc´ıa-Calcines et al. / Central European Journal of Mathematics 4(3) 2006 376–394

Finally taking H as an extension of {H , f1 μ, H } relative to λ the proof of (b) becomes easy to establish. The construction of this homotopy groupoid gives rise to the deﬁnition of the ﬁrst homotopy group (relative to i) of X. If we consider in : Zin / / Z n A, we obtain the n-th homotopy groups : πni (X, f0 ) = Hi n−1 (X)(f0 ρn−1 , f0 ρn−1 ) = [Z n A, X]{f0 ρ

n−1 in−1 ρ,f

0ρ

n−1 ,f

0ρ

n−1 }{in }

for n 1, where ρ0 = 1 and i0 = i. Furthermore, Baues homotopy groups ([2];II-§6), deﬁned by means of suspensions, are isomorphic to the homotopy groups relative to the initial coﬁbration and based on the 0 morphism.

References [1] M. Artin and B. Mazur: Etale homotopy, Lecture Notes in Maths, Vol. 100, SpringerVerlag, 1969. [2] H.J. Baues: Algebraic homotopy, Cambridge University Press, 1989. [3] D.A. Edwards and H.M. Hastings: Cech and Steenrod homotopy theories with applications to geometric topology, Lecture Notes in Mathematics, Vol. 542, Springer Verlag, 1976. [4] K. Hess: “Model categories in algebraic topology”, Appl. Categ. Struct., Vol. 10(3), (2002), pp. 195–220. [5] P.S. Hirschhorn: Model Categories and Their Localizations, Mathematical Surveys and Monographs, Vol. 99, Amer. Math. Soc, 2003. [6] M. Hovey: Model Categories, Mathematical Surveys and Monographs, Vol. 63, Amer. Math. Soc, 1999. [7] D.C. Isaksen: “Strict model structures for pro-categories, Categorical decomposition techniques in algebraic topology”, Prog. Math., Vol. 215, (2004), pp. 179–198. [8] D.G. Quillen: Homotopical Algebra, Lecture Notes in Maths, Vol. 43, SpringerVerlag, 1967.

DOI: 10.2478/s11533-006-0020-y Research article CEJM 4(3) 2006 395–412

On the periodicity of trigonometric functions generalized to quotient rings of R[x] Claude Gauthier∗ Department of Mathematics and Statistics, Universit´e de Moncton, Moncton, N.B. Canada

Received 15 June 2005; accepted 1 June 2006 Abstract: We apply a method of Euler to algebraic extensions of sets of numbers with compound additive inverse which can be seen as quotient rings of R[x]. This allows us to evaluate a generalization of Riemann’s zeta function in terms of the period of a function which generalizes the function sin z. It follows that the functions generalizing the trigonometric functions on these sets of numbers are not periodic. c Versita Warsaw and Springer-Verlag Berlin Heidelberg. All rights reserved. Keywords: Compound inverse, generalized trigonometric functions, zeta function MSC (2000): 30G35, 11M41

1

Introduction

The trigonometric and exponential functions have geometric interpretations from which we easily foresee their periodicity. It is then clear that the functions sin z and cos z are periodic of period 2π on R and C. Similarly the function ez is periodic of period 2πi on C, but is not periodic on R. These examples show that the domain of deﬁnition of a function is essential when one wants to examine its periodicity. To study the periodicity of a function which has no geometric interpretation, one needs a method to calculate its period. A problem of this kind has been addressed by Euler [3]. He has shown how to determine the period of the functions sin z and cos z by computing ζ(2), where ζ designates the Riemann zeta function. This period equals ∗

E-mail: [email protected]

396

C. Gauthier / Central European Journal of Mathematics 4(3) 2006 395–412

2 6ζ(2) and follows from Euler’s general formula ζ(2k) = (−1)k−1

22k−1 π 2k B2k , (2k)!

(1)

where k ∈ N∗ and B2k are the Bernoulli numbers. In this paper, we shall determine the period of functions which generalize the trigonometric functions to quotient rings of R[x]. These quotient rings are related to the notion of compound additive inverse introduced in [2] and [4]. Our method generalizes the one of Euler and follows in part [6], pp. 155-158.

2

The sets Am and Ep,ν

Let m ∈ N, m ≥ 2, and consider m copies of the monoid M made up of the positive real numbers R+ and the operation of addition. These m copies of M are assumed to share the same identity element. If a ∈ R+ and k ∈ I(m) = {0, 1, . . . , m − 1}, the expression a(k)m will represents a number of the kth of the m copies of M. To simplify the notation, we shall write a(k) for a(k)m and a for a(0) when there will be no risk of confusion. We assume that the elements 1(k) , k ∈ I(m), satisfy 1(k1 ) 1(k2 ) = 1((k1 +k2 )( mod m)) , for k1 , k2 ∈ I(m), and

1(k) = 0,

(2)

k∈In (m)

where In (m) is the subset of multiples of n in I(m), for n|m, n < m. We call m (k) addisym number (symmetry with respect to addition) every element k∈I(m) ak of the (k) and Cartesian product of the m copies of M. The m-addisym numbers k∈I(m) ak (k) will be said to be equivalents if their diﬀerence is a linear combination of k∈I(m) bk (k) expressions of the form k∈In (m) 1 , where n|m and n < m. We shall then write (k) (k) k∈I(m) ak = k∈I(m) bk . The set of m-addisym numbers, written Am , can be seen as (k) the set of equivalence classes of expressions k∈I(m) ak modulo the equations (2). If m = p is a prime number of N and ak = ak − minj∈I(p) {aj }, then k∈I(p) ak (k) is (k) called reduced expression of k∈I(p) ak . The number ak ∈ R+ is called kth coordinate (k) of k∈I(p) ak . We shall say that a p-addisym number is primary if its reduced expression is a number in R+ , that is if (k) ak = a0 . k∈I(p)

(k) is antiprimary if its reduced expression is the additive We shall say that k∈I(p) ak inverse of a primary number. We have shown in [2] that the set Am is isomorphic to the ring of polynomials with coeﬃcients in R quotiented by the ideal generated by Φm (x), which is the cyclotomic

C. Gauthier / Central European Journal of Mathematics 4(3) 2006 395–412

397

polynomial of order m. This means that Am R[x]/Φm (x). Therefore A2 is a ﬁeld isomorphic to R, while A3 , A4 and A6 are ﬁelds isomorphic to C. If ϕ designates the Euler totient function, then all others Am , ϕ(m) ≥ 4, are commutative rings having zero divisors. With the addition of corresponding coordinates and the multiplication of an element of A2 by an element of Am , every Am forms a vectorial space of dimension m − 1 on A2 , which will be designated by A2;m . We also have that if ϕ(m) ≥ 4, then the set of zero divisors of Am , designated by A◦m , is the union of ϕ(m)/2 subspaces of codimension 2 of A2;m . Note that A◦m contains no non-zero number of any of the m copies of M in Am . We designate by Am the set of all non-divisors of zero in Am and deﬁne the multiplicative inverse of a ∈ Am by the solution x ∈ Am of ax = 1. This inverse will be written 1/a. We now consider the case where m = p is a prime number of N. Let us associate (k) (jk) with each k∈I(p) ak ∈ Ap , p > 2, the numbers in Ap given by k∈I(p) ak , j ∈ I(p), (k) j = 0, 1. These numbers will be called p-addisym conjugates of k∈I(p) ak . One can extend this deﬁnition to A2 by saying that each number in this set is its own 2-addisym conjugate. The equations xp = 1(ν) , ν ∈ I ∗ (p) = I(p)\{0}, have no solution in Ap which can be written in term of one coordinate only. One can obtain such solutions within algebraic extensions of Ap . These extensions, written Ep,ν , result from the adjunction of the pth non-compound root of 1(ν) , ν ∈ I ∗ (p), which will be designated by sp,ν . If k∈I(p) ak skp,ν , k∈I(p) bk skp,ν ∈ Ep,ν , where ak , bk ∈ Ap , then their product is given by ⎛ ⎞ ⎛ ⎞⎛ ⎞ ⎜ ⎟ (r(μ,k,k1 )) ⎟ k ⎜ ⎝ ak skp,ν ⎠ ⎝ bk skp,ν ⎠ = a b k 1 k 2 ⎝ ⎠ sp,ν , k∈I(p)

k∈I(p)

k∈I(p)

k1 ,k2 ∈I(p)

k1 +k2 ≡k( mod p)

where r(ν, k, k1 ) = 0, if k1 ≤ k, and r(ν, k, k1 ) = ν, if k1 > k. In the expression k z = k∈I(p) ak sp,ν ∈ Ep,ν , where ak ∈ Ap , the number a0 will be called p-addisym coordinate of z and ak , k ∈ I ∗ (p), its kth p-addisym extended coordinate. With the addition of corresponding coordinates and this multiplication, the sets Ep,ν are rings with zero divisors if p > 2. The ring E2,1 is a ﬁeld isomorphic to C. The set Ep,ν is also a vectorial space of dimension (p − 1)p on the ﬁeld A2 , which will be written E2;p,ν . (jk) With every k∈I(p) ak skp,ν ∈ Ep,ν , we associate the numbers k∈I(p) ak skp,ν , j ∈ I ∗ (p), which we call its conjugates in Ep,ν . We designate by E◦p,ν the set of zero divisors in Ep,ν . Since Ep,ν Ap2 , we obtain that if p > 2 then the set E◦p,ν is the union of (p − 1)p/2 subspaces of codimension 2 of E2;p,ν . Let Ep,ν be the set of non-divisors of zero in Ep,ν . The multiplicative inverse of a ∈ Ep,ν , written 1/a, is deﬁned by the solution x ∈ Ep,ν of ax = 1.

3

Diﬀerential calculus and exponential function on Ep,ν

In order to deﬁne a limit, and then a derivative on Ep,ν , we introduce the notion of absolute p-prevalue. This is a function from Ep,ν to R+ , written x → x p , which

398

C. Gauthier / Central European Journal of Mathematics 4(3) 2006 395–412

satisﬁes the conditions: i) x p = 0 if and only if x = 0; ii) x + y p ≤ x p + y p ; iii) xy p ≤ x p y p , for any x, y ∈ Ep,ν . If ak,j , j ∈ I(p), designate the coordinates of ak ∈ Ap , k ∈ I(p), then

k∈I(p)

ak skp,ν p =

⎛

⎝

k∈I(p)

⎞ ak,j ⎠

(3)

j∈I(p)

is an absolute p-prevalue (see [4]). Let f : Ep,ν → Ep,ν . We say that f is p-addiderivable at a point z ∈ Ep,ν if limΔz→0 [f (z + Δz) −p f (z)] /Δz, exists independently of the way Δz ∈ Ep,ν approaches zero, with respect to the absolute p-prevalue deﬁned on Ep,ν . We call p-addiderivative of f (z) and write f (z) the value of this limit. One consequence of this deﬁnition is that the functions of one variable in Ep,ν satisfy the usual rules of derivation. On Ep,ν it is in fact easy to deduce all classical results of diﬀerential calculus deﬁned on R or C, such as the uniqueness of the Maclaurin series of a function, which will be used later. The proofs of these properties are similar to the corresponding ones for functions of one real or complex variable. A function deﬁned on Ep;ν will be said to be p-addiholomorphic at a point z if it is p-addiderivable at every point in a neighborhood (deﬁned with respect to the absolute p-prevalue) of z. We shall say that a function is p-addiholomorphic in a domain of Ep;ν if it is p-addiholomorphic at every point in this domain. To deﬁne the exponential function on Ep,ν , let us ﬁrst remark that the formal se ries ∞ X k /k! has an inﬁnite radius of convergence. For z ∈ Ep,ν , we set exp(z) = ∞ k=0 k k=0 z /k!. This series converges (with respect to the limit deﬁned with the absolute p-prevalue) for every z ∈ Ep,ν . This implies that exp(z) exp(w) = exp(z + w), for all z, w ∈ Ep,ν . If x ∈ Ap , then exp(xsp,ν ) = 1 + xsp,ν

x2 s2p,ν + + ... . 2!

(4)

Setting gk (x) =

xk 1(ν) xp+k 1(2ν) x2p+k + + + ... k! (p + k)! (2p + k)!

,

k ∈ I(p),

(5)

the expression (4) becomes exp(xsp,ν ) = g0 (x) + sp,ν g1 (x) + · · · + sp−1 p,ν gp−1 (x).

(6)

The p-addiderivation of gk (x), k ∈ I(p), gives (gk (x)) = gk−1 (x) if k ∈ I ∗ (p), and (g0 (x)) = 1(ν) gp−1 (x). One also has (gk (x))[p] = 1(ν) gk (x) for k ∈ I(p), [p] † where designates the p-addiderivative of order p of the function. †

Another generalization of the trigonometric functions is presented in [5]. See also [7].

C. Gauthier / Central European Journal of Mathematics 4(3) 2006 395–412

399

If x, y ∈ Ap , one can write gj (x+y), j ∈ I(p), in terms of gk (x) and gl (y), k, l ∈ I(p): ⎫ ⎪ ⎪ g0 (x + y) = g0 (x)g0 (y) + [g1 (x)gp−1 (y) + · · · + ⎪ ⎪ ⎪ ⎪ (ν) ⎪ (y)] ⎪ +gp−1 (x)g1 ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ g1 (x + y) = g0 (x)g1 (y) + g1 (x)g0 (y)+ ⎪ ⎪ ⎬ (ν) (7) (x)gp−1 (y) + · · · + gp−1 (x)g2 (y)] ⎪ +[g2 ⎪ ⎪ .. ⎪ ⎪ ⎪ . ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ gp−1 (x + y) = g0 (x)gp−1 (y) + g1 (x)gp−2 (y) + · · · + ⎪ ⎪ ⎪ ⎪ ⎪ +gp−1 (x)g0 (y). ⎭ From (5), it is easy to show that gk (x(l) ) = (gk (x))(kl)

,

k, l ∈ I(p).

(8)

The jth conjugate in Ep,ν of exp(xsp,ν ), j ∈ I ∗ (p), is thus given by ((p−1)j) g0 (x) + sp,ν 1(j) g1 (x) + s2p,ν 1(2j) g2 (x) + · · · + sp−1 gp−1 (x) = p,ν 1 (j) = g0 (x) + sp,ν g1 (x(j) ) + s2p,ν g2 (x(j) ) + · · · + sp−1 p,ν gp−1 (x )

(9)

(j)

= exp(x sp,ν ). From (9), one also gets gk (x) =

1 (k(p−j)) 1 exp(x(j) sp,ν ), pskp,ν

k ∈ I(p).

(10)

j∈I(p)

The substitution of x ∈ Ap by z ∈ Ep,ν in (10) allows us to deﬁne the functions gk , k ∈ I(p), on Ep,ν . One easily shows that (7) and (8) stay valid for x, y ∈ Ep,ν . Assume the existence of a number αp ∈ Ap such that exp(pαp sp,1 ) = 1 and its absolute p-prevalue is the smallest possible without being zero. The aim of the rest of this paper is to construct an expression to compute αp . The expression we shall ﬁnd will imply the number ζp (p), where ζp designates a generalization of Riemann’s zeta function.

4

The computation of ζp (p)

Since Ep,ν Ep,1 , ν ∈ I ∗ (p), from now on we shall work on Ep = Ep,1 and shall set sp = sp,1 . The deﬁnition of αp implies that 1

[exp(pαp sp )] p = exp(αp sp ) = 1(ν1 ) ,

(11)

where ν1 ∈ I ∗ (p). Setting gk

= gk , from (6) it follows that gk

(αp ) = 0 for k ∈ I ∗ (p). Therefore, according to (8), gk

(αp(l) ) = (gk

(αp ))(kl) = 0,

l ∈ I(p),

k ∈ I ∗ (p).

(12)

400

C. Gauthier / Central European Journal of Mathematics 4(3) 2006 395–412 (l)

However (g1

(z)) = g0

(z) and, according to (6), (8) and (11), g0

(αp ) = g0

(αp ) = (l) 1(ν1 ) for l ∈ I(p). The values z = αp , l ∈ I(p), are thus simple roots of g1

(z). Another simple root of g1

(z) is z = 0. From (7) and (12), we also have, for n ∈ N, n > 1, g1

(αp n(l) ) = 1(ν1 ) g1

[αp(l) (n − 1)] = (1(ν1 ) )n−1 g1

(αp(l) ) = 0. All n(l) , for n ∈ N and l ∈ I(p), are thus simple roots of g1

(αp z). From (7) and (12), it is easy to show that any sum of these n(l) is also a simple root of g1

(αp z). Therefore, (l) all roots of g1

(αp z) in Ep are simple and can be expressed as n = l∈I(p) nl , where nl ∈ N for l ∈ I(p). Let −p 1 = k∈I ∗ (p) 1(k) . For every j ∈ I(p), we set

Np,j =

⎧ ⎨ ⎩

(( p−1 +j+1)( mod p))

(l)

p−1 + j + 2)(modp) 2 and n( p−1 +j+2)( mod p) ∈ N∗ ,

nl −p n( p−12 +j+1)( mod p) : nl ∈ N, l ∈ I(p), l = ( 2

l∈I(p)

2

and Np = {

(l)

nl : nl ∈ N, l ∈ I(p) and

l∈I(p)

(l)

nl = 0}.

(13)

l∈I(p)

It is easy to see that the Np,j form a partition of Np . Writing the reduced expressions of n ∈ Np , it is also easy to show that the set of n ∈ Np is identical to the set of n(k) for n ∈ Np, p−3 = N•p and k ∈ I(p). 2 In order to obtain an inﬁnite product to represent g1

(αp z), we ﬁrst note that n∈Np (1 −p z/n) deﬁnes a function having a simple root at each n ∈ Np . The inﬁnite product

1 −p

n∈N•p

k∈I(p)

z p z = 1 − p n(k) n n∈N•

(14)

p

is then a function whose all roots are simple and are given by n ∈ Np . (0) (1) (p−2) For every n0 +n1 +· · ·+np−2 ∈ N•p , let ni0 = max{n0 , n1 , . . . , np−2 }. We can replace (0) (1) (p−2) (0) (1) (p−2) (0) n(k) = (n0 + n1 + · · · + np−2 )(k) of (14) by [(ni0 + ni1 + · · · + nip−2 )(i0 ) ](k) = (ni0 + (1)

(p−2)

ni1 + · · · + nip−2 )((i0 +k)( mod p)) , where ij ∈ {0, 1, . . . , p − 2} and j = 0, 1, . . . , p − 2. For (0)

(1)

(p−2)

n0 +n1 +· · ·+np−2 ∈ N•p , one can thus always assume that n0 = max{n0 , n1 , . . . , np−2 }. Let us now show that the inﬁnite product (14) converges with respect to the absolute (0) (1) (p−2) (0) (1) p-prevalue on Ep . If n0 + n1 + · · · + np−2 = n ∈ N•p , then for every m0 + m1 + · · · +

C. Gauthier / Central European Journal of Mathematics 4(3) 2006 395–412

401

(p−2)

mp−2 ∈ N•p we have that m0 m1 mp−2 mp−2 m0 m1 z p p z ≤ 1 + −p ··· ··· 1 −p n n p n0 =1 n1 =0 np−2 =0 n0 =1 n1 =0 np−2 =0 p ⎛ ⎞ mp−2 mp−2 m0 m0 m1 m1 p p z z ⎠ ≤ ··· exp −p ··· = exp ⎝ −p n n p p n0 =1 n1 =0 np−2 =0 n0 =1 n1 =0 np−2 =0 ⎞ ⎛ mp−2 m0 m1 1 p ⎠ ⎝ (15) ≤ exp z p (p − 1) ··· np . n0 =1 n1 =0

np−2 =0 (0)

p

(1)

(p−2)

Consider the last absolute p-prevalue of (15). We know that n0 + n1 + · · · + np−2 = (0) (0) (p−3) (0) (1) (p−2) n0 + 1(1) (n1 + · · · + np−2 ). We can thus treat n0 + n1 + · · · + np−2 as a p-addisym number whose coordinates are (p − 2)-addisym numbers. Multiplying this number by (0) (p−3) the p − 2 numbers which are its p-addisym conjugates and by n0 + (n1 + · · · + np−2 ), (0) (p−3) we ﬁnd np0 + (n1 + · · · + np−2 )p . For some Mi ∈ R+ , i ∈ I(p − 2), it follows that (0) (1) (p−3) (i) (n1 + n2 + · · · + np−2 )p = M0 + 1(1) i∈I(p−3) Mi+1 . If Ni = Mi+1 for i ∈ I(p − 3), then (0) (p−3) (i) Ni . (16) np0 + (n1 + · · · + np−2 )p = np0 + M0 + 1(1) i∈I(p−3)

Considering (16) as a p-addisym number having (p − 3)-addisym coordinates, it follows that the multiplication of this number by the p − 2 numbers which are its p-addisym (i) (i) conjugates and by np0 +M0 + i∈I(p−3) Ni gives (np0 +M0 )p +( i∈I(p−3) Ni )p . Repeating this reasoning the appropriate number of times to render primary the ﬁnal product of the p−2 p-addisym conjugates, we arrive at an expression which is larger then np0 . If we do the same multiplications of p-addisym conjugates as above for the numerator of the last absolute p-prevalue of (15), we ﬁnd a sum of terms that are all products of powers of ni , i = 0, 1, . . . , p − 2, the total power of which being always equal to pp−2 − 1. Consequently ep−2 p ne0o ne11 . . . np−2 1 . (0) ≤ (p−2) p pp−2 (n0 + n(1) n 0 e ,e ,...,e ∈N 1 + · · · + np−2 ) 0 1 p−2 p e0 +e1 +···+ep−2 =pp−2 −1

Knowing that we can set n0 ≥ ni for i = 0, 1, . . . , p − 2, it follows that 1 c0 (0) ≤ p, (1) (p−2) (n0 + n1 + · · · + np−2 )p n0 p where c0 ∈ R+ is a constant. We then obtain ⎞ ⎛ mp−2 mp−2 m0 m1 m1 z p c0 m0 ≤ exp ⎝ z pp (p − 1) ⎠ ··· ··· 1 −p p n n n0 =1 n1 =0 np−2 =0 n0 =1 n1 =0 np−2 =0 0 p m0 1 p = exp c z p , np n =1 0 0

(17)

402

C. Gauthier / Central European Journal of Mathematics 4(3) 2006 395–412

where c ∈ R+ is a constant. Therefore ∞ z p ≤ exp c z pp 1 −p n−p . n n∈N•p n=1 p

−p Since ∞ converges for every p > 1, the inﬁnite product (14) will converge with n=1 n respect to the absolute p-prevalue on Ep . This result legitimizes the rearrangement of its factors we have already done. The function g1

(αp z) is p-addiholomorphic on all Ep and has the same roots, with the same multiplicities, as the inﬁnite product (14) multiplied by z. A reasoning similar to the one done for usual holomorphic functions allows us here to conclude that there exits an p-addiholomorphic function ψ deﬁned on Ep and a constant K ∈ Ep such that g1

(αp z) = zK exp(ψ(z))

1 −p

n∈N•p

z p n

.

(18)

To show that the function ψ is constant on Ep , let us deﬁne the function g(u) = log(1+u), u ∈ Ep , as the reciprocal series of the one representing the function f (u) = exp(u) −p 1 ([1, chapter 1]). One can then see ([8, p. 350]) that the series n∈N•p log (1 −p (z/n)p ) converges uniformly on every compact subset of Ep which does not include n ∈ N•p . We can thus apply the logarithmic p-addiderivation to the two members of (18). This implies that pz p−1 g

(αp z) 1 g

(z) = αp 0

. (19) = ψ (z) + + z n∈N• z p −p np g1 (αp z) p

We shall now show that ψ (z) ≡ 0 on Ep . Let us consider the function G

(z) = g

(z) −p h

(z), where h

(z) =

pz p−1 1 + z n∈N• z p −p np

(20)

p

1 1 1 1 z = + = + + z n∈N n (z −p n) z n∈N z −p n n

p

p

for every z ∈ Ep such that z = n et n ∈ Np . The three following lemmas will give properties of the function G

that we shall use below. Lemma 4.1. The function G

has a power series representation on Ep . Proof. It is clear that the function g

is p-addiholomorphic on Ep , except at the roots of g1

(αp z), that is at z = 0 and z = n for n ∈ Np . For the function h

, one can see that the series (20) converges uniformly as the series n∈N•p 1/np , whose convergence follows from the one of n∈N•p 1/np p proved above. This series will be p-addiderivable, except at z = 0 and z = n for n ∈ Np . Thus, the functions g

and h

have simple

C. Gauthier / Central European Journal of Mathematics 4(3) 2006 395–412

403

poles at every n ∈ Np and n = 0. This implies that g

(z) =

g˜

(n, z) , z −p n

h

(z) =

˜

(n, z) h , z −p n

˜

are functions which have power series on Ep for each n ∈ Np . Thus where g˜

and h g˜

(n, z) =

∞

aj (n)(z −p n)j ,

j=0

if z −p n ≤ r(n) and ˜

(n, z) = h

∞

bj (n)(z −p n)j ,

j=0

if z−p n ≤ R(n), where aj (n), bj (n) ∈ Ep for j ∈ N and r(n), R(n) ∈ R+ . Consequently a0 (n) aj+1 (n)(z −p n)j + z −p n j=0 ∞

g

(z) = and

b0 (n) (z) = bj+1 (n)(z −p n)j . + z −p n j=0 ∞

h

Applying l’Hˆopital’s rule, which directly generalizes to Ep , we ﬁnd that a0 (n) = lim (z −p n)g

(z) = lim z→n

(z −p n)αp2 1(1) gp−1 (αp z) + αp g0

(αp z)

αp g0

(αp z)

z→n

= 1.

For every n0 ∈ Np , we also have ⎤ z(z −p n0 ) ⎦ z −p n0 + b0 (n0 ) = lim (z −p n0 )h

(z) = lim ⎣ z→n0 z→n0 z n(z −p n) n∈Np ⎡ ⎤ ⎡

z(z −p n0 ) ⎥ ⎢z ⎥ = 1. + = lim ⎢ ⎦ z→n0 ⎣ n0 n(z − n) p n∈N p

n =n0

Similarly, we show that b0 (0) = 1. Thus G

(z) = g

(z) −p h

(z) =

∞

[aj+1 (n) −p bj+1 (n)] (z −p n)j .

j=0

Lemma 4.2. The function G

is periodic of period 1 with respect to each copy of the m monoids M in Ap .

404

C. Gauthier / Central European Journal of Mathematics 4(3) 2006 395–412

Proof. The veriﬁcation of g

(z + 1(k) ) = g

(z), k ∈ I(p), being direct, we shall only prove that h

(z + 1(k) ) = h

(z) for k ∈ I(p). In what follows we shall use the sets (l) (l) " p,k,j = { # N l∈I(p) nl ∈ $Np,j : nk% = 1} and Np,k,j = { l∈I(p) nl ∈ Np,j : nk = 0} for j ∈ I(p), j = k and j = p−1 + k (mod p). We ﬁrst have 2

h

1 1 1 (z + 1 ) = + + z + 1(k) n∈N z + 1(k) −p n n p,k 1 1 1 1 + + + 1(p−k) + = (k) (k) z + 1 −p n n z+1 z n∈N (k)

p,l

l∈I(p),l =k

+

n∈Np,k n =1(k)

1 1 + (k) z −p (n −p 1 ) n

+

n∈Np,l

1 1 + (k) z −p (n −p 1 ) n

.

l∈I(p),l =k

Setting m = n −p 1(k) , it follows that

h

1 1 1 1 (p−k) (z + 1 ) = + +1 + + z + 1(k) z z − m + 1(k) pm m∈Np,k 1 1 + = + z −p m m + 1(k) (k) (k)

m∈Np,l ,m=−p 1

l∈I(p),l =k

1 1 + + 1(p−k) + (k) z+1 z

&

1 1 + z −p m m

m∈Np,j ,j∈I(p)

j=k,k+1,...,( p−1 +k−1)( mod p) 2

&

+

m∈N p−1 p, 2 +k ( mod p) m =−p 1(k)

(

)

1 1 + z −p m m &

+

m∈Np,j ,j∈I(p)

(k) ,j∈I(p) " m∈N p,k,j ,m=1 p−1 j=k+1,..., 2 +k−1 ( mod

(

+

1 + −p 1(k)

)

(k) ,j∈I(p) " m∈N p,k,j ,m=1 p−1 j=k+1,..., 2 +k−1 ( mod

(

1 1 −p m m + 1(k)

1 1 −p (k) m+1 m

−p

'

1 1 −p m m + 1(k)

m∈Np,j ,j∈I(p)

)

1 + m m∈N

( p−1 2 +k)( mod p)

'

1 1 + z −p m m

j=k,k+1,...,( p−1 +k−1)( mod p) 2

−p

+

−p

1 1 + + 1(p−k) + = (k) z+1 z −p 1

1 1 + z −p m m

j=( p−1 +k+1)( mod p),...,(p+k−1)( mod p) 2

(p−k)

1 1 + z −p m m

p,

p)

m =−p 1(k)

1 + m p)

j=(

p−1 +k+1 2

# m∈N p,k,j ,j∈I(p)

)( mod p),...,(p+k−1)( mod p)

1 + m

'

C. Gauthier / Central European Journal of Mathematics 4(3) 2006 395–412

m∈Np,j ,j∈I(p)

1 1 + z −p m m

1 + z m∈N

=

p

1 1 + z −p m m

−p

j=( p−1 +k+1)( mod p),...,(p+k−1)( mod p) 2

j=(

405

p−1 +k+1 2

1 m

# m∈N p,k,j ,j∈I(p)

)( mod p),...,(p+k−1)( mod p)

= h

(z).

Lemma 4.3. The function G

is uniformly bounded with respect to the absolute pprevalue on Ep . Proof. Let xk ∈ Ap , k ∈ I(p), and z = k∈I(p) xk skp ∈ Ep . Due to the result of Lemma 4.2, it will be suﬃcient to show that G

is uniformly bounded with respect to the absolute p-prevalue on each of the regions {z ∈ Ep : x0 = y (k) , where y ∈ R+ , y ≤ 1 and k ∈ I(p)}. But in these regions the analytic function G

is bounded with respect to the absolute p-prevalue for every xj p ≤ a such that a ∈ R+ and j ∈ I ∗ (p), because every continuous function deﬁned on a compact set is bounded on this set. It thus remains to show that G

is bounded with respect to the absolute p-prevalues for xj p > a. To this end, we shall successively show that g

(z) and h

(z) tends toward ﬁnite values when xj tends toward ∞(k) for j ∈ I ∗ (p) and k ∈ I(p), where ∞(k) designates 1(k) ∞. We begin with the function g

. Using (10) and (19), we obtain lim

xp−1

→∞(p−1)

g

(z) =

1 + limxp−1 →∞(p−1) = αp sp

(

p−1 sp x0 +···+sp xp−2 1+ k∈I ∗ (p) exp (1) xp−1 ) ( p−1 sp x0 +···+sp xp−2 (1) exp αp xp−1 1+ (1) xp−1

(

p−1 sp x0 +···+sp xp−2 (1) xp−1 ) ( p−1 sp x0 +···+sp xp−2 (1) exp αp xp−1 1+ (1) xp−1

k∈I ∗ (p)

1 + limxp−1 →∞(p−1)

)

(k) (1) αp xp−1

)

.

(21)

(k) (1)

1(p−k) exp αp xp−1 1+

But

' & sp x0 +···+sp−1 xp−2 (k) (1) p (1) k∈I ∗ (p) exp αp xp−1 1 + xp−1 ' & lim xp−1 →∞(p−1) sp x0 +···+sp−1 xp−2 (1) p exp αp xp−1 1 + (1) xp−1 (k) (1) (k) (1) exp α x ∗ p p−1 exp(αp xp−1 ) k∈I (p) = lim lim = (1) (1) xp−1 →∞(p−1) xp−1 →∞(p−1) exp(αp x exp(αp xp−1 ) ∗ p−1 ) k∈I (p) *& + ' α x p p−1 exp(1(k) ) = lim . xp−1 →∞ exp 1 ∗

k∈I (p)

(p+k−1) Using (4) with x = (sp−1 and the continuity on Ep of the function deﬁned by p )

406

C. Gauthier / Central European Journal of Mathematics 4(3) 2006 395–412

the absolute p-prevalue allow us to show that & 'xp−1 exp(1(k) ) = 0. lim xp−1 →∞ exp 1 From expression (5) for g1

(x), one directly proves that αp cannot be an antiprimary p-addisym number. This implies that *& 'xp−1 +αp exp(1(k) ) lim = 0. xp−1 →∞ exp 1 ∗ k∈I (p)

Therefore, the numerator of (21) equals 1. The same reasoning shows that its denominator also equals 1. Thus lim g

(z) = αp sp . (22) xp−1 →∞(p−1)

The method used to obtain (22) also gives lim

xp−1 →∞(p−1−i)

g

(z) = αp(i) sp ,

i ∈ I(p).

Finally, the limits of g

(z) when xl , l = 1, 2, . . . , p − 2, tends toward ∞(p−1−i) , i ∈ I(p), (p−l) can be computed by writting g0

(z) and g1

(z) in terms of exp(z (j) sp ), j ∈ I(p), instead of exp(z (j) sp ), j ∈ I(p). We ﬁnd lim

xl →∞(p−1−i)

g

(z) = αp(i) sp−l p

.

This shows that the function g

is uniformly bounded with respect to the absolute p-prevalue on Ep . To prove that the function h

is also uniformly bounded with respect to the absolute p-prevalue on Ep , we use its expression (20). We have that z p−1 1

h

(z) p ≤ z + p z p −p np . p p n∈N• p

But, for k ∈ I(p),

1 1 = lim = 0. p−1 limx1 →∞(k) (x0 + sp x1 + · · · + sp xp−1 ) x1 →∞(k) z p p

Thus lim h

x1 →∞(k)

z p−1 (z) p ≤ p lim z p −p np x1 →∞(k) p n∈N•p 1 p−1 ≤ p lim

z p z p −p np x1 →∞(k) p n∈N•p p−1 1 . = p lim x1 p (p−1) np x1 →∞ x − p1 1 • p n∈N p

(23)

C. Gauthier / Central European Journal of Mathematics 4(3) 2006 395–412

407

The method used to obtain a lower bound for the denominator of the last absolute pp−1 p−1 prevalue of (15) here shows that the denominator of (23) has np0 +xp1 as lower bound. Applying to the numerator of (23) the same multiplications of p-addisym conjugates which have leaded to the lower bound of its denominator, and then multiplying the result p by xp−1 1 , one ﬁnds a sum of terms which are the products of power of ni and of x1 multiplied by xp−1 and whose total power is always equal to pp−1 − 1. For some aj ∈ R+ , 1 j = 0, 1, . . . , pp−2 − 1, one can thus write pp−2 −1 pp−1 −jp−1 aj njp 0 x1 j=0

lim h (z) p ≤ p lim p−1 p−1 x1 →∞ x1 →∞(k) np0 + xp1 n0 ∈N∗ pp−2 −1 njp xpp−1 −jp−1 0 1 aj . (24) = p lim pp−1 pp−1 x1 →∞ n + x ∗ 0 1 j=0 n0 ∈N Each of the terms between parentheses in (24) is uniformly bounded with respect to the absolute p-prevalue on Ep . For j = 0, 1, . . . , pp−2 − 1, we have indeed njp xpp−1 −jp−1 0 1 n0 ∈N∗

np0

p−1

+ xp1

p−1

But

=

njp xpp−1 −jp−1 0 1 n0 ≤x1

np0

p−1

+ xp1

p−1

njp xpp−1 −jp−1 0 1 n0 ≤x1

np0

p−1

+ xp1

p−1

+

njp xpp−1 −jp−1 0 1 n0 >x1

np0

p−1

+ xp1

p−1

.

≤1

and, if N is the smallest integer such that 2N > x1 njp xpp−1 −jp−1 0 1 n0 >x1

p−1 np0

+

p−1 xp1

< 2N − 1 + 2(p

p−1 −jp−1)N

ζ(pp−1 − jp).

It follows that h

(z) is bounded with respect to the absolute p-prevalue when x1 → ∞(k) and k ∈ I(p). Repeating the above reasoning for each j ∈ I ∗ (p) shows that h

(z) is bounded with respect to the absolute p-prevalue when xj → ∞(k) and k ∈ I(p). This shows that h

is uniformly bounded with respect to absolute p-prevalue on Ep . The same conclusion can then be applied to the function G

. Using the distance resulting from the absolute p-prevalue, let Eδp be the subset of Ep = Ep,1 whose points are at a distance larger then or equal to a given δ ∈ R∗+ form any point of E◦p = E◦p,1 . We call p-addisym surface of Ep every subset S of Ep such that at every z ∈ S one can deﬁne a patch having two real dimensions. By deﬁning the curvilinear integral on Ep in a manner similar to the one this integral is deﬁned on C, one directly shows that this new integral has the same properties as the later. We then have the following generalization of the Cauchy integral formula. Lemma 4.4. Let f be an p-addiholomorphic function inside and on the boundary C of a simply connected subset D of an p-addisym surface of Eδp . If z0 is an interior point of

408

C. Gauthier / Central European Journal of Mathematics 4(3) 2006 395–412

D, w −p z0 ∈ Ep for w ∈ D\{z0 } and αp = 0, then for every n ∈ N

n! f (z0 ) = pαp sp [n]

, C

f (w) dw. (w −p z0 )n+1

Proof. It is similar to the one of the corresponding result on C ([9, p. 369]).

(25)

The multiplication of z = k∈I(p) ak skp ∈ Ep by its conjugates in Ep gives a number ckk ∈ Ap [4]. If p = 2, this number is a20 + a21 ∈ R+ . If p > 2, the multiplication k∈I(p) of k∈I(p) ckk by its p-addisym conjugates gives a primary or an antiprimary p-addisym number. From these observations, we obtain a new function z → |z|p of Ep on R+ which has properties close to those of an absolute value. This function, called absolute p-quasivalue, is deﬁned by

|z|p =

where R =

j∈I ∗ (p)

⎧ 1 ⎪ ⎨ R (p−1)p

,

1 ⎪ ⎩ (−p R) (p−1)p ,

(jk)

k∈I(p) ck

if R is primary if R is antiprimary,

. We have shown in [4] that for any z, w ∈ Ep

|zw|p = |z|p |w|p

(26)

and for every δ > 0, there exist ap , bp ∈ R∗+ such that for every z ∈ Eδp

ap z p ≤ |z|p ≤ bp z p .

(27)

We have then the following generalization of Liouville’s theorem. Lemma 4.5. If f is p-addiholomorphic and uniformly bounded with respect to the absolute p-prevalue on Eδp and αp = 0 then f is constant on this set. Proof. Since f is p-addiholomorphic on Eδp , it will be p-addiholomorphic on every paddisym surface of Eδp . One can thus apply Lemma 4.4 to every p-addisym surface S passing through any ﬁxed point z of Eδp , by using the simple closed curve C(z, r) = {w ∈ S : w −p z p = r}, where r ∈ R∗+ . If M is an upper bound of f (w) p on Eδp then,

C. Gauthier / Central European Journal of Mathematics 4(3) 2006 395–412

409

according to (25), (26) and (27)

f (z) p = ≤ ≤ ≤ = ≤

But, if w =

, 1 f (w) dw pαp sp 2 (w − z) p C(z,r) p , 1 1 f (w) dw p p αp sp p C(z,r) (w −p z)2 p , 1 1 dw p 1 1

f (w) p 2 p αp p sp p C(z,r) (w −p z) p - - - - , M -- 1 -- -- 1 -1 - dw p pa3p - αp -p - sp -p C(z,r) - (w −p z)2 -p , M

dw p 3 pap |αp |p |sp |p C(z,r) |w −p z|2p , ,

dw p M M = 5

dw p . pa5p |αp |p C(z,r) w −p z 2p pap |αp |p r2 C(z,r)

k k∈I(p) vk sp , from (3) we ﬁnd that dw p =

,

dw p = C(z,r)

⎛ ⎝

k∈I(p)

k∈I(p)

(dv) . Thus k,j j∈I(p)

⎞

, j∈I(p)

(dv)k,j ⎠ ≤ p2 r.

C(z,r)

Consequently f (z) p ≤ M p/a5p |αp |p r. Taking the limit when r tends toward inﬁnity, we ﬁnd that f (z) p tends toward zero. Therefore f (z) = 0 and the function f is constant on the considered p-addisym surface of Eδp . Since this surface was arbitrary, the result applies to every p-addisym surfaces of Eδp , that is on all Eδp . Theorem 4.6. For every z ∈ Ep , one has

g

(z) =

pz p−1 1 . + z n∈N• z p −p np

(28)

p

Proof. The Lemmas 4.1 and 4.3 show that the function G

is p-addiholomorphic and uniformly bounded with respect to the absolute p-prevalue on Ep . From Lemma 4.5, it follows that G

is constant on Eδp . The fact that G

is periodic with respect to each copies of M in Ap and that E◦p is made of (p − 1)p/2 subspaces of codimension 2 of the vectorial space E2;p not containing any of the p2 monoids M of Ep implies that G

is constant on all Ep . To determine the value of this constant, we shall compute G

(0)

410

C. Gauthier / Central European Journal of Mathematics 4(3) 2006 395–412

by using the expression (20) of h(z): ⎛ ⎞⎤ ⎡

g 1 (αp z) 1 ⎠⎦ G

(0) = lim ⎣αp z 0

−p ⎝1 + pz p p p z→0 z z − n g1 (αp z) p n∈N•p ⎤ ⎡ 1(1) (αp z)p+1 1(2) (αp z)2p+1 + + ... 1 αp z + p! (2p)! = lim ⎣ −p 1⎦ (1) (2) p+1 2p+1 (αp z) (αp z) z→0 z αp z + 1 (p+1)! + 1 (2p+1)! + ... ) / .( 1(1) pαpp z p 1(2) p{2[(p + 1)!]2 −p (2p + 1)!}αp2p z 2p 1 + . . . −p 1 = lim 1+ + z→0 z (p + 1)! (2p + 1)![(p + 1)!]2 ( ) 1(1) pαpp z p−1 1(2) p{2[(p + 1)!]2 −p (2p + 1)!}αp2p z 2p−1 = lim + . . . = 0. + z→0 (p + 1)! (2p + 1)![(p + 1)!]2 Thus G

(z) ≡ 0 and g

(z) = h

(z) for z ∈ Ep , from which the result follows. Theorem 4.6 shows that ψ (z) ≡ 0 in (19), so that ψ(z) = c for every z ∈ Ep , where c ∈ Ep is a constant. From (18), we then deduce that g1

(αp z)

= zK0

1 −p

z p n

n∈N•p

,

(29)

where K0 = K exp c. From (29), one can determine the value of K0 by using the series which represents g1

(z). From (5), it follows that K0 = lim K0 z→0

1 −p

z p

n∈N•p

n

g1

(αp z) = lim = αp . z→0 z

We have thus proved Theorem 4.7. For every z ∈ Ep , one has g1

(αp z)

= αp z

1 −p

z p

n∈N•p

n

.

(30)

A second expression of g1

(αp z) is given by its Maclaurin’s series g1

(αp z)

= αp z

∞ 1(n) (αp z)np n=0

(np + 1)!

.

(31)

The uniqueness of this series makes it possible to identify the coeﬃcients of z p in the expressions (30) and (31) of g1

(αp z)/αp z. This means that −p

1 1(1) αpp = . p n (p + 1)! • n∈N p

C. Gauthier / Central European Journal of Mathematics 4(3) 2006 395–412

411

A generalization of Riemann’s zeta function is ζp (z) =

1 , nz n∈N• p

for z ∈ Ep . Consequently ζp (p) = −p

1(1) αpp , (p + 1)!

and, in particular, ζ2 (2) = ζ(2) =

α22 . 3!

(32)

Comparing (32) with (1), we obtain that α2 = π. We shall say that a number of Ap belongs to its primary sector if its reduced expression has the form of the numbers in N•p , that is if this expression has a non-zero ﬁrst coordinate and its (p − 1)th coordinate is zero. We designate by αp the pth root of αpp belonging to the primary sector of Ap . The proof of the following theorem is completed. Theorem 4.8. For every prime number p ∈ N, we have ζp (p) =

−p 1(1) αpp , (p + 1)!

(33)

where αp is a number of the primary sector of Ap . The expression (33) will allow us to compute the value of αp , p ≥ 3, if we determine the value of ζp (p). To this end, we recall the remark following the deﬁnition of Np given by (13) which says that the set of n ∈ Np coincides with the set of n(k) for n ∈ N•p and k ∈ I(p). Therefore 1 1 1 = . p p n p n • n∈N n∈N p

p

But for every n ∈ Np we also have −p n ∈ Np . We can thus decompose Np into two subsets Mp and Np \Mp such that the additive inverse of every n ∈ Mp is in Np \Mp . Therefore 1 1 1 = 0. = + p p p n n (− p n) n∈N n∈M p

p

Consequently ζp (p) = 0 and αp = 0 if p ≥ 3. This result contradicts our assumption about the existence of a αp ∈ Ap such that exp(pαp sp ) = 1 and which has a minimal absolute p-prevalue without being zero. Thus, if p ≥ 3, then there exists no αp ∈ A∗p such that gk

(z + pαp ) = gk

(z), for z ∈ Ap and k ∈ I(p). It follows that the functions generalizing the trigonometric functions to the set Ap have no non-zero root in Ap if p ≥ 3. These functions are thus non-periodic on Ap . This completes the proof of Theorem 4.9. If p ≥ 3, none of the functions gk

, k ∈ I(p), is periodic on Ap .

412

C. Gauthier / Central European Journal of Mathematics 4(3) 2006 395–412

Acknowledgment The author thanks Paul Deguire and Pierre Gravel for fruitful discussions about some results contained in this paper.

References [1] H. Cartan: Th´eorie ´el´ementaire des fonctions analytiques d’une ou plusieurs variables complexes, Hermann, Paris, 1961. [2] P. Deguire and C. Gauthier: “Sur la d´erivation dans certains anneaux quotients de R[x]”, Ann. Sci. Math. Qu´ebec, Vol. 24, (2000), pp. 19–31. [3] L. Euler: “De summis serierum reciprocarum”, Comment. Acad. Sci. Petropolit., Vol. 7(1734/35), (1740), pp. 123–134; Opera omnia, Ser. 1, Vol. 14, Leipzig-Berlin, 1924, pp. 73–86. [4] C. Gauthier: “Quelques propri´et´es alg´ebriques des ensembles de nombres `a inverse additif compos´e”, Ann. Sci. Math. Qu´ebec, Vol. 26, (2002), pp. 47–59. [5] I.J. Good: “A simple generalization of analytic function theory”, Expo. Math., Vol. 6 (1988), pp. 289–311. [6] E. Grosswald: Topics from the Theory of Numbers, Birkh¨auser, Boston, 1984. [7] M.E. Muldoon and A.A. Ungar: “Beyond sin and cos”, Math. Mag., Vol. 69 (1996), pp. 2–14. [8] H. Silverman: Complex Variables, Houghton Miﬄin, Boston, 1975. [9] G. Valiron: Th´eorie des fonctions, Masson, Paris, 1948.

DOI: 10.2478/s11533-006-0017-6 Research article CEJM 4(3) 2006 413–434

On presentations of Brauer-type monoids Ganna Kudryavtseva1∗ , Volodymyr Mazorchuk2† 1

Department of Mathematics and Mechanics, Kyiv Taras Shevchenko University, 01033 Kyiv, Ukraine 2

Department of Mathematics, Uppsala University, SE-75106, Uppsala, Sweden

Received 30 November 2005; accepted 17 March 2006 Abstract: We obtain presentations for the Brauer monoid, the partial analogue of the Brauer monoid, and for the greatest factorizable inverse submonoid of the dual symmetric inverse monoid. In all three cases we apply the same approach, based on the realization of all these monoids as Brauer-type monoids. c Versita Warsaw and Springer-Verlag Berlin Heidelberg. All rights reserved. Keywords: Brauer semigroup, monoid presentation, braid relations MSC (2000): 20M05, 20M20

1

Introduction and preliminaries

The classical Coxeter presentation of the symmetric group Sn plays an important role in many branches of modern mathematics and physics. In semigroup theory there are several “natural” analogues of the symmetric group. For example the symmetric inverse semigroup IS n or the full transformation semigroup Tn . Perhaps a “less natural” generalization of Sn is the so-called Brauer semigroup Bn , which arose in the context of centralizer algebras in representation theory in [6]. The basis of this algebra can be described in a nice combinatorial way using special diagrams (see Section 2). This combinatorial description motivated a generalization of the Brauer algebra, the so-called partition algebra, which has its origins in physics and topology, see [10, 16]. This algebra leads to another ﬁnite semigroup, the partition semigroup, usually denoted by Cn . Many ∗ †

E-mail: [email protected] E-mail: [email protected]

414

G. Kudryavtseva, V. Mazorchuk / Central European Journal of Mathematics 4(3) 2006 413–434

classical semigroups, in particular, Sn , IS n , Bn and some others (again see Section 2) are subsemigroups in Cn . In the present paper we address the question of ﬁnding a presentation for some of the subsemigroups of Cn . As we have already mentioned, for Sn this is a famous and very important result, with the major role played by the so-called braid relations. Because of the “geometric” nature of the generators of the semigroups we consider, our initial motivation was that the additional relations for our semigroups would be some kind of “singular deformations” of the braid relations (analogous to the case of the singular braid monoid, see [1, 3], or to the known presentations of the Brauer algebra from [2, 4]). In particular, we wanted to get a complete list of “deformations” of the braid relations, which can appear in these cases. It turns out that all the semigroups we considered indeed have presentations, all ingredients of which are in some sense deformations or degenerations of the braid relations. As the main results of the paper we obtain a presentation for the semigroup Bn (see Section 3), its partial analogue PBn (which is also called the rook Brauer monoid, see Section 5), and a special inverse subsemigroup IT n of Cn , which is isomorphic to the greatest factorizable inverse submonoid of the dual symmetric inverse monoid, see Section 4 (another presentation for the latter monoid was obtained in [7]). The technical details in all cases are quite diﬀerent, however, the general approach is the same. We ﬁrst “guess” the relations and in the standard way obtain an epimorphism from the semigroup T , given by the corresponding presentation, onto the semigroup we are considering. It remains to show that this epimorphism is in fact a bijection. For this we compare the cardinalities of the semigroups. In all cases the symmetric group Sn is the group of units in T . The product Sn × Sn thus acts on T via multiplication from the left and from the right. The general idea is to then show that each orbit of this action contains a very special element, for which, using the relations, one can estimate the cardinality of the stabilizer. The necessary statement then follows by comparing the cardinalities. Note added in proofs: A presentation for Cn can be found in Theorem 1.11 of the paper [9].

2

Brauer type semigroups

For n ∈ N we denote by Sn the symmetric group of all permutations on the set {1, 2, . . . , n}. We will consider the natural right action of Sn on {1, 2, . . . , n} and the induced action on the Boolean of {1, 2, . . . , n}. For a semigroup, S, we denote by E(S) the set of all idempotents of S. Fix n ∈ N and let M = Mn = {1, 2, . . . , n}, M = {1 , 2 , . . . , n }. We will consider : M → M as a bijection, whose inverse we will also denote by . Consider the set Cn of all decompositions of M ∪ M into disjoint unions of subsets. Given α, β ∈ Cn , α = X1 ∪ · · · ∪ Xk and β = Y1 ∪ · · · ∪ Yl , we deﬁne their product γ = αβ as the unique element of Cn satisfying the following conditions: (P1) For i, j ∈ M the elements i and j belong to the same block of the decomposition γ

G. Kudryavtseva, V. Mazorchuk / Central European Journal of Mathematics 4(3) 2006 413–434

415

if and only if they belong to the same block of the decomposition α or there exists a sequence, s1 , . . . , sm , where m is even, of elements from M such that i and s1 belong to the same block of α; s1 and s2 belong to the same block of β; s2 and s3 belong to the same block of α and so on; sm−1 and sm belong to the same block of β; sm and j belong to the same block of α. (P2) For i, j ∈ M the elements i and j belong to the same block of the decomposition γ if and only if they belong to the same block of the decomposition β or there exists a sequence, s1 , . . . , sm , where m is even, of elements from M such that i and s1 belong to the same block of β; s1 and s2 belong to the same block of α; s2 and s3 belong to the same block of β and so on; sm−1 and sm belong to the same block of α; sm and j belong to the same block of β. (P3) For i, j ∈ M the elements i and j belong to the same block of the decomposition γ if and only if there exists a sequence, s1 , . . . , sm , where m is odd, of elements from M such that i and s1 belong to the same block of α; s1 and s2 belong to the same block of β; s2 and s3 belong to the same block of α and so on; sm−1 and sm belong to the same block of α; sm and j belong to the same block of β. One can think about the elements of Cn as “microchips” or “generalized microchips” with n pins on the left hand side (corresponding to the elements of M ) and n pins on the right hand side (corresponding to the elements of M ). For α ∈ Cn we connect two pins of the corresponding chip if and only if they belong to the same set of the partition α. The operation described above can then be viewed as a “composition” of such chips: having α, β ∈ Cn we identify (connect) the right pins of α with the corresponding left pins of β, which uniquely deﬁnes a connection of the remaining pins - the left pins of α and the right pins of β. An example of multiplication of two chips from Cn is given on Figure 1. Note that, performing the operation we can obtain some “dead circles” formed by some identiﬁed pins from α and β. These circles should be disregarded (however they play an important role in representation theory as they allow one to deform the multiplication in the semigroup algebra). From this interpretation it is fairly obvious that the composition of elements from Cn deﬁned above is associative. On the level of associative algebra, the partition algebra was deﬁned in [16] and then studied by several authors especially in recent years, see for example [5, 17–19, 22, 24]. Purely as a semigroup it seems that Cn appeared in [21]. Let α ∈ Cn and X be a block of α. The block X will be called • a line provided that |X| = 2 and X intersects with both M and M ; • a generalized line provided that X intersects with both M and M ; • a bracket if |X| = 2 and either X ⊂ M or X ⊂ M ; • a generalized bracket if |X| ≥ 2 and either X ⊂ M or X ⊂ M ; • a point if |X| = 1. By a Brauer-type semigroup we will mean a “natural” subsemigroup of the semigroup Cn . Here are some examples: (E1) The subsemigroup, consisting of all elements α ∈ Cn such that each block of α is a line. This subsemigroup is canonically identiﬁed with Sn and is the group of units

416

G. Kudryavtseva, V. Mazorchuk / Central European Journal of Mathematics 4(3) 2006 413–434

2→

• • •

3→

•

•

4→

•

•

1→

5→ 6→ 7→

•

@ B @B • • HH @ B@ B • • XXHHH B HH HH B • H•

•

#• # # • # • # #

•

•

•

•

•

•

• •

• bb

•

•

•

b b

•

•

•

•

•

% % • • % • %% •

•

•

•

=

Fig. 1 Multiplication of elements of Cn . of Cn . (E2) The subsemigroup, consisting of all elements α ∈ Cn such that each block of α is a either a line or a point. This subsemigroup is canonically identiﬁed with the symmetric inverse semigroup IS n . (E3) The subsemigroup Bn , consisting of all elements α ∈ Cn such that each block of α is a either a line or a bracket. This is the classical Brauer semigroup, see [11, 20]. (E4) The subsemigroup PBn , consisting of all elements α ∈ Cn such that each block of α is a either a line, a bracket or a point. This is the partial analogue of the Brauer semigroup, see [20]. (E5) The subsemigroup IP n , consisting of all α ∈ Cn such that each block of α is a generalized line. In this form the semigroup IP n appeared in [14, 15]. It is easy to see that the semigroup IP n is isomorphic to the dual symmetric inverse monoid ∗ from [8]. IM (E6) The subsemigroup IT n , consisting of all α ∈ Cn such that each block X of α is a generalized line and |X ∩ M | = |X ∩ M |. In this form the semigroup IT n appeared in [15]. The semigroup IT n is isomorphic to the greatest factorizable ∗ ∗ inverse submonoid FM of IM from [8]. All the semigroups described above are regular. Sn is a group. The semigroups ISn , IP n and IT n are inverse, while Cn , Bn and PBn are not. The partially ordered set consisting of these semigroups, with the partial order given by inclusions, is illustrated in Figure 2. In the following we shall need some easy combinatorial results for Brauer-type semigroups. For α ∈ Cn we deﬁne the rank rk(α) of α as the number of generalized lines in α, that is the number of blocks in α intersecting with both M and M . Note that for the semigroups Sn , IS n , Bn , PBn and Cn ranks of the elements classify the D-classes (this is obvious for Sn , for IS n this is an easy exercise, for Bn and PBn this can be found in [20], and for Cn it can be obtained by arguments similar to those from [20] for Bn ).

G. Kudryavtseva, V. Mazorchuk / Central European Journal of Mathematics 4(3) 2006 413–434

PB

C lll n OOOOO lll OOO l l l OOO lll l O l l

nF FF ww FF ww w FF w w F w w IS n WWWWW BnB WWWWW BB B WWWWW WWWWW BBB WWWWW B W

Sn

pp ppp p p pp ppp

417

IP n IT n

Fig. 2 Inclusions for classical Brauer-type semigroups For the semigroup IT n we need a diﬀerent notion. Let X be a ﬁnite set and X = be a decomposition of X into a union of pairwise disjoint subsets. For each i, 1 ≤ i ≤ |X|, let mi denote the number of subsets of this decomposition, whose cardinality equals i. The tuple (m1 , . . . , m|X| ) will be called the type of the decomposition. Consider an element, α ∈ IT n . By deﬁnition α is a decomposition of M ∪ M into a disjoint union of subsets, whose intersections with M and M have the same cardinality. Let (m1 , . . . , m2n ) be the type of this decomposition (note that mi = 0 only if i is even). The element α induces a decomposition of M into disjoint subsets, whose blocks are intersections of the blocks of α with M . By the type of α we will mean the type of this decomposition of M , which is obviously equal to (m2 , m4 , . . . , m2n ). The types of elements from IT n correspond bijectively to partitions of n (a partition, λ n, of n is a tuple, λ = (λ1 , . . . , λk ), of positive integers such that λ1 ≥ λ2 ≥ · · · ≥ λk and λ1 + · · · + λk = n). The types of the elements classify the D-classes in IT n , see [8, Section 3]. For the semigroup PBn we need a more complicated technical tool. Although Dclasses are classiﬁed by ranks we will need to distinguish elements of a given rank, so we introduce the notion of a type. For α ∈ PBn let r denote the number of lines in α; b1 the number of brackets in α, contained in M ; b2 the number of brackets in α, contained in M ; p1 the number of points in α, contained in M ; p2 the number of points in α, contained in M . Obviously n = r + 2b1 + p1 = r + 2b2 + p2 . Deﬁne the type of α as follows: ∪ki=1 Xk

type(α) =

(b2 , b1 − b2 , 0, p1 ), b1 ≥ b2 ; (b1 , 0, b2 − b1 , p2 ), b2 > b1 .

We will need the following explicit combinatorial formulas for the number of elements of a given rank or type. Proposition 2.1. (a) For k ∈ {0, . . . , n} the number of elements of rank k in IS n equals n2 k!. k (b) For k ∈ {1, . . . , n} the number of elements of rank k in Bn equals 0 if n − k is odd 2 and 22l(n!) if n − k = 2l is even. (l!)2 k!

418

G. Kudryavtseva, V. Mazorchuk / Central European Journal of Mathematics 4(3) 2006 413–434

(c) The number of elements of IT n of type (m1 , . . . , mn ) equals (n!)2 n (mi !(i!)2mi )

.

i=1

(d) For all non-negative integers k, m, t such that 2k +2m+t ≤ n the number of elements of the type (k, m, 0, t) in PBn is equal to the number of elements of the type (k, 0, m, t) in PBn and equals (n!)2 . k!2k (t + 2m)!(k + m)!2k+m t!(n − 2k − 2m − t)! Proof. This is a straightforward combinatorial calculation.

Remark 2.2. The semigroup Cn can be also connected to some other semigroups of binary relations. As we have already mentioned, the subsemigroup IP n of Cn is iso∗ morphic to the dual symmetric inverse monoid IM from [8], which is the semigroup of all difunctional binary relations under the operation of taking the smallest difunctional binary relations, containing the product of two given relations. The semigroup IT n is ∗ isomorphic to the greatest factorizable inverse submonoid of IM , that is to the semigroup ∗ E(IM )Sn . One can also deform the multiplication in Cn in the following way: given α, β ∈ Cn deﬁne γ = α β as follows: all blocks of γ are either points or generalized lines, and for i, j ∈ M the elements i and j belong to the same block of γ if and only if i belongs to some block X of α and j belongs to some block Y of β such that X ∩ M = (Y ∩ M ) . It is straightforward to show that this deformed multiplication is associative and hence we ˜ n . This semigroup is an inﬂation of Vernitski’s inverse semigroup get a new semigroup, C ˜ n in the natural way. An isomorphic object (DX , ), see [23], which is a subsemigroup of C can be obtained if instead of points one requires that γ contains at most one generalized bracket, which is a subset of M , and at most one generalized bracket, which is a subset of M .

3

Presentation for Bn

For i = 1, . . . , n − 1 we denote by si the elementary transposition (i, i + 1) ∈ Sn , and by πi the element {i, i + 1} ∪ {i , (i + 1) } ∪ j=i,i+1 {j, j } of Bn (the elementary atom from [20]). It is easy to see (and can be derived from the results of [20] and [13]) that Bn is generated by {si } ∪ {πi } as a monoid. Moreover, Bn is even generated by {si } and, for example, π1 . However, in the context being considered, the set {si } ∪ {πi } is more natural as a system of generators for Bn because, for example, the connection between Brauer and Temperley-Lieb algebras (and analogy with the singular braid monoid, see [1, 3]). In this section we obtain a presentation for Bn with respect to this system of generators (this resembles the presentation of the Brauer algebra in [4], see also [2]).

G. Kudryavtseva, V. Mazorchuk / Central European Journal of Mathematics 4(3) 2006 413–434

419

Let T denote the monoid with the identity element e, generated by the elements σi , θi , i = 1, . . . , n − 1, subject to the following relations (where i, j ∈ {1, 2, . . . , n − 1}): σi2 = e; θi2

σi σj = σj σi , |i − j| > 1;

= θi ;

θi θj = θj θi , |i − j| > 1; θi σi = σi θi = θi , σi θj θi = σj θi ,

σi σj σi = σj σi σj , |i − j| = 1;

(1)

θi θj θi = θi , |i − j| = 1;

(2)

θi σj = σj θi , |i − j| > 1;

(3)

θi θj σi = θi σj , |i − j| = 1.

(4)

Theorem 3.1. The map σi → si and θi → πi , i = 1, . . . , n − 1, extends to an isomorphism, ϕ : T → Bn . The rest of the section will be devoted to the proof of Theorem 3.1. We start with the following easy observation, which will be used later in our computations: Lemma 3.2. Under the assumption that the relations (1)–(4) are satisﬁed, we have the following relations: σi θj σi = σj θi σj ,

θi σj θi = θi , |i − j| = 1;

σi σi+1 θi θi+2 = σi+2 σi+1 θi θi+2 .

(5) (6)

Proof. For i, j, |i − j| = 1, applying (4) twice we have σi θj σi = σj θi θj σi = σj θi σj . Applying (4), (3) and, ﬁnally, (2) we also have θi σj θi = θi θj σi θi = θi θj θi = θi . This gives (5). Analogously, applying (4), (1), (2) and (4) again gives σi+2 σi+1 θi θi+2 = σi+2 σi θi+1 θi θi+2 = σi σi+2 θi+1 θi+2 θi = σi σi+1 θi+2 θi , which implies (6).

It is a direct calculation to verify that the generators si and πi of Bn satisfy the relations, corresponding to (1)–(4). Thus the map σi → si and θi → πi , i = 1, . . . , n − 1, extends to an epimorphism, ϕ : T Bn . Hence, to prove Theorem 3.1 we have only to show that |T | = |Bn |. To do this we will have to study the structure of the semigroup T in detail. Let W denote the free monoid, generated by σi , θi , i = 1, . . . , n − 1, and ψ : W T denote the canonical projection. Let ∼ be the corresponding congruence on W , that is v ∼ w provided that ψ(v) = ψ(w). We start with the following description of the units in T : Lemma 3.3. The elements σi , i = 1, . . . , n−1, generate the group G of units in T , which is isomorphic to the symmetric group Sn .

420

G. Kudryavtseva, V. Mazorchuk / Central European Journal of Mathematics 4(3) 2006 413–434

Proof. Let v, w ∈ W be such that v ∼ w. Assume further that v contains some θi . Since θ’s always occur on both sides in the relations (2)–(4) and do not occur in the relation (1), it follows that w must contain some θj . In particular, the submonoid, generated in W by σi , i = 1, . . . , n − 1, is a union of equivalence classes with respect to ∼. Using the wellknown Coxeter presentation of the symmetric group we obtain that σi , i = 1, . . . , n − 1, generate in T a copy of the symmetric group. All elements of this group are obviously units in T . On the other hand, if v, w ∈ W and v contains some θi , then vw contains θi as well. By the above arguments, vw cannot be equivalent to the empty word. Hence v is not invertible in T . The claim of the lemma follows. In the following we identify the group G of units in T with Sn via the isomorphism, which sends σi ∈ G to si . There is a natural action of Sn on T by inner automorphisms of T via conjugation: xg = g −1 xg for each x ∈ T , g ∈ Sn . Lemma 3.4. The Sn -stabilizer of θ1 is the subgroup H of Sn , consisting of all permutations, which preserve the set {1, 2}. This subgroup is isomorphic to S2 × Sn−2 . Proof. We have σj θ1 σj = θj , j = 2, by (3). Since σj , j = 2, generate H, we obtain that all elements of H stabilize θ1 . In particular, the Sn -orbit of θ1 consists of at most |Sn |/|H| = n2 elements. At the same time, it is easy to see that the Sn -orbit of ϕ(θ1 ) consists of exactly n2 diﬀerent elements and hence H must coincide with the Sn -stabilizer of θ1 . Since Sn acts on T via automorphism and θ1 is an idempotent, all elements in the Sn -orbit of θ1 are idempotents. From Lemma 3.4 it follows that the elements of the Sn orbit of θ1 are in the natural bijection with the cosets H\Sn . By the deﬁnition of H, two elements, x, y ∈ Sn , are contained in the same coset if and only if x({1, 2}) = y({1, 2}). Lemma 3.5. The Sn -orbit of θ1 contains all θi , i = 1, . . . , n − 1. Moreover, for w ∈ Sn we have w−1 θ1 w = θi if and only if w({1, 2}) = {i, i + 1}. Proof. We use induction on i with the case i = 1 being trivial. Let i > 1 and assume that θi−1 is contained in our orbit. Then θi = σi−1 σi θi−1 σi σi−1 and hence θi is contained in our orbit as well. Hence all θi belong to the Sn -orbit of θ1 . The second claim follows from σi−1 σi σi−2 σi−1 · · · σ1 σ2 ({1, 2}) = {i, i + 1}, which is obtained through direct calculation. This completes the proof.

(7)

For w ∈ Sn such that w({1, 2}) = {i, j}, where i < j, we set i,j = w−1 θ1 w, which is well deﬁned by Lemma 3.4. Lemma 3.6. Suppose {i, j} ∩ {p, q} = ∅. Then i,j p,q = p,q i,j .

G. Kudryavtseva, V. Mazorchuk / Central European Journal of Mathematics 4(3) 2006 413–434

421

Proof. Since all elements i,j are obtained from θ1 via automorphisms, it is enough to show that θ1 commutes with all elements i,j such that {i, j} ∩ {1, 2} = ∅. Take any v ∈ Sn such that v({1, 2}) = {1, 2} and v({i, j}) = {3, 4} (such v obviously exists). Then θ1 commutes with i,j if and only if v −1 θ1 v = θ1 commutes with v −1 i,j v = θ3 . The statement now follows from (2). Lemma 3.7. Suppose {i, j} ∩ {p, q} = ∅. Then i,j p,q = uθ1 v for some u, v ∈ Sn . Proof. If {i, j} = {p, q} the statement is obvious as i,j is an idempotent. Assume |{i, j} ∩ {p, q}| = 1. Since all elements i,j are obtained from θ1 via automorphisms, it is enough to consider the case when {i, j} = {1, 2}, p = 2 and q > 2. Consider v ∈ Sn such that v(1) = 1, v(2) = 2 and v(q) = 3. Using (3), (1) and (5) we have v −1 θ1 p,q v = θ1 θ2 = θ1 σ1 θ2 σ1 σ1 = θ1 σ2 θ1 σ2 σ1 = θ1 σ2 σ1 .

The statement follows.

For each k, 1 ≤ k ≤ [ n2 ], set δk = θ1 θ3 . . . θ2k−1 . Set also δ0 = e. The elements δi , 0 ≤ i ≤ [ n2 ], will be called canonical. The group Sn × Sn acts naturally on T via (g, h)(x) = g −1 xh for x ∈ T and (g, h) ∈ Sn × Sn . Lemma 3.8. Every Sn × Sn -orbit contains a canonical element. Proof. Let x ∈ T . If x ∈ Sn the statement is obvious. Assume that x ∈ Sn . By Lemma 3.5 we can write x = wθ1 g1 θ1 g2 . . . θ1 gk for some k ≥ 1 and w, g1 , . . . , gk ∈ Sn . Moreover, we may assume that x cannot be written as a product of θ1 ’s and elements of Sn , which contains less than k occurrences of θ1 . We have x = w(g1 . . . gk )(g1 . . . gk )−1 θ1 (g1 . . . gk )· · (g2 . . . gk )−1 θ1 (g2 . . . gk ) . . . (gk−1 gk )−1 θ1 (gk−1 gk )gk−1 θ1 gk , (8) and hence we can write x = u i1 ,j1 . . . ik ,jk ,

(9)

where u = wg1 . . . gk and {it , jt }={(gt . . . gk )(1), (gt . . . gk )(2)}, 1 ≤ t ≤ k. Since x was chosen so that it cannot be reduced to an element of T which contains less that k entries of θ1 , from Lemmas 3.6 and 3.7 it follows that {it , jt } ∩ {is , js } = ∅ for any two factors it ,jt , is ,js in (9). This implies that the Sn × Sn -orbit of x contains i1 ,j1 . . . ik ,jk with {it , jt } ∩ {is , js } = ∅ for all s = t. Now consider some v ∈ Sn such that v(i1 ) = 1, v(j1 ) = 2, v(i2 ) = 3 and so on, v(jk ) = 2k. Then the element v −1 i1 ,j1 · · · ik ,jk v is canonical by deﬁnition. This completes the proof. Remark 3.9. From the proof of Lemma 3.8 it follows that each x ∈ T can be written in the form x = wθ1 g1 θ1 g2 . . . θ1 gk , where k ≤ n2 .

422

G. Kudryavtseva, V. Mazorchuk / Central European Journal of Mathematics 4(3) 2006 413–434

Lemma 3.10. The Sn × Sn -orbit of the canonical element δk , 0 ≤ k ≤ [ n2 ], contains at most (n!)2 22k (k!)2 (n − 2k)! elements. Proof. It is enough to show that the stabilizer of δk under the Sn × Sn -action contains at least (k!)2 22k (n − 2k)! elements. Set Σ0i = σ2i σ2i−1 σ2i+1 σ2i , 1 ≤ i ≤ k − 1;

Σ1i = σ2i σ2i−1 σ2i+1 σ2i σ2i−1 , 1 ≤ i ≤ k − 1. Then both Σ0i and Σ1i swap the sets {2i−1, 2i} and {2i+1, 2i+2}. It follows that the group H, generated by all Σ0i , consists of all permutations of the set {1, 2}, {3, 4}, . . . , {2k−1, 2k} ˜ and is therefore isomorphic to the group Sk . Further, it is easy to see that the group H,

generated by all Σ0i and Σ1i , is isomorphic to the wreath product H S2 . From (6) and (3) it follows that the left multiplication with both Σ0i and Σ1i stabilizes δk . Therefore for ˜ the left multiplication with this element stabilizes δk as well. Similarly each element of H ˜ stabilizes δk . In one proves that the right multiplication with each element from H addition to this, from (3) we have that the conjugation by any element from the group H = σ2k+1 , . . . , σn−1 Sn−2k stabilizes δk . ˜ the right copy of H, ˜ and the Observe that the group, generated by the left copy of H, H is a direct product of these three components. Using the product rule we derive that the cardinality of the stabilizer of δk is at least (|H S2 |)2 |Sn−2k | = (k!)2 22k (n − 2k)!,

and the proof is complete. Corollary 3.11.

n

|T | ≤

2 k=0

(n!)2 . 22k (k!)2 (n − 2k)!

Proof. The proof follows from Lemma 3.10 and Remark 3.9 by a direct calculation. Proof (of Theorem 3.1). Comparing Corollary 3.11 and Proposition 2.1(b) we have |T | ≤ |Bn |. Since ϕ : T → Bn is surjective we have |T | ≥ |Bn |. Hence |T | = |Bn | and ϕ is an isomorphism.

4

Presentation for IT n

For i ∈ {1, 2, . . . , n−1} let i denote the element {i, i+1, i , (i+1) }∪ j=i,i+1 {j, j } ∈ IT n . By [15, Proposition 9], the elements {σi } and {i } generate IT n (and even {σi } and, say 1 , do).

G. Kudryavtseva, V. Mazorchuk / Central European Journal of Mathematics 4(3) 2006 413–434

423

Let T denote the monoid with the identity element e, generated by the elements σi , τi , i = 1, . . . , n − 1, subject to the following relations (where i, j ∈ {1, 2, . . . , n − 1}): σi2 = e;

σi σj = σj σi , |i − j| > 1; τi2

= τi ;

σi σj σi = σj σi σj , |i − j| = 1;

τi τj = τj τi , i = j;

τi σi = σi τi = τi ;

τi σj = σj τi , |i − j| > 1;

σi τj σi = σj τi σj and τi σj τi = τi τj , |i − j| = 1.

(10) (11) (12) (13)

Theorem 4.1. The map σi → si and τi → i , i = 1, . . . , n−1, extends to an isomorphism, ϕ : T → IT n . The rest of the section will be devoted to the proof of Theorem 4.1. It is a direct calculation to verify that the generators si and i of IT n satisfy the relations, corresponding to (10)–(13). Thus the map σi → si and τi → i , i = 1, . . . , n−1, extends to an epimorphism, ϕ : T IT n . Hence, to prove Theorem 4.1 we have only to show that |T | = |IT n |. As in the previous section, to do this we will study the structure of T in detail. Let W denote the free monoid, generated by σi , τi , i = 1, . . . , n − 1, ψ : W T denote the canonical projection, and ∼ be the corresponding congruence on W . The ﬁrst part of our argument is very similar to that from the previous Section. Lemma 4.2. The elements σi , i = 1, . . . , n−1, generate the group G of units in T , which is isomorphic to the symmetric group Sn (and will be identiﬁed with Sn in the following). Proof. Analogous to that of Lemma 3.3.

There are two natural actions on T : (I) The group Sn acts on T by inner automorphisms via conjugation. (II) The group Sn × Sn acts on T via (g, h)(x) = g −1 xh for x ∈ T and (g, h) ∈ Sn × Sn . Lemma 4.3. The Sn -stabilizer of τ1 is the subgroup H of Sn , consisting of all permutations, which preserve the set {1, 2}. This subgroup is isomorphic to S2 × Sn−2 . Proof. Analogous to that of Lemma 3.4.

Since Sn acts on T via automorphisms and τ1 is an idempotent, all elements in the Sn -orbit of τ1 are idempotents. From Lemma 4.3 it follows that the elements of the Sn orbit of τ1 are in the natural bijection with the cosets H\Sn . By the deﬁnition of H, two elements, x, y ∈ Sn , are contained in the same coset if and only if x({1, 2}) = y({1, 2}). Lemma 4.4. The Sn -orbit of τ1 contains all τi , i = 1, . . . , n − 1. Moreover, for w ∈ Sn we have w−1 τ1 w = τi if and only if w({1, 2}) = {i, i + 1}. Proof. Analogous to that of Lemma 3.5.

424

G. Kudryavtseva, V. Mazorchuk / Central European Journal of Mathematics 4(3) 2006 413–434

Lemma 4.5. All elements in the Sn -orbit of τ1 commute. Proof. Since all elements in the Sn -orbit of τ1 are obtained from τ1 via automorphism, it is enough to show that τ1 commutes with all elements in this orbit. Let w ∈ Sn be such that w({1, 2}) = {i, j}. If {i, j} = {1, 2} then w−1 τ1 w = τ1 by Lemma 4.4 and hence we may assume {i, j} = {1, 2}. Take any v ∈ Sn such that • v({1, 2}) = {1, 2} and v({i, j}) = {3, 4} if {i, j} ∩ {1, 2} = ∅; • v({1, 2}) = {1, 2} and v({i, j}) = {2, 3} if {i, j} ∩ {1, 2} = ∅. (such v obviously exists). Then τ1 commutes with w−1 τ1 w if and only if v −1 τ1 v commutes with v −1 w−1 τ1 wv. Using our choice of v and Lemma 4.4 we have v −1 τ1 v = τ1 and v −1 w−1 τ1 wv = τj , where j = 3 if {i, j} ∩ {1, 2} = ∅, and j = 2 otherwise. The statement now follows from (11). For w ∈ Sn such that w({1, 2}) = {i, j}, where i < j, we set εi,j = w−1 τ1 w, which is well deﬁned by Lemma 4.3. Lemma 4.6. Let {i, j, k} ⊂ {1, 2, . . . , n} and i < j < k. Then εi,j εj,k = εi,k εj,k = εi,j εi,k . Proof. We prove that εi,j εj,k = εi,k εj,k and the second equality is proved by analogous arguments. Let w ∈ Sn be such that w(i) = 1, w(j) = 2, w(k) = 3. Conjugating by w we reduce our equality to the equality τ1 τ2 = σ2 τ1 σ2 τ2 . Using (13) twice and (12) we have σ2 τ1 σ2 τ2 = σ1 τ2 σ1 τ2 = σ1 τ1 τ2 = τ1 τ2 .

The claim follows.

For i, j ∈ M set εi,i = e and εi,j = εj,i if j < i. For a non-empty binary relation, ρ, on M set ερ = εi,j . iρj

Corollary 4.7. Let ρ be non-empty binary relation on M and ρ∗ be the reﬂexive-symmetrictransitive closure of ρ. Then ερ = ερ∗ Proof. Follows easily from Lemma 4.5, Lemma 4.6 and the fact that all εi,j ’s are idempotents. Let λ : {1, . . . , n} = X1 ∪ · · · ∪ Xk be a decomposition of M into an unordered union of pairwise disjoint sets. With this decomposition we associate the equivalence relation ρλ on M , whose equivalence classes coincide with Xi ’s.

G. Kudryavtseva, V. Mazorchuk / Central European Journal of Mathematics 4(3) 2006 413–434

425

Corollary 4.8. Let λ and μ be two decompositions of M as above. Assume that the types of λ and μ coincide. Then ερλ and ερμ are conjugate in T . Proof. Let v ∈ Sn be an element, which maps λ to μ (such element exists since the types of λ and μ are the same). One easily sees that v −1 ερλ v = ερμ . The statement follows. A decomposition, λ : {1, . . . , n} = X1 ∪ · · · ∪ Xk , is called canonical provided that (up to a permutation of the blocks) we have |X1 | ≥ |X2 | ≥ · · · ≥ |Xk |, X1 = {1, 2, . . . , l1 }, X2 = {l1 + 1, l1 + 2, . . . , l1 + l2 } and so on. Note that in this case λ can also be viewed as a partition of n. The element ερλ will be called canonical provided that λ is canonical. Lemma 4.9. Every Sn × Sn -orbit contains a canonical element. Proof. By Corollary 4.8 it is enough to show that every Sn × Sn -orbit contains ερλ for some decomposition λ. Let x ∈ T . If x ∈ Sn , then the statement is obvious. Let x ∈ T \ Sn . From Lemma 4.4 we have that the semigroup T is generated by Sn and τ1 . Hence we have x = wτ1 g1 τ1 g2 · · · τ1 gk for some w, g1 , . . . , gk ∈ Sn . Therefore x = w(g1 . . . gk )(g1 . . . gk )−1 τ1 (g1 . . . gk )· · (g2 . . . gk )−1 τ1 (g2 . . . gk ) . . . (gk−1 gk )−1 τ1 (gk−1 gk )gk−1 τ1 gk , and hence we can write x = uεi1 ,j1 . . . εik ,jk , where u = wg1 . . . gk and {it , jt } = {(gt . . . gk )(1), (gt . . . gk )(2)}, 1 ≤ t ≤ k. Deﬁne the equivalence relation ρ as the reﬂexive-symmetric-transitive closure of the relation {(i1 , j1 ), . . . , (ik , jk )} and let λ be the corresponding decomposition of {1, 2, . . . , n}. From Corollary 4.7 we get that the Sn × Sn -orbit of x contains ερ = ερλ . This completes the proof. Lemma 4.10. Let λ be a canonical decomposition of {1, 2, . . . , n}. For i = 1, . . . , n set λ(i) = |{j : |Xj | = i}|. Then the Sn × Sn -stabilizer of ερλ contains at least n (i) (λ(i) !(i!)2λ ) i=1

elements. Proof. Fix i ∈ {1, 2, . . . , n}. Let Xa , Xa+1 . . . , Xb be all blocks of λ of cardinality i. Then for any non-maximal element j of any of Xa , Xa+1 . . . , Xb , using Lemma 4.5, the deﬁnition of ερλ , and (12) we have σj ερλ = ερλ σj = ερλ . Moreover, for any w ∈ Sn , which stabilizes all elements outside Xa ∪ Xa+1 ∪ · · · ∪ Xb and maps each Xs to some Xt , we have (i) w(λ) = λ and hence w−1 ερλ w = ερλ . This gives us exactly λ(i) !(i!)2λ elements of the Sn × Sn -stabilizer. The statement of the lemma now follows by applying the product rule

426

G. Kudryavtseva, V. Mazorchuk / Central European Journal of Mathematics 4(3) 2006 413–434

since for diﬀerent i the nontrivial elements w above stabilize pairwise diﬀerent subsets of {1, . . . , n}. Corollary 4.11. |T | ≤

(n!)2

λn

n (i) (λ(i) !(i!)2λ )

.

i=1

Proof. Canonical elements of T are in bijection with partitions λ n by construction. By Lemma 4.9, every Sn × Sn -orbit contains a canonical element. We have |Sn × Sn | = (n!)2 . By Lemma 4.10, the stabilizer of a canonical element, corresponding to λ, contains at (i) least ni=1 (λ(i) !(i!)2λ ) elements. The statement now follows by applying the sum rule. Proof (of Theorem 4.1.). Comparing Corollary 4.11 and Proposition 2.1(c) we have |T | ≤ |IT n |. Since ϕ : T → IT n is surjective we have |T | ≥ |IT n |. Hence |T | = |IT n | and ϕ is an isomorphism. Remark 4.12. From the above arguments it follows that the inequality obtained in Lemma 4.10 is in fact an equality. From the proof of Lemma 4.10 one easily derives that the Sn × Sn -stabilizer of ερλ is isomorphic to the direct product of wreath products Sλ(i) (Si × Si ). Remark 4.13. Following the arguments of the proof of Theorem 4.1 one easily proves the following presentation for the symmetric inverse semigroup IS n : IS n is generated, as a monoid, by σ1 , . . . , σn−1 , ϑ1 , . . . , ϑn subject to the following relations: σi2 = e;

σi σj = σj σi , |i − j| > 1; ϑ2i

= ϑi ;

σi σj σi = σj σi σj , |i − j| = 1;

ϑi ϑj = ϑj ϑi i = j;

σi ϑi = ϑi+1 σi ; σi ϑj = ϑj σi , j = i, i + 1;

ϑi σi ϑi = ϑi ϑi+1 .

(14) (15) (16)

The classical presentation for IS n usually involves only one additional generator (namely ϑ1 ) and can be found for example in [12, Chapter 9].

5

Presentation for PBn

For i ∈ {1, . . . , n} let ςi denote the element {i} ∪ {i } ∪ j=i {j, j }. Using [20], it is easy to see that PBn is generated by {σi } ∪ {πi } ∪ {ςi } (and even by {σi }, π1 and ς1 ). Let T denote the monoid with the identity element e, generated by the elements σi , θi , i = 1, . . . , n − 1, and ϑi , i = 1, . . . , n, subject to the relations (1)–(4), the relations

G. Kudryavtseva, V. Mazorchuk / Central European Journal of Mathematics 4(3) 2006 413–434

427

from Remark 4.13, and the following relations (for all appropriate i and j): θi ϑj = ϑj θi , j = i, i + 1; θi ϑi = θi ϑi+1 = θi ϑi ϑi+1 , θi ϑ i θi = θi ,

ϑi θi = ϑi+1 θi = ϑi ϑi+1 θi ; ϑi θi ϑi = ϑi ϑi+1 .

(17) (18) (19)

Theorem 5.1. The map σi → si , θi → πi , i = 1, . . . , n − 1, and ϑi → ςi , i = 1, . . . , n, extends to an isomorphism, ϕ : T → PBn . We will again start with the following auxiliary technical statement, which we will need later: Lemma 5.2. Under the assumption that (1)–(4), (17)–(19) and the relations from Remark 4.13 are satisﬁed, one has the relation σi+2 σi+1 θi ϑi+2 ϑi+3 = σi σi+1 ϑi θi θi+2 ϑi+2 .

(20)

Proof. Using (4) twice and (1) we have σi+2 σi+1 θi ϑi+2 ϑi+3 = σi+2 σi θi+1 θi ϑi+2 ϑi+3 = = σi σi+2 θi+1 θi ϑi+2 ϑi+3 = σi σi+1 θi+2 θi+1 θi ϑi+2 ϑi+3 , and hence (20) reduces to θi+2 θi+1 θi ϑi+2 ϑi+3 = ϑi θi θi+2 ϑi+2 .

(21)

Using (17)-(19) and (2) we have θi+2 θi+1 θi ϑi+2 ϑi+3 = θi+2 ϑi+3 θi+1 ϑi+2 θi = θi+2 ϑi+2 θi+1 ϑi+1 θi = = θi+2 ϑi+1 θi+1 ϑi+1 θi = θi+2 ϑi+2 ϑi+1 θi = ϑi θi θi+2 ϑi+2 , which gives (21). The statement follows.

As in the previous section, one easily checks that this map extends to an epimorphism and hence to complete the proof one has to compare the cardinalities of T and PBn . Similar to the results of Section 4, using the presentation of IS n given in Remark 4.13, one proves that elements σi , i = 1, . . . , n − 1, generate the symmetric group Sn , and that the elements σi , i = 1, . . . , n − 1; ϑi , i = 1, . . . , n, generate the semigroup, which is isomorphic to IS n (and which will be identiﬁed with it). As in Section 4 we consider the natural action of Sn on T by inner automorphism of T via conjugation: xg = g −1 xg for each x ∈ T , g ∈ Sn . Set ξi = θi ϑi , ηi = ϑi θi , 1 ≤ i ≤ n − 1. Lemma 5.3. The Sn -stabilizer of each of θ1 , ξ1 , η1 is the subgroup H of Sn , consisting of all permutations, which preserve the set {1, 2}. This subgroup is isomorphic to S2 × Sn−2 .

428

G. Kudryavtseva, V. Mazorchuk / Central European Journal of Mathematics 4(3) 2006 413–434

Proof. For θ1 this follows from Lemma 3.4. For each j ≥ 2 we have that σj commutes with both ξ1 and η1 by (3) and (16) respectively, and hence σj ξ1 σj = ξ1 and σj η1 σj = η1 . Let j = 1. Then σ1 ξ1 σ1 = σ1 θ1 ϑ1 σ1 = σ1 θ1 σ1 ϑ2 = θ1 ϑ2 = θ1 ϑ1 = ξ1 ; σ1 η1 σ1 = σ1 ϑ1 θ1 σ1 = ϑ2 σ1 θ1 σ1 = ϑ2 θ1 = ϑ1 θ1 = η1 by (16) and (3). Hence σ1 also stabilizes ξ1 and η1 . Since σj , j = 2, generate H, it follows that all elements of H stabilize ξ1 and η1 . In particular, the Sn -orbits of ξ1 and of η1 consist of at most |Sn |/|H| = n2 elements each. At the same time, the Sn -orbits of ϕ(ξ1 ) and ϕ(η1 ) consist of exactly n2 diﬀerent elements and hence H must coincide with the Sn -stabilizer of both ξ1 and η1 . Since Sn acts on T via automorphisms and θ1 , ξ1 , η1 are idempotents, all elements in the Sn -orbits of θ1 , ξ1 , η1 are idempotents as well. From Lemma 5.3 it follows that the elements of the Sn -orbits of θ1 , ξ1 , η1 are in natural bijection with the cosets H\Sn . By the deﬁnition of H, two elements, x, y ∈ Sn , are contained in the same coset if and only if x({1, 2}) = y({1, 2}). Lemma 5.4. The Sn -orbits of θ1 , ξ1 , η1 contain all elements θi , ξi and ηi , i = 1, . . . , n−1, respectively. Moreover, for w ∈ Sn we have w−1 θ1 w = θi if and only if w({1, 2}) = {i, i + 1} and analogously for ξ1 and η1 . Proof. The proof for the Sn -orbit of θ1 is analogous to that of Lemma 3.5. We prove the statement for the Sn -orbit of ξ1 . For the Sn -orbit of η1 the arguments are analogous. We use induction on i with the case i = 1 being trivial. Let i > 1 and assume that ξi−1 is contained in our orbit. Then, using (16), (1) and (5), we compute ξi = θi ϑi = σi−1 σi θi−1 σi σi−1 ϑi = σi−1 σi θi−1 σi ϑi−1 σi−1 = σi−1 σi θi−1 ϑi−1 σi σi−1 = σi−1 σi ξi−1 σi σi−1 , and hence ξi is contained in our orbit as well. The second claim follows from (7). This completes the proof. For w ∈ Sn such that w({1, 2}) = {i, j}, where i < j, we set i,j = w−1 θ1 w, μi,j = w−1 ξ1 w, νi,j = w−1 η1 w. All these elements are well deﬁned by Lemma 5.3. Lemma 5.5. (a) ϑi i,j = ϑj i,j = ϑi ϑj i,j = νi,j ; ϑk i,j = i,j ϑk , k ∈ {i, j}. (b) ϑi μi,j = ϑj μi,j = ϑi ϑj μi,j = ϑi ϑj ; ϑk μi,j = μi,j ϑk , k ∈ {i, j}. Proof. First we prove (a). By Lemma 5.4 it is enough to check that ϑ1 1,2 = ϑ2 1,2 = ϑ1 ϑ2 1,2 = ν1,2 and that ϑ3 1,2 = 1,2 ϑ3 . The latter equalities follow from (18) and (17). Now we prove (b). Again, because of Lemma 5.4 it is enough to check that ϑ1 μ1,2 = ϑ2 μ1,2 = ϑ1 ϑ2 μ1,2 = ϑ1 ϑ2 and that ϑ3 μ1,2 = μ1,2 ϑ3 . Using (19), (18) and (17) we have ϑ1 μ1,2 = ϑ1 θ1 ϑ1 = ϑ1 ϑ2 ;

ϑ1 μ2,3 = ϑ1 θ2 ϑ2 = θ2 ϑ1 ϑ2 = θ2 ϑ2 ϑ1 = μ2,3 ϑ1 ,

G. Kudryavtseva, V. Mazorchuk / Central European Journal of Mathematics 4(3) 2006 413–434

429

as required.

Lemma 5.6. Suppose {i, j} ∩ {p, q} = ∅. Then i,j p,q = p,q i,j , μi,j μp,q = μp,q μi,j and i,j μp,q = μp,q i,j . Proof. Following the arguments from the proof of Lemma 3.6 it is enough to show that μ1,2 μ3,4 = μ3,4 μ1,2 and μ1,2 3,4 = 3,4 μ1,2 , that is that ξ1 ξ3 = ξ3 ξ1 and ξ1 θ3 = θ3 ξ1 . Using (17), (15) and (2) we have ξ1 ξ3 = θ1 ϑ1 θ3 ϑ3 = θ1 θ3 ϑ1 ϑ3 = θ3 θ1 ϑ3 ϑ1 = θ3 ϑ3 θ1 ϑ1 = ξ3 ξ1 , and using (17) and (2) we also obtain ξ1 θ3 = θ1 ϑ1 θ3 = θ1 θ3 ϑ1 = θ3 ξ1 , as required.

Lemma 5.7. Suppose {i, j} ∩ {p, q} = ∅. Then each of the elements i,j p,q , μi,j μp,q , i,j μp,q , μi,j p,q is equal to an element of the form uθ1 v for some u, v ∈ IS n . Proof. Using the argument from the proof of Lemma 3.7 it is enough to prove the statement only for the elements μ1,2 μ2,3 , μ1,2 2,3 , 1,2 μ2,3 . We have μ1,2 μ2,3 = ξ1 ξ2 = θ1 ϑ1 θ2 ϑ2 = θ1 ϑ2 θ2 ϑ2 = θ1 ϑ2 ϑ3 = ξ1 ϑ3 = θ1 ϑ1 ϑ3 by (18) and (19); and μ1,2 2,3 = θ1 ϑ1 θ2 = θ1 ϑ1 σ1 σ2 θ1 σ1 σ2 = θ1 σ1 ϑ2 σ2 θ1 σ1 σ2 = θ1 σ1 σ2 ϑ3 θ1 σ1 σ2 = θ1 σ1 σ2 θ1 ϑ3 σ1 σ2 = θ1 σ2 θ1 ϑ3 σ1 σ2 = θ1 ϑ3 σ1 σ2 by (1), (5), (3), (16). Finally, 1,2 μ2,3 = θ1 θ2 ϑ2 = θ1 σ1 σ2 θ1 σ2 σ1 ϑ2 = θ1 σ2 ϑ1 σ1 . using (1), (3) and (5). The statement follows.

For each subset {i1 , . . . , ik } of {1, 2, . . . , n} set ϑ({i1 , . . . , ik }) = ϑi1 . . . ϑik . Obviously, ϑ({i1 , . . . , ik }) is an idempotent and each idempotent of IS n has such a form. In the following we will use the obvious fact that each element of IS n can be written in the form uv, where u is an idempotent, and v ∈ Sn . As in the previous sections we consider the Sn × Sn -action on T given by (g, h)(x) = −1 g xh for x ∈ T and (g, h) ∈ Sn × Sn . Lemma 5.8. Every Sn × Sn -orbit contains either e or an element of the form ϑ(A)γi1 ,j1 . . . γis ,js , where A ⊂ {1, 2, . . . , n}, the sets {il , jl } are pairwise disjoint, and each γil ,jl equals either il ,jl or μil ,jl . Proof. The idea of the proof is analogous to that of Lemma 3.8. Let x ∈ T . If x ∈ Sn the statement is obvious so assume that x ∈ Sn . Since T is generated by IS n and θ1 we can write x = wuθ1 u1 g1 θ1 u2 g2 · · · θ1 uk gk (22)

430

G. Kudryavtseva, V. Mazorchuk / Central European Journal of Mathematics 4(3) 2006 413–434

for some k ≥ 1, w, g1 , . . . , gk ∈ Sn and u, u1 , . . . , uk ∈ E(IS n ). Moreover, we may assume that x cannot be written as a product of θ1 ’s and elements of IS n , which contains less than k occurrences of θ1 . We claim that x can be written as x = wu γ11 g1 γ12 g2 · · · γ1k gk ,

(23)

where, w, g1 , . . . , gk ∈ Sn , u ∈ E(IS n ), and each γ1i is equal to either θ1 or ξ1 . Let us prove this by induction on k. Let k = 1 and x = wuθ1 u1 g1 . We know that u1 = ϑ(B) for some B ⊂ {1, . . . , n}. Let A = B \ {1, 2}. Using (17) and (18) we obtain that ⎧ ⎪ ⎨ wuu1 θ1 g1 , if B ∩ {1, 2} = ∅; x= ⎪ ⎩ wuϑ(A)ξ1 g1 , if B ∩ {1, 2} = ∅, as required. Now suppose k ≥ 2. Applying the basis of the induction to θ1 uk gk we obtain x = wuθ1 u1 g1 θ1 u2 g2 · · · θ1 uk−1 gk−1 θ1 uk gk = wuθ1 u1 g1 θ1 u2 g2 · · · θ1 uk−1 gk−1 uk γ1k gk , where uk is an idempotent of IS n and γ1k is either ξ1 or θ1 . Now, since uk−1 gk−1 uk ∈ IS n , for some gk−1 ∈ Sn and uk−1 ∈ E(IS n ). Now (23) we can write uk−1 gk−1 uk = uk−1 gk−1 follows by applying the inductive assumption to wuθ1 u1 g1 θ1 u2 g2 · · · uk−2 gk−2 θ1 uk−1 gk−1 . Similarly to (8) we can rewrite (23) as follows: x = wu (g1 · · · gk )(g1 · · · gk )−1 γ11 (g1 · · · gk )·

gk )−1 γ1k−1 (gk−1 gk )gk−1 γ1k gk , · (g2 · · · gk )−1 γ12 (g2 · · · gk ) · · · (gk−1

and therefore we can write x = vu γi1 ,j1 · · · γik ,jk ,

(24)

where v = wg1 · · · gk , {it , jt }={(gt · · · gk )(1), (gt · · · gk )(2)}, 1 ≤ t ≤ k, and each γil ,jl is equal to either il ,jl or μil ,jl . Since x was initially chosen so that it cannot be reduced to an element of T , which contains less that k entries of θ1 , from Lemma 5.7 it follows that {it , jt } ∩ {il , jl } = ∅ for any two factors γit ,jt , γil ,jl in (24). This implies that the Sn × Sn -orbit of x contains u γi1 ,j1 · · · γis ,js such that u ∈ E(IS n ), {it , jt } ∩ {il , jl } = ∅ for all l = t. The statement follows. Corollary 5.9. Any Sn × Sn - orbit contains either e or an element of the form ϑ(A)γi1 ,j1 · · · γis ,js , such that (i) the sets {il , jl } are pairwise disjoint; (ii) each γil ,jl equals to either il ,jl or μil ,jl or νil ,jl ; (iii) A ∩ {i1 , j1 , . . . is , js } = ∅. Proof. This follows from Lemmas 5.8 and 5.5.

G. Kudryavtseva, V. Mazorchuk / Central European Journal of Mathematics 4(3) 2006 413–434

431

Now we introduce the notion of a canonical element. Let k, l, m, t be some nonnegative integers satisfying 2k + 2l + 2m + t ≤ n. Set δ(0, 0, 0, 0) = e and if at least one of k, l, m, t is not zero, set δ(k, l, m, t) = θ1 θ3 · · · θ2k−1 ξ2k+1 ξ2k+3 · · · ξ2k+2l−1 ν2k+2l+1 ν2k+2l+3 · · · · · ν2k+2l+2m−1 ϑ2k+2l+2m+1 ϑ2k+2l+2m+2 · · · ϑ2k+2l+2m+t . (25) The element δ(k, l, m, t) such that l = 0 or m = 0 will be called a canonical element of type (k, l, m, n). Corollary 5.10. Every Sn × Sn -orbit contains a canonical element. Proof. By Corollary 5.9 we have to prove that, the Sn ×Sn -orbit of the element ϑ(A)γi1 ,j1 · · · γis ,js , satisfying the conditions of Corollary 5.9, contains a canonical element. Using conjugation, we can always reduce ϑ(A)γi1 ,j1 · · · γis ,js to some δ(k, l, m, t). However, it might happen that both m and l are non-zero. Without loss of generality we may assume m ≥ l ≥ 1. Using (20) and conjugation we get that the Sn × Sn -orbit of the element μi,j νp,q contains i,j ϑp ϑq provided that {i, j} ∩ {p, q} = ∅. Hence the Sn × Sn -orbit of our δ(k, l, m, t) contains δ(k + 1, l − 1, m − 1, t + 2). Proceeding by induction we get that the Sn × Sn -orbit of our δ(k, l, m, t) contains δ(k + l, 0, m − l, t + 2l), which is canonical. This completes the proof. Lemma 5.11. The Sn × Sn -orbits of the canonical element δ(k, l, 0, t) and δ(k, 0, l, t) contain at most (n!)2 (k + l)!2k+l t!k!2k (2l + t)!(n − 2k − 2l − t)! elements. Proof. We will prove the statement for the element δ(k, l, 0, t). For δ(k, 0, l, t) the proof is analogous. We use the arguments similar to those from the proof of Lemma 3.10. It is enough to show that the stabilizer of δ(k, l, 0, t) under the Sn × Sn -action contains at least (k + l)!2k+l t!k!2k (2l + t)!(n − 2k − 2l − t)! elements. Set Σ0i = σ2i σ2i−1 σ2i+1 σ2i , 1 ≤ i ≤ k + l − 1;

Σ1i = σ2i σ2i−1 σ2i+1 σ2i σ2i−1 , 1 ≤ i ≤ k + l − 1. Then both Σ0i and Σ1i swap the sets {2i−1, 2i} and {2i+1, 2i+2}. It follows that the group H, generated by all Σ0i , consists of all permutations of the set {1, 2}, {3, 4}, . . . , {2k + 2l − 1, 2k + 2l} and is therefore isomorphic to the group Sk+l . Further, it is easy to ˜ generated by all Σ0i and Σ1i , is isomorphic to the wreath product see that the group H, H S2 . From (6) and (3) it follows that the left multiplications with Σ0i and Σ1i stabilizes ˜ stabilizes δ(k, l, 0, t) δ(k, l, 0, t). Therefore the left multiplication with each element of H as well. Now, from (16) and (18) it follows that σi ηi = σi ϑi ϑi+1 θi = ϑi+1 σi ϑi+1 θi = ϑi σi ϑi θi = ϑi ϑi+1 θi = ηi .

432

G. Kudryavtseva, V. Mazorchuk / Central European Journal of Mathematics 4(3) 2006 413–434

for all i = 1, . . . , n − 1. Moreover, σi+1 ηi ηi+2 = σi+1 ϑi+1 θi ϑi+2 θi+2 = σi+1 ϑi+1 ϑi+2 θi θi+2 = ϑi+1 ϑi+2 θi θi+2 = ϑi+1 θi ϑi+2 θi+2 = ηi ηi+2 for all i = 1, . . . , n − 3 by (17) and (16) and σi+1 ηi ϑi+2 = σi+1 ϑi+1 θi ϑi+2 = σi+1 ϑi+1 ϑi+2 θi = ϑi+1 ϑi+2 θi = ηi ϑi+2 for all i = 1, . . . , n − 2 again by (17) and (16). Using this and the fact that ηi commutes with each of θj , ηj , ξj whenever |i−j| > 1 we see that each of the elements σi , 2k+2l−1 ≤ i ≤ 2k + 2l + t, stabilizes δ(k, l, 0, t) under the left multiplication. All these elements generate the group H0 St , which stabilizes δ(k, l, 0, t) and has trivial intersection with ˜ Let H1 = H0 × H. ˜ H. Analogously one shows that there is a group, H2 , isomorphic to the wreath product (Sk S2 ) × S2l+t , such that each element of this group stabilizes δ(k, l, 0, t) with respect to the right multiplication. In addition to this, from (3) we have that conjugation by any element from the group H3 = σ2k+2l+t+1 , . . . , σn−1 Sn−2k−2l−t stabilizes δ(k, l, 0, t). Observe that the group, generated by H1 , H2 and H3 , is a direct product of H1 , H2 and H3 . Hence, using the product rule we derive that the cardinality of the stabilizer of δ(k, l, 0, t) is at least (k + l)!2k+l t!k!2k (2l + t)!(n − 2k − 2l − t)!, and the proof is complete.

Proof (of Theorem 5.1). Comparing Lemma 5.11 and Proposition 2.1(d) we have |T | ≤ |Bn |. Since ϕ : T → Bn is surjective we have |T | ≥ |Bn |. Hence |T | = |Bn | and ϕ is an isomorphism.

Acknowledgment This paper was written during the visit of the ﬁrst author to Uppsala University, which was supported by the Swedish Institute. The ﬁnancial support of the Swedish Institute and the hospitality of Uppsala University are gratefully acknowledged. For the second author the research was partially supported by the Swedish Research Council. We thank Victor Maltcev for informing us about the reference [7]. We would also like to thank the referee for very helpful suggestions.

G. Kudryavtseva, V. Mazorchuk / Central European Journal of Mathematics 4(3) 2006 413–434

433

References [1] J. Baez: “Link invariants of ﬁnite type and perturbation theory”, Lett. Math. Phys., Vol. 26(1), (1992), pp. 43–51. [2] H. Barcelo and A. Ram: Combinatorial representation theory. New perspectives in algebraic combinatorics, Berkeley, CA, 1996–97, pp. 23–90. [3] J. Birman: “New points of view in knot theory”, Bull. Amer. Math. Soc. (N.S.), Vol. 28(2), (1993), pp. 253–287. [4] J. Birman and H. Wenzl: “Braids, link polynomials and a new algebra”, Trans. Amer. Math. Soc., Vol. 313(1), (1989), pp. 249–273. [5] M. Bloss: “The partition algebra as a centralizer algebra of the alternating group”, Comm. Algebra, Vol. 33(7), (2005), pp. 2219–2229. [6] R. Brauer: “On algebras which are connected with the semisimple continuous groups”, Ann. of Math. (2), Vol. 38(4), (1937), pp. 857–872. [7] D. FitzGerald: “A presentation for the monoid of uniform block permutations”, Bull. Aus. Math. Soc., Vol. 68, (2003), pp. 317–324. [8] D. FitzGerald and J. Leech: “Dual symmetric inverse monoids and representation theory”, J. Austral. Math. Soc. Ser. A, Vol. 64(3), (1998), pp. 345–367. [9] T. Halverson and A. Ram: “Partition algebras”, European J. Comb., Vol. 26, (2005), pp. 869–921. [10] V.F.R. Jones: The Potts model and the symmetric group. Subfactors (Kyuzeso, 1993), World Sci. Publishing, River Edge, NJ, 1994, pp. 259–267. [11] S. Kerov: “Realizations of representations of the Brauer semigroup”, Zap. Nauchn. Sem. Leningrad. Otdel. Mat. Inst. Steklov. (LOMI), Vol. 164, (1987); Diﬀerentsialnaya Geom. Gruppy Li i Mekh., Vol. IX, pp. 188–193, 199; translation in J. Soviet Math., Vol. 47(2), (1989), pp. 2503–2507. [12] S. Lipscomb: Symmetric inverse semigroups. Mathematical Surveys and Monographs, Vol. 46, American Mathematical Society, Providence, RI, 1996. [13] V. Maltcev: “Systems of generators, ideals and the principal series of the Brauer semigroup”, Proceedings of Kyiv University, Physical and Mathematical Sciences, Vol. 2, (2004), pp. 59–65. [14] V. Maltcev: “On one inverse subsemigroups of the semigroup Cn ”, to appear in Proceedings of Kyiv University. [15] V. Maltcev: On inverse partition semigroups IP X , preprint, Kyiv University, Kyiv, Ukraine, 2005. [16] P. Martin: “Temperley-Lieb algebras for nonplanar statistical mechanics – the partition algebra construction”, J. Knot Theory Ramiﬁcations, Vol. 3(1), (1994), pp. 51–82. [17] P. Martin: “The structure of the partition algebras”, J. Algebra, Vol. 183(2), (1996), pp. 319–358. [18] P. Martin and A. Elgamal: “Ramiﬁed partition algebras”, Math. Z., Vol. 246(3), (2004), pp. 473–500.

434

G. Kudryavtseva, V. Mazorchuk / Central European Journal of Mathematics 4(3) 2006 413–434

[19] P. Martin and D. Woodcock: “On central idempotents in the partition algebra”, J. Algebra, Vol. 217(1), (1999), pp. 156–169. [20] V. Mazorchuk: “On the structure of Brauer semigroup and its partial analogue”, Problems in Algebra, Vol. 13, (1998), pp. 29–45. [21] V. Mazorchuk: “Endomorphisms of Bn , PBn , and Cn ”, Comm. Algebra, Vol. 30(7), (2002), pp. 3489–3513. [22] M. Parvathi: “Signed partition algebras”, Comm. Algebra, Vol. 32(5), (2004), pp. 1865–1880. [23] A. Vernitski: “A generalization of symmetric inverse semigroups”, preprint 2005. [24] Ch. Xi: “Partition algebras are cellular”, Compositio Math., Vol. 119(1), (1999), pp. 99–109.

DOI: 10.2478/s11533-006-0016-7 Research article CEJM 4(3) 2006 435–448

Accelerating the convergence of trigonometric series Anry Nersessian∗ , Arnak Poghosyan† Institute of Mathematics, National Academy of Sciences of Armenia, Yerevan 375019, Armenia

Received 1 July 2005; accepted 13 March 2006 Abstract: A nonlinear method of accelerating both the convergence of Fourier series and trigonometric interpolation is investigated. Asymptotic estimates of errors are derived for smooth functions. Numerical results are represented and discussed. c Versita Warsaw and Springer-Verlag Berlin Heidelberg. All rights reserved. Keywords: Convergence acceleration, Pade approximation, asymptotic error MSC (2000): 65B99, 42A10, 42A15, 41A21

1

Introduction

It is well known that, for f ∈ L2 (−1, 1), the series ∞

f (x) =

iπnx

fn e

n=−∞

is convergent by L2 -norm

||f || =

1 , fn = 2 1

−1

2

1

f (x)e−iπnx dx

(1)

−1

1/2

|f (x)| dx

.

For practical purposes, approximations are obtained by using only a ﬁnite number of Fourier coeﬃcients {fn }, |n| ≤ N < ∞. As is also well known [32], when we approximate f by truncated Fourier series (partial sums) SN (f ) :=

N n=−N

∗ †

E-mail: [email protected] E-mail: [email protected]

fn eiπnx

(2)

436

A.Nersessian, A.Poghosyan / Central European Journal of Mathematics 4(3) 2006 435–448

or by trigonometric interpolation IN (f ) :=

N n=−N

fn eiπnx ,

fn =

N 1 f (xk )e−iπnxk , 2N + 1 k=−N

xk =

2k , 2N + 1

(3)

the resulting error is strongly dependent on the smoothness of f . Approximating a 2periodic f ∈ C ∞ (R) function by SN or IN (N >> 1) is highly eﬀective. When the approximated function has a point of discontinuity, the above mentioned approximations lead to the Gibbs phenomenon. The ”oscillations” caused by this phenomenon are typically propagated into regions away from the singularity and degrade the quality of the approximations. Diﬀerent ways of treating this deﬁciency have been suggested in the literature (see, for example, [14–17]). The idea of increasing the convergence rate of Fourier series by subtracting a polynomial that represents the discontinuities in the function and some of its ﬁrst derivatives was suggested by A.Krylov in 1906 [19] and later, in 1964, by Lanczos [20, 21] (see also [2, 23] and [18, with references]). The key problem lies in determining the singularity amplitudes. As formulated by Gottlieb [12], these amplitudes could be found from the ﬁrst N Fourier coeﬃcients. This idea has been realized by Eckhoﬀ in a series of papers [5–8] where the values of the ”jumps” are solutions of the corresponding system of linear equations. Let us refer to this approach as the Krylov-Gottlieb-Eckhoﬀ (KGE) method (see also [3, 11, 13, 22] and, for the multidimensional case, [25, 28]). Application of Pade approximants [1] to Fourier series has been studied by several investigators. The general form of Fourier-Pade representation has been suggested by Cheney [4], but he does not discuss algorithms for computing coeﬃcients, rates of convergence, and so forth. Geer [10] introduced and studied a class of approximations to a periodic function f that uses the ideas of Pade (rational approximations). While these approximations do not ”eliminate” the Gibbs phenomenon, they do mitigate its eﬀect. For eliminating the Gibbs phenomenon, algorithms based on Pade-type approximations were described and studied in [9, 26, 29, 30]. In [24], Pade approximants are applied to the asymptotic expansion of coeﬃcients of Fourier series for piecewise smooth functions, leading to a new kind of approximation. In [27], the corresponding asymptotic estimates of errors of these approximations are investigated. Here, we extend the method to trigonometric interpolations. The proposed approximations are exact for a system of quasipolynomials while the KGE-method is exact for a subsystem of polynomials. Thus, we obtain a generalization of the latter. The quasipolynomial approach is nonlinear while the KGE-method is (given the exact jumps) linear. If the jumps and Fourier coeﬃcients of the approximated function are known, then the KGE-method can be constructed without any additional calculations. However, for the quasipolynomial method, we also need the values of some parameters that can be determined from a nonlinear system of equations with jumps in the coeﬃcients. This additional complexity in calculation yields round oﬀ errors of the approximations that are more precise and more stable. Theorems and numerical examples are presented. Moreover, comparisons between the quasipolynomial and the

A.Nersessian, A.Poghosyan / Central European Journal of Mathematics 4(3) 2006 435–448

437

KGE-method are made. We expect that the proposed approximations, especially insofar as they are derived by a tool as ﬂexible as the system of quasipolynomials, should result in new algorithms of increased precision and robustness.

2

KGE-method

We say that f ∈ C q [−1, 1], q ≥ 0 if f (q) is continuous in [−1, 1]. Denote Ak (f ) = f (k) (1) − f (k) (−1), k = 0, · · · , q. The idea of the KGE-method is to split the given function f ∈ C q [−1, 1] into two parts q−1 f (x) = F (x) + Ak (f )Bk (x), (4) k=0

where F is a relatively smooth function and Bk (x) are 2-periodic Bernoulli polynomials with Fourier coeﬃcients ⎧ ⎪ ⎨ 0, n=0 (5) Bk,n = n+1 ⎪ ⎩ (−1) k+1 , n = ±1, ±2, ... 2(iπn)

Approximating the function F by truncated Fourier series leads to the KGE-approximation SN,q (f ) =

N

iπnx

Fn e

+

n=−N

q−1

Ak (f )Bk (x),

(6)

k=0

where the coeﬃcients Fn can be expressed by fn and Bk,n from (1), (4), and (5). Similarly, approximating the function F by trigonometric interpolation leads to KGEinterpolation (see (3)) IN,q (f ) =

N

Fn e

iπnx

n=−N

+

q−1

Ak (f )Bk (x),

(7)

k=0

k )n from (4). where the discrete coeﬃcients Fn can be expressed by fn and (B Theorem 2.1. Suppose f ∈ C q [−1, 1], q ≥ 1, f (q+1) ∈ L1 (−1, 1). Then RN,q (f ) := f (x) − SN,q (f ) = Aq (f )

(−1)n+1 eiπnx + o(N −q ), N → ∞. q+1 2(iπn)

(8)

|n|>N

Proof. By q-fold integration by parts in (1), we have that 1 q−1 (−1)n+1 Ak (f ) 1 fn = + f (q) (x)e−iπnx dx. k+1 q 2 (iπn) 2(iπn) −1 k=0

(9)

438

A.Nersessian, A.Poghosyan / Central European Journal of Mathematics 4(3) 2006 435–448

Therefore,

RN,q (f ) =

Fn eiπnx ,

(10)

|n|>N

where (−1)n+1 Aq (f ) 1 + Fn = q+1 2 (iπn) 2(iπn)q+1

1

f (q+1) (x)e−iπnx dx.

(11)

−1

Note that, according to the well-known Riemann-Lebesgue theorem [32], the second term is o(n−q−1 ) as n → ∞. This concludes the proof. Similarly, we prove the following result: Theorem 2.2. Suppose f ∈ C q [−1, 1], q ≥ 1, f (q+1) ∈ L1 (−1, 1). Then rN,q (f ) := f (x) − IN,q (f ) = Aq (f )× |n|>N

N ∞ (−1)n+1 eiπnx (−1)n+s+1 eiπnx +o(N −q ), N → ∞. (12) + q+1 q+1 q+1 2(iπn) 2(iπ) (n + s(2N + 1)) n=−N s=−∞ s=0

Proof. Just note that, from (11), we have at least Fn = O(n−2 ), n → ∞, and so Fn =

∞

Fn+s(2N +1) .

s=−∞

3

Quasipolynomial (QP-) method

3.1 The essential features of quasipolynomial approximation [24] are both the application of Pade approximants to the asymptotic expansion of fn (see 9) and the solution of the corresponding system of nonlinear equations (a system that arises in the theory of Pade approximations). Consider a ﬁnite sequence of complex numbers θ := {θk }m k=1 , m ≥ 1, and denote k−1 Δ0n (θ) = An (f ), Δkn (θ) = Δk−1 n (θ) + θk Δn−1 (θ), k ≥ 1.

If n < 0, we set Δkn (θ) = 0. It is easy to verify that, for x = −1/θ1 , q−1

Aq−1 θ1 1 Ak x = x (Ak + θ1 Ak−1 )xk . + 1 + θ x 1 + θ x 1 1 k=0 k=0 q−1

k

q

(13)

A.Nersessian, A.Poghosyan / Central European Journal of Mathematics 4(3) 2006 435–448

439

Note that, for θ1 = 0, the sum on the left side of (13) remains unchanged. Iterating this transformation up to m times yields the following formula (x = −1/θk ; k = 1, · · · , m; m ≤ q − 1): q−1

k

Ak x = x

k=0

q

m k=1

θk Δk−1 1 q−1 (θ) k + Δm k m k (θ)x . (1 + θ x) s s=1 s=1 (1 + θs x) k=0 q−1

(14)

Suppose f ∈ C q [−1, 1]. Applying transformation (14) to the ﬁrst term of (9) with (iπn)−1 instead of x, we derive

where

and

fn = Qn + Pn , n = 0,

(15)

q−m−1 (−1)n+1 (iπn)m Δm k (θ) Qn = m 2 s=1 (iπn + θs ) k=0 (iπn)k+1

(16)

m k−1 (−1)n+1 θk Δq−1 (θ)(iπn)k Pn = + k 2(iπn)q+1 k=1 s=1 (iπn + θs )

1 q−1 (−1)n+1 (iπn)m Δm 1 k (θ) + + f (q) (t) e−iπnt dt. m k+1 q 2 k=1 (iπn + θk ) k=q−m (iπn) 2(iπn) −1

(17)

Inasmuch as (15) holds, we can split the function f into two parts f (x) = Q(x) + P (x), where Q(x) =

∞

∞

Qn eiπnx , P (x) =

n=−∞ n=0

Pn eiπnx , P0 = f0 .

(18)

(19)

n=−∞

Approximating P by the truncated Fourier series leads to SN,q,m (f ) = Q(x) +

N

(fn − Qn )eiπnx

(20)

n=−N

and approximating P by trigonometric interpolation leads to IN,q,m (f ) = Q(x) +

N

n )eiπnx . (fn − Q

(21)

n=−N

It is important to note that, for θ1 = θ2 = · · · = θm = 0, approximations SN,q,m (f ) and IN,q,m (f ) coincide with the KGE-approximation and the KGE- interpolation, respectively. Approximation properties of SN,q,m and IN,q,m are strongly connected with the smoothness of the function P or, put another way, with the rate of convergence of Pn to zero. This condition leads to the following system of nonlinear equations (see the second term in (17)) for the unknown vector θ: Δm k (θ) = 0, k = q − m, · · · , q − 1.

(22)

440

A.Nersessian, A.Poghosyan / Central European Journal of Mathematics 4(3) 2006 435–448

Note that, if θ is a solution of (22) and f ∈ C q [−1, 1], f (q+1) ∈ L1 (−1, 1), then Pn = O(n−q−1 ), n → ∞. We call approximations (20) and (21), together with (22), QPapproximation and QP-interpolation, respectively. Actually, we apply the Pade approximation [q+m-1/m] to the sum on the left side of (14) [1].

3.2 We are interested in the asymptotic behavior of RN,q,m (f ) := f (x) − SN,q,m (f ) and rN,q,m (f ) := f (x) − IN,q,m (f ). By γk (m), k = 0, · · · , m we denote the coeﬃcients of the polynomial m

(1 + θk x) ≡

k=1

γk (m)xk .

k=0

Note that Δm k (θ)

m

= Ak (f ) +

m

γs (m)Ak−s (f ).

s=1

Hence, the system (22) can be written in the form m

γs (m)Ak−s+q−m−1 (f ) = −Ak+q−m−1 (f ), k = 1, · · · , m.

(23)

s=1

Denote Urm = [Ak−s+r (f )], k, s = 1, · · · , m. Theorem 3.1. [27] Suppose f ∈ C q [−1, 1], q ≥ 1, f (q+1) ∈ L1 [−1, 1] and m detUq−m−1 = 0.

Then, with θ from (22), the following holds: RN, q,m (f ) = (−1)m

m+1 detUq−m (−1)n+1 eiπnx + o(N −q ), N → ∞. m detUq−m−1 2(iπn)q+1

(24)

|n|>N

Proof. From (18) and (20), we have RN, q,m (f ) =

Pn eiπnx ,

(25)

|n|>N

where (see (17)) (−1)n+1 Pn = 2(iπn)q+1

Aq (f ) +

m k=1

θk Δk−1 q−1 (θ) k θs 1 + s=1 iπn

+ o(n−q−1 ), n → ∞.

(26)

A.Nersessian, A.Poghosyan / Central European Journal of Mathematics 4(3) 2006 435–448

441

It now follows that (−1)n+1 eiπnx + o(N −q ), N → ∞. 2(iπn)q+1

RN,q,m (f ) = Δm q (θ)

(27)

|n|>N

Here, we use the fact that m−1 m−2 Δm (θ) + θm Δm−1 (θ) + θm−1 Δm−2 q (θ) = Δq q−1 (θ) = Δq q−1 (θ)+

+θm Δm−1 q−1 (θ)

=

Δ0q (θ)

+

m

θk Δk−1 q−1 (θ)

= Aq (f ) +

k=1

m

θk Δk−1 q−1 (θ).

k=1

According to Cramer’s rule, γs (m) =

Ms , s = 1, · · · , m, m detUq−m−1

where {Ms } are the corresponding minors. Consequently, Δm q (θ) = Aq (f ) +

m

γs (m)Aq−s (f ) =

s=1

m m+1 1 m detUq−m Ms Aq−s (f ) = (−1) . = Aq (f ) + m m detUq−m−1 detUq−m−1 s=1

m Theorem 3.2. Suppose f ∈ C q [−1, 1], q ≥ 1, f (q+1) ∈ L1 [−1, 1], and detUq−m−1 = 0. Then, for θ from (22), the following holds: m+1 detUq−m × rN, q,m (f ) = (−1) m detUq−m−1 m

⎛

⎞ N

∞

(−1)n+s+1 eiπnx ⎜ (−1)n+1 eiπnx ⎟ −q + ⎝ ⎠ + o(N ), N → ∞. q+1 q+1 q+1 2(iπn) 2(iπ) (n + s(2N + 1)) n=−N s=−∞ |n|>N

s=0

(28) Proof. Using the relation (at least Pn = O(n−2 ), n → ∞) ∞

Pn =

Pn+s(2N +1) ,

s=−∞

we obtain rN,q,m (f ) =

|n|>N

iπnx

Pn e

−

N n=−N

iπnx

e

∞

Pn+s(2N +1) .

s=−∞ s=0

Proceeding as in the proof of theorem 3.1, this concludes the proof.

442

A.Nersessian, A.Poghosyan / Central European Journal of Mathematics 4(3) 2006 435–448

3.3 For a practical realization of the QP-method, we need the explicit form of the function Q(x). Consider the case m + 1 ≤ q ≤ 2m and θs = 0, θs = θk , s, k = 1, · · · , m. Given the relation m (−1)m−k−1 θjm−k−1 (iπn)m−k−1 m m = , k = 0, · · · , q − m − 1, (iπn + θ ) (θ − θ ) j s j s=1 (iπn + θs ) s=1 j=1 s=j

we expand Qn into simple fractions q−m−1 m (−1)n 1 m−k m−k−1 m Δm θj . Qn = k (θ)(−1) 2 j=1 (iπn + θj ) s=1 (θs − θj ) k=0

(29)

s=j

According to the representation ∞ (−1)n eiπnx 1 −θj x = e iπn + θj shθj n=−∞

from (29), for the case m + 1 ≤ q < 2m, we derive Q(x) =

m j=1

2 sh(θj )

q−m−1

e−θj x m

s=1 (θs − θj ) s=j

m−k m−k−1 Δm θj . k (θ)(−1)

(30)

k=0

If q = 2m, then m m−1 Δm e−θj x m−1 (θ) m + Δm (θ)(−1)m−k θjm−k−1 . Q(x) = m 2 k=1 θk j=1 2sh θj s=1 (θs − θj ) k=0 k

(31)

s=j

The explicit form of Q(x) in other cases can be calculated similarly. In general, we have the following representation: Lemma 3.3. [24]. Let {αs } , s = 1, ..., , 1 ≤ < ∞ , be a ﬁnite set of complex numbers and Υ ⊆ {αs } a subset of integers. Then ‡ ∞ (−1)k+1 p(k)eiπkx p(z)eiπzx =π Res , βs βs z=αr sin(πz) (k − α (z − α ) s) s s=1 s=1 r=1 k=−∞ k∈Υ /

where {βs }, s = 1, ..., is a set of positive integers, p(z) is a polynomial of degree less than ‡ ‡ s=1 βs − 1, and x = (x + 1) (mod2) − 1, −1 < x < 1. Now it follows that, in general, Q(x) is a quasipolynomial of the form Q(x) = ak xpk eiωk x , k

A.Nersessian, A.Poghosyan / Central European Journal of Mathematics 4(3) 2006 435–448

443

where ωk ∈ C and where {pk } is a set of nonnegative integers. We have calculated the explicit form of Q(x) for speciﬁc cases. Calculations are carried out using the MATHEMATICA software package [31]. For a given q and m, denote Qq,m (x) = Q(x). Some of them are A0 A0 −θ1 x − e , 2θ1 2 sh θ1

Q2,1 (x) = Q3,1 (x) = − Q3,2 (x) = −

A1 A1 + A0 θ1 A1 +x + e−θ1 x , 2 2θ1 2θ1 2 θ1 sh θ1

A0 θ 1 A0 θ 2 e−θ1 x + e−θ2 x , 2 (θ1 − θ2 ) sh θ1 2 (θ1 − θ2 ) sh θ2

A1 θ13 + A2 (θ12 − 6) x + Q4,1 (x) = − 12θ13 2

Q4,2 (x) =

A2 + A1 θ1 A2 A2 A0 − 2 + x2 − 2 e−θ1 x , θ1 4θ1 2 θ1 sh θ1

A 1 + A0 θ 2 A 1 + A0 θ 1 A1 + A0 (θ1 + θ2 ) e−θ1 x − e−θ2 x + , 2 (θ1 − θ2 ) sh θ1 2 (θ1 − θ2 ) sh θ2 2θ1 θ2

Q4,3 (x) = −

A0 θ12 A0 θ22 e−θ1 x − e−θ2 x − 2 (θ1 − θ2 )(θ1 − θ3 ) sh θ1 2 (θ2 − θ1 )(θ2 − θ3 ) sh θ2 −

A0 θ32 e−θ3 x . 2 (θ3 − θ1 )(θ3 − θ2 ) sh θ3

Similar calculations can be carried out for multiple θ. For example, when θ1 = θ2 = θ, q = 3, m = 2, we have Q3,2 (x) =

A0 (2θcth θ − 1 + 2xθ)e−θx . 2sh θ

If θ1 = θ2 = θ3 = θ, q = 4, m = 3, we derive Q4,3 (x) =

A0 e−θx (2xθ(5 − 3θcth θ) − 3x2 θ2 − 6θ2 cth2 θ + 3θ2 + 10θcth θ − 2). 4sh θ

For m = 1, system (22) can easily be solved symbolically. In particular, the explicit forms of some of the quasipolynomials are derived to be A0 AA1 x A20 0 e − , 1 2A1 2sh A A0 A2 A21 A21 A31 A0 x A1 Q3,1 (x) = − 2 + x − + e , A2 2A2 2 2A2 2A2 sh A1 Q2,1 (x) =

A4 A1 A22 Q4,1 (x) = − − 23 + +x 12 2A3 12A3

A3 A0 − 22 2 2A3

+x

2

A2 A1 − 2 4 4A3

A3

A3 e A 2 + 22 A3 . 2A3 sh A2 x

444

4

A.Nersessian, A.Poghosyan / Central European Journal of Mathematics 4(3) 2006 435–448

Numerical results

For a given f , q, and m, we put m det(Uq−m−1 ) . aq,m (f ) = Aq det(U m+1 )

(32)

q−m

The constant aq,m (f ) describes the eﬀectiveness of the QP-approximation (Theorem 3.1) compared to the KGE-approximation (Theorem 2.1) as well as the eﬀectiveness of the QP-interpolation (Theorem 3.2) compared to the KGE-interpolation (Theorem 2.2) when N >> 1. Let us consider two typical examples. All calculations are carried out using MATHEMATICA software on a Pentium 4 computer. First, consider the Bessel function f (x) = J0 (14x − 1).

(33)

In Figure 1, the graphics of aq,m (f ) for (33) are represented when q = 8, 9, 10 and 1 ≤ m ≤ q −1. We observe that the QP-method is more precise than the KGE-method almost 250 times for q = 8; m = 4 and more than 300 times for q = 10; m = 4, 6. Figure 1 also shows the optimal values of m when parameter q is ﬁxed.

a8,m

a9,m 250

250

a10,m 300

50

50

50 m

m 1

4

7

1

4 5

8

m 1

5

9

Fig. 1 Graphics of aq,m (f ) for (33) when q = 8, 9, 10 and 1 ≤ m ≤ q − 1. Results in Figure 1 are asymptotic (N 1). It is interesting to see the numerical behavior of the QP-method for both small and moderate values of N . We illustrate the results for just the QP-interpolation because our experiments show that, in general, the behavior of the QP-approximation (see [24] and [27] for details) is very similar to that of the QP-interpolation. The actual eﬀectiveness (in a uniform metric) of the QPinterpolation compared to the KGE-interpolation can be represented by the ratio aN,q,m (f ) =

max|x|≤1 |rN,q (f )| . max|x|≤1 |rN,q,m (f )|

(34)

In Table 1, approximate values of aN,8,4 are shown for (33). Calculations are carried out with 64 digits of precision. Comparison with the theoretical value a8,4 = 271.1 shows that experimental and theoretical estimates are rather close for N ≥ 32. In Figure 2, the uniform errors are scaled logarithmically. Here, we compare the QP- and the KGE-interpolations for q = 8

A.Nersessian, A.Poghosyan / Central European Journal of Mathematics 4(3) 2006 435–448

445

N

8

16

32

64

128

256

512

aN,8,4

97

206

250

264

269

270

271

14

20 17.5 15 12.5 10 7.5 5

lgerror

lgerror

Table 1 Approximate values of aN,8,4 for diﬀerent N while interpolating the function (33).

KGE QP 0

100

200

300

400

12 10 8 KGE QP

6 4

500

0

100

200

N

300

400

500

N

Fig. 2 Uniform errors in log scale, f deﬁned by (33), q = 8, m = 4, N ≤ 512. Left: with 64 digits of precision, Right: with standard precision. and m = 4. The left ﬁgure corresponds to calculations with 64 digits of precision; the right ﬁgure we obtain from standard MATHEMATICA precision calculations. We see that, even in standard machine precision, the QP-interpolation is much more precise than the KGE-interpolation. For N ≥ 100, the QP-method is nearly 103 (compare this with the theoretically-predicted value of 271) times more precise than the KGEmethod. Furthermore, the QP-method is less sensitive to round-oﬀ errors. Now consider the second example f (x) =

1 . 1.1 − x

(35)

This function has the greatest increase of Ak jumps within the class of analytic functions in the neighborhood of the interval [−1, 1]. In Figure 3, the graphics of aq,m are represented. Approximate values of aN,8,4 are displayed in Table 2. For this example, when N ≥ 256, experimental results aN,8,4 are close to the theoretical estimate a8,4 = 72.9. Table 2 Approximate values of aN,8,4 for diﬀerent N while interpolating the function (35). N

8

16

32

64

128

256

512

1024

aN,8,4

3414

433

237

62

44

62

68

71

In Figure 4, the logarithms of the uniform errors are represented. The left ﬁgure corresponds to calculations with 64 digits of precision; the right ﬁgure is obtained from stan-

446

A.Nersessian, A.Poghosyan / Central European Journal of Mathematics 4(3) 2006 435–448

a8,m 70

a10,m 250

a9,m 120 50

20

50 1

4

7

m

m

m 1

4 5

8

1

5

9

15 12.5 10 7.5 5 2.5 0

lgerror

lgerror

Fig. 3 The graphs of aq,m for ﬁxed values of q (q = 8, 9, 10) and 1 ≤ m ≤ q when (35) is approximated.

KGE QP 0

100

200

300

400

500

12 10 8 6 4 2 0

KGE QP 0

100

N

200

300

400

500

N

Fig. 4 Uniform errors in log scale, f deﬁned by (35), q = 8, m = 4, N ≤ 512. Left: with 64 digits of precision, Right: with standard precision. dard precision calculations. With standard precision and N ≥ 200 the QP-interpolation is 105 times more precise than the KGE-interpolation. For practical application of the QP-method, the numerical values of jumps Ak (f ) are also needed. These values can be recovered from Fourier coeﬃcients or from discrete Fourier coeﬃcients as shown in [5–8]. Numerical experiments show [24] that the application of this procedure to the QP-method is acceptable and, in general, maintains all characteristic features.

Acknowledgment The authors were supported in part by ISTC Grant A-823

References [1] G.A. Baker and P. Graves-Morris: Pade Approximants. Encyclopedia of mathematics and its applications, 2nd ed., Cambridge Univ. Press, Cambridge, 1996.

A.Nersessian, A.Poghosyan / Central European Journal of Mathematics 4(3) 2006 435–448

447

[2] G. Baszenski, F.-J. Delvos and M. Tasche: “A united approach to accelerating trigonometric expansions”, Comput. Math. Appl., Vol. 30(3–6), (1995), pp. 33–49. [3] W. Cai, D. Gottlieb and C.W. Shu: “Essentially non oscillatory spectral Fourier methods for shock wave calculations”, Math. Comp., Vol. 52, (1989), pp. 389–410. [4] E.W. Cheney: Introduction to Approximation Theory, McGraw-Hill, New York, 1996. [5] K.S. Eckhoﬀ: “Accurate and eﬃcient reconstruction of discontinuous functions from truncated series expansions”, Math. Comp., Vol. 61, (1993), pp. 745–763. [6] K.S.Eckhoﬀ: “Accurate reconstructions of functions of ﬁnite regularity from truncated Fourier series expansions”, Math. Comp., Vol. 64, (1995), pp. 671–690. [7] K.S. Eckhoﬀ: “On a high order numerical method for functions with singularities”, Math. Comp., Vol. 67, (1998), pp. 1063–1087. [8] C.E. Wasberg: On the numerical approximation of derivatives by a modiﬁed Fourier collocation method, Thesis (PhD), Department of Mathematics, University of Bergen, Norway, 1996. [9] T.A. Driscoll and B. Fornberg: “A Pade-based algorithm for overcoming the Gibbs phenomenon”, Numerical Algorithms, Vol. 26, (2000), pp. 77–92. [10] J. Geer: “Rational trigonometric approximations using Fourier series partial sums”, J. Sci. Computing, Vol. 10(3), (1995), pp. 325–356. [11] D. Gottlieb: “Spectral methods for compressible ﬂow problems”, In: Soubbaramayer and J.P. Boujot (Eds.): Proc. 9th Internat. Conf. Numer. Methods Fluid Dynamics, Lecture Notes in Phys., Vol. 218, Saclay, France, Springer-Verlag, Berlin and New York, 1985, pp. 48–61. [12] D. Gottlieb: “Issues in the application of high order schemes”, In: M.Y. Hussaini, A. Kumar and M.D. Salas (Eds): Proc. Workshop on Algorithmic Trends in Computational Fluid Dynamics (Hampton, Virginia, USA), Springer-Verlag, ICASE /NASA LaRC Series, 1991, pp. 195–218. [13] D. Gottlieb, L. Lustman and S.A. Orszag: “Spectral calculations of one-dimensional inviscid compressible ﬂows”, SIAM J. Sci. Statist. Comput., Vol. 2, (1981), pp. 296– 310. [14] D. Gottlieb, C.W. Shu, A. Solomonoﬀ and H. Vandevon: “On the Gibbs Phenomenon I: Recovering exponential accuracy from the Fourier partial sum of a non-periodic analytic function”, J. Comput. Appl. Math., Vol. 43, (1992), pp. 81–92. [15] D. Gottlieb and C.W. Shu: On the Gibbs Phenomenon III: Recovering Exponential Accuracy in a sub-interval from the spectral partial sum of a piecewise analytic function, ICASE report, 1993, pp. 93–82. [16] D. Gottlieb and C.W. Shu: “On the Gibbs phenomena IV: Recovering exponential accuracy in a sub-interval from a Gegenbauer partial sum of a piecewise analytic function”, Math. Comp., Vol. 64, (1995), pp. 1081–1096. [17] D. Gottlieb and C.W. Shu: “On the Gibbs Phenomenon V: Recovering Exponential Accuracy from collocation point values of a piecewise analytic function”, Numer. Math., Vol. 33, (1996), pp. 280–290.

448

A.Nersessian, A.Poghosyan / Central European Journal of Mathematics 4(3) 2006 435–448

[18] W.B. Jones and G. Hardy: “Accelerating Convergence of Trigonometric Approximations”, Math. Comp., Vol. 24, (1970), pp. 47–60. [19] A. Krylov: On an approximate calculations, Lectures delivered in 1906 (in Russian), St Peterburg, Tipolitography of Birkenfeld, 1907. [20] C. Lanczos: “Evaluation of noisy data”, J. Soc. Indust. Appl. Math., Ser. B Numer. Anal., Vol. 1, (1964), pp. 76–85. [21] C. Lanczos: Discourse on Fourier Series, Oliver and Boyd, Edinburgh, 1966. [22] P.D. Lax: “Accuracy and resolution in the computation of solutions of linear and nonlinear equations”, In: C. de Boor and G.H. Golub (Eds.): Recent Advances in Numerical Analysis, Proc. Symposium Univ of Wisconsin-Madison, Academic Press, New York, 1978, pp. 107–117. [23] J.N. Lyness: “Computational Techniques Based on the Lanczos Representation”, Math. Comp., Vol. 28, (1974), pp. 81–123. [24] A. Nersessian: “Bernoulli type quasipolynomials and accelerating convergence of Fourier Series of piecewise smooth functions (in Russian)”, Reports of NAS RA, Vol. 104(4), (2004), pp. 186–191. [25] A. Nersessian and A. Poghosyan: “Bernoulli method in multidimensional case”, Preprint No20 Ar-00, Deposited in ArmNIINTI 09.03.00, (2000), pp. 1-40 (in Russian). [26] A. Nersessian and A. Poghosyan: “On a rational linear approximation on a ﬁnite interval”, Reports of NAS RA, Vol. 104(3), (2004), pp. 177–184 (in Russian). [27] A. Nersessian and A. Poghosyan: “Asymptotic estimates for a nonlinear acceleration method of Fourier series”, Reports of NAS RA (in Russian), to be published. [28] A. Nersessian and A. Poghosyan: “Asymptotic errors of accelerated two-dimensional trigonometric approximations”, In: G.A. Barsegian, H.G.W. Begehr, H.G. Ghazaryan and A. Nersessian (Eds.): Complex Analysis, Diﬀerential Equations and Related Topics, Yerevan, Armenia, September 17-21, 2002, ”Gitutjun” Publishing House, Yerevan, Armenia, 2004, pp. 70–78. [29] A. Poghosyan: “On a convergence of a rational trigonometric approximation”, In: G.A. Barsegian, H.G.W. Begehr, H.G. Ghazaryan and A. Nersessian (Eds.): Complex Analysis, Diﬀerential Equations and Related Topics, Yerevan, Armenia, September 17-21, 2002, ”Gitutjun” Publishing House, Yerevan, Armenia, 2004, pp. 79–87. [30] A. Nersessian and A. Poghosyan: “On a rational linear approximation Fourier Series for smooth functions”, J. Sci. Comput., to be published. [31] S. Wolfram: The MATHEMATICA book, 4th ed., Wolfram Media, Cambridge University Press, 1999. [32] A. Zygmund: Trigonometric Series, Vol. 1,2, Cambridge Univ. Press, Cambridge, 1959.

DOI: 10.2478/s11533-006-0019-4 Research article CEJM 4(3) 2006 449–506

Local geometry of orbits for an ordinary classical Lie supergroup Tomasz Przebinda∗ Department of Mathematics, University of Oklahoma, Norman, OK 73019, USA

Received 22 February 2006; accepted 10 May 2006 Abstract: In this paper we identify a real reductive dual pair of Roger Howe with an Ordinary Classical Lie supergroup. In these terms we describe the semisimple orbits of the dual pair in the symplectic space, a slice through a semisimple element of the symplectic space, an analog of a Cartan subalgebra, the corresponding Weyl group and the corresponding Weyl integration formula. c Versita Warsaw and Springer-Verlag Berlin Heidelberg. All rights reserved. Keywords: Dual pairs, Lie supergroups, orbits, integration formulas MSC (2000): 17B05, 17B75, 22E15

Introduction The purpose of this article is to present a few elementary facts about the local structure of orbits in the symplectic space under the action of a real reductive dual pair, see [6] and [7]. We shall use this material later to study the characters of the representations which occur in Howe’s correspondence. The corresponding facts for the adjoint action of a real reductive group on its Lie algebra is essentaily contained in section one (eleven pages) of part one of [11]. The main results are presented as quickly as possible, with the proofs deﬀered to further sections. These proofs, based on elementary linear algebra, are rather noninteresting, but had to be included. Some of the material included here is contained in an unpublished work of Howe, [5]. However our approach through the Lie superalgebras seems more akin to the standard theory, [11]. ∗

E-mail: [email protected]

450

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

1

A slice through a point

Let M be a manifold and let G be a Lie group acting on M . Let x ∈ M and let Gx be the stabilizer of x in G. Assume that the orbit Gx ⊆ M is a regularly embedded submanifold. A connected submanifold U ⊆ M is called an admissible slice through x if and only if

x ∈ U,

(1.1)

U is G − stable, x

(1.2)

the tangent space Tx (M ) = Tx (U ) ⊕ Tx (Gx),

(1.3)

if g ∈ G and u, u ∈ U are such that gu = u then g ∈ G ,

(1.4)

the map G × U (g, u) → gu ∈ M is a submersion.

(1.5)

x

The condition (1.3) implies that the map μ : GU gu → gx ∈ Gx

(1.6)

is well deﬁned. As shown in [11, part I, pages 15, 16], μ is a locally trivial ﬁbration with the ﬁber U . In other words, for every point gx ∈ Gx there is an open neighborhood W ⊆ Gx, and a diﬀeomorphism φ such that the following diagram commutes: φ

W × U −−−→ μ−1 (W ) ⏐ ⏐ ⏐ ⏐ μ W

=

−−−→

(1.7)

W,

where the left vertical arrow is the projection on the ﬁrst component. Let N ⊆ M be a complete metric subspace. Suppose N is the union of a ﬁnite set of G-orbits. Then, as shown in [12, 8.A.4.5], we can label the orbits O1 , O2 , ...Ok so that for 1 ≤ j ≤ k the set k Ol (1.8) Nj = l=j

is closed in N . Suppose x ∈ Oj for some 1 ≤ j ≤ k. A connected manifold U ⊆ M is called a weakly admissible slice through x if and only if the conditions (1.0), (1.2), (1.5) hold and the intersection of the image of the map (1.5) with Nj (1.9) is equal to Oj , and U ∩ Oj = {x}.

(1.10)

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

2

451

Ordinary classical Lie supergroups and dual pairs

Let D = R, C or H, and let V0 , V1 be two ﬁnite dimensional left vector spaces over D. Set V = V0 ⊕ V1

(2.1)

and deﬁne an element S ∈ End(V ) by S(v0 + v1 ) = v0 − v1 Set

(vo ∈ V0 , v1 ∈ V1 ).

(2.2)

End(V )0 = {x ∈ End(V ); Sx = xS}, End(V )1 = {x ∈ End(V ); Sx = −xS},

(2.3)

GL(V )0 = GL(V ) ∩ End(V )0 . The real vector space End(V )0 is a Lie algebra, with the usual commutator [x, y] = xy − yx. The adjoint action of GL(V )0 on End(V ) Ad(g)x = gxg −1

(g ∈ GL(V )0 , x ∈ End(V ))

preserves both End(V )0 and End(V )1 . Furthermore the anticommutator End(V )1 × End(V )1 (x, y) → {x, y} = xy + yx ∈ End(V )0

(2.4)

is R-bi-linear and GL(V )0 -equivariant. Set x, y = trD/R {Sx, y}

(x, y ∈ End(V )).

(2.4’)

(Here trD/R (y) is the trace of y ∈ End(V ) viewed as an endomorphism of V over R.) It is easy to see that the form , is preserved under the action of GL(V )0 . Lemma 2.1. The restriction of the bilinear form , to End(V )1 is symplectic and non-degenerate. Moreover, the group homomorphism Ad : G → Sp(End(V )1 , , ) maps the groups G0 = {g ∈ GL(V )0 ; g|V1 = 1}, and G1 = {g ∈ GL(V )0 ; g|V0 = 1} injectively onto an irreducible dual pair of type II in the symplectic group Sp(End(V )1 , , ). Proof. The following map Hom(V0 , V1 ) ⊕ Hom(V1 , V0 ) (A, B) → xA,B ∈ End(V )1 xA,B (v0 + v1 ) = Bv1 + Av0

(v0 ∈ V0 , v1 ∈ V1 ),

(2.5)

452

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

is an R-linear bijection. Furthermore, for v0 ∈ V0 and v1 ∈ V1 , and for any A, A ∈ Hom(V0 , V1 ) and B, B ∈ Hom(V1 , V0 ) we have xA,B xA ,B (v0 + v1 ) = BA v0 + AB v1 , and therefore S(xA,B xA ,B − xA ,B xA,B )(v0 + v1 ) = (BA − B A)v0 + (A B − AB )v1 . Hence, xA,B , xA ,B = trD/R (SxA,B xA ,B + xA ,B SxA,B ) = trD/R (S(xA,B xA ,B − xA ,B xA,B )) = trD/R (BA − B A) + trD/R (A B − AB )

(2.6)

= 2trD/R (BA − B A). Thus the form , is symplectic. It is easy to check that if trD/R (BA − B A) = 0 for all A and B then A = 0 and B = 0. Thus the form , is non-degenerate. The groups G0 and G1 are isomorphic to GL(V0 ) and GL(V1 ) by restriction. Further, the action of the groups G0 and G1 on Hom(V0 , V1 ) induced by the isomorphism (2.5), embeds these groups into GL(Hom(V0 , V1 )). It is not hard to check that G0 and G1 are mutual centralizers in GL(Hom(V0 , V1 )), and hence form a dual pair of type II in the symplectic group. Let ι be a possibly trivial involution on D. Let τ0 be a non-degenerate ι-hermitian form on V0 , and let τ1 be a non-degenerate ι-skew-hermitian form on V1 . Set τ = τ0 ⊕ τ1 . Then τ (u, v) = ι(τ (v, Su)) (u, v ∈ V ). (2.7) Deﬁne

g(V, τ )0 = {x ∈ End(V )0 ; τ (xu, v) = τ (u, −xv), u, v ∈ V }, g(V, τ )1 = {x ∈ End(V )1 ; τ (xu, v) = τ (u, Sxv), u, v ∈ V },

(2.8)

G(V, τ )0 = {g ∈ GL(V )0 ; τ (gu, gv) = τ (u, v), u, v ∈ V }. Clearly, G(V, τ )0 is a Lie subgroup of GL(V )0 , with the Lie algebra g(V, τ )0 . Moreover, it is easy to check that the anticommutator (2.4) maps g(V, τ )1 × g(V, τ )1 into g(V, τ )0 . Furthermore, the adjoint action of G(V, τ )0 preserves g(V, τ )0 , g(V, τ )1 , and the form , . Lemma 2.2. The restriction of the bilinear form , to g(V, τ )1 is symplectic and non-degenerate. Moreover, Ad : G(V, τ )0 → Sp(g(V, τ )1 , , ) maps the groups G0 = {g ∈ G(V, τ )0 ; g|V1 = 1}, and G1 = {g ∈ G(V, τ )0 ; g|V0 = 1} injectively onto an irreducible dual pair of type I in the symplectic group Sp(g(V, τ )1 , , ).

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

453

Proof. Recall the map End(V0 ) A → A ∈ End(V0 ), τ0 (Au0 , v0 ) = τ0 (u0 , A v0 )

(A ∈ End(V0 )).

Let v1 , v2 , v3 , ... be a basis of V0 , and let v1 , v2 , v3 , ... be the dual basis, in the sense that τ0 (vi , vj ) = δij . (Here δij is the Kronecker delta: δij = 1 if i = j, and δij = 0 if i = j.) Then for A ∈ End(V )0 τ0 (Avi , vi ) = τ0 (vi , A vi ) = ι(τ0 (A vi , vi )). i

i

i

Thus trD/R (A) = trD/R (A )

(A ∈ End(V0 )).

(2.9)

Deﬁne the following maps Hom(V0 , V1 ) w → w∗ ∈ Hom(V1 , V0 ), τ1 (wv0 , v1 ) = τ0 (v0 , w∗ v1 )

(v0 ∈ V0 , v1 ∈ V1 ),

∗

Hom(V1 , V0 ) w → w ∈ Hom(V0 , V1 ), τ0 (wv1 , v0 ) = τ1 (v1 , w∗ v0 ) Then

(2.10)

(v0 ∈ V0 , v1 ∈ V1 ).

τ1 (wv0 , v1 ) = τ0 (v0 , w∗ v1 ) = ι(τ0 (w∗ v1 , v0 )) = ι(τ1 (v1 , w∗∗ v0 )) = τ1 (−w∗∗ v0 , v1 ).

Thus (as is well known, [5]) w∗∗ = −w

(w ∈ Hom(V0 , V1 )).

(2.11)

For x ∈ g(V, τ )1 let wx ∈ Hom(V0 , V1 ) be the restriction of x to V0 . Then x(v0 + v1 ) = wx∗ v1 + wx v0

(v0 ∈ V0 , v1 ∈ V1 ).

Since for x, y ∈ g(V, τ )1 x, y = trD/R (S(xy − yx)), the form , is symplectic. Furthermore, by (2.12), Sxy(v0 + v1 ) = wx∗ wy v0 − wx wy∗ v1

(v0 ∈ V0 , v1 ∈ V1 ).

Thus S(xy − yx)(v0 + v1 ) = (wx∗ wy − wy∗ wx )v0 + (wy wx∗ − wx wy∗ )v1 . Hence, x, y = trD/R (wx∗ wy − wy∗ wx ) + trD/R (wy wx∗ − wx wy∗ ) = 2trD/R (wx∗ wy ) − 2trD/R (wx wy∗ ).

(2.12)

454

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

But, by (2.9), (2.10) and (2.11), trD/R (wx wy∗ ) = trD/R (wy∗ wx ) = trD/R ((wy∗ wx ) )

= trD/R (wx∗ wy∗ ∗ ) = −trD/R (wx∗ wy ).

Thus x, y = 4trD/R (wx∗ wy )

(x, y ∈ g(V, τ )1 ).

(2.13)

As shown in [5], the right hand side of (2.13) deﬁnes a non- degenerate symplectic form on Hom(V0 , V1 ). Since (2.12) deﬁnes an R-linear bijection g(V, τ )1 x → wx ∈ Hom(V0 , V1 ),

(2.13)

the ﬁrst part of the Lemma follows. The groups G0 , G1 deﬁned in (b), are isomorphic to the isometry groups G(V0 , τ0 ), G(V1 , τ1 ), by restriction. As is well known, [5], these isometry groups form an irreducible dual pair of type I in the symplectic group on Hom(V0 , V1 ), equipped with the symplectic form deﬁned by the right hand side of (2.13). Definition 2.3. An irreducible ordinary classical Lie supergroup is a pair (G, g) with g = g0 ⊕ g1 , where either G = GL(V )0 , g0 = End(V )0 , g1 = End(V )1 , as in (2.3),

(II)

G = G(V, τ )0 , g0 = g(V, τ )0 , g1 = g(V, τ )1 , as in (2.9).

(I)

or The pair (G, g) is a supergroup of type II in the case (II) and of type I in the case (I). The space V shall be called the deﬁning module or the deﬁning space for (G, g). If needed, we shall indicate this by writing G = G(V ) and g = g(V ). For the general theory of Lie superalgebras and Lie supergroup see, [8] and [9]. The following Proposition is easy to verify. Proposition 2.4. The restriction of the form , , (see (2.4’)), to g0 is symmetric, non-degenerate and G-invariant. Furthermore, the spaces g0 and g1 are orthogonal. If we identify g with the dual g∗ by y(x) = y, x

(x, y ∈ g)

(a)

then, for x ∈ g1 , the map g1 z → {x, z} ∈ g0

(b)

g0 y → [x, y] ∈ g1 .

(c)

is adjoint to the map In other words, {x, z}, y = z, [x, y]

(y ∈ g0 , x, z ∈ g1 ).

(d)

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

455

Theorem 2.5. Let (G, g) be an irreducible ordinary classical Lie supergroup. Up to conjugation by G there is exactly one automorphism θ of g such that θ|g0 is a Cartan involution on g0 and θ|g1 is a positive compatible complex structure on g1 . The automorphism θ may be realized as follows. Let V = V0 ⊕ V1 be the deﬁning module for (G, g). Then there is a positive deﬁnite hermitian form η on V such that V0 is orthogonal to V1 with respect to η, and if End(V ) x → x† ∈ End(V ) is the adjoint with respect to η, (η(xu, v) = η(u, x† v)), then −x† if x ∈ g0 , θ(x) = Sx† if x ∈ g1 . Moreover, if the Lie supergroup (G, g) is of type I and (D, ι) = (C, 1), then there is an element T ∈ G, unique up to conjugation, such that η( , ) = τ (T , ). Proof. The existence of θ is known (see, for example [3, 8.1, 10.2]). We shall verify the uniqueness. Since θ is an automorphism of g, the restriction θ|g1 is invertible in End(g1 ) and [θy, x] = θ[y, θ−1 x]

(y ∈ g0 , x ∈ g1 ).

In other words, ad(θy)|g1 = (θ|g1 )(ad(y)|g1 )(θ|g1 )−1

(y ∈ g0 ).

Suppose θ1 is another automorphism of g such that θ1 |g0 is a Cartan involution on g0 and θ1 |g1 is a positive compatible complex structure on g1 . Since the Cartan involution on g0 is unique up to conjugation by a element of G, [12, 2.3.2], we may assume that θ1 |g0 = θ|g0 . Then for y ∈ g0 , (θ|g1 )(ad(y)|g1 )(θ|g1 )−1 = ad(θy)|g1 = ad(θ1 y)|g1 = (θ1 |g1 )(ad(y)|g1 )(θ1 |g1 )−1 . Hence, (θ|g1 )−1 (θ1 |g1 ) ∈ Sp(g1 , , )ad(g0 ) . (Here, X Y is the centralizer of Y in X.) But we know from the structure of dual pairs, [7], that this last set is contained in (the centralizer of the identity component of Ad(G) in) Ad(G). For i = 1, 2, 3, ..., n, let (G(i) , g(i) ) be irreducible ordinary classical Lie supergroups, not necessarily of the same type, with the deﬁning modules V (i) . The group G = G(1) × G(2) × G(3) × ... × G(n) and the Lie superalgebra g = g(1) ⊕ g(2) ⊕ g(3) ⊕ ... ⊕ g(n)

456

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

act on the vector space V = V (1) ⊕ V (2) ⊕ V (3) ⊕ ... ⊕ V (n) componentwise. The resulting pair (G, g) shall be called the direct product of the ordinary classical Lie supergroups (G(i) , g(i) ), with the deﬁning module V . An ordinary classical Lie supergroup (G, g) is a ﬁnite direct product of irreducible ordinary classical Lie supergroups, as deﬁned above. Notice that the group G corresponds to the unique reductive dual pair obtained via the action of G on the symplectic space g1 . This correspondence is bijective.

3

The tangent space to the G-orbit through a point x ∈ g1

Let (G, g) be an irreducible ordinary classical Lie supergroup and let x ∈ g. Since the derivative of the adjoint action of the group G on g coincides with the adjoint action of the Lie algebra g0 on g, we may identify the tangent space to Gx at x with [g0 , x] = {[y, x]; y ∈ g0 }.

(3.1)

g1 = {z ∈ g1 ; {x, z} = 0}.

(3.2)

For x ∈ g1 let x

This is the anticommutant of x in g1 . For any subset W ⊆ g1 let W ⊥ = {y ∈ g1 ; y, z = 0 for all z ∈ W }.

(3.3)

Since our symplectic form , is non-degenerate, we have W ⊥⊥ = W,

(3.4)

if W is a vector subspace of g1 . Lemma 3.1. For any x ∈ g1 we have [g0 , x] = (x g1 )⊥ . Proof. Let z ∈ g1 . Then by (2.4.d), z ∈ [g0 , x]⊥ if and only if {x, z} = 0. Thus [g0 , x]⊥ = x g1 , and our claim follows from (3.4). Let θ be as in the Theorem 2.5. Then the formula (x, y) = −x, θy

(x, y ∈ g)

(3.5)

deﬁnes a symmetric positive deﬁnite form on g. In particular, for any x ∈ g1 , the orthogonal complement to [g0 , x] in g1 , with respect to the form (3.5), is equal to θ(x g1 ). The form (3.5) restricts to any subspace of g, and induces a positive deﬁnite form on the quotient of g by any subspace. Let U, V be two vector spaces, over the reals, of the same dimension. Suppose U, V are subspaces or quotients of g (one could be a subspace and the other one the quotient).

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

457

Let L : U → V be a linear map. We deﬁne the absolute value of the determinant of L, |det(L)|, to be the absolute value of the determinant of the matrix of L with respect to any orthonormal basis of U, V, with respect to the form induced by (3.5). Fix x ∈ g1 . The derivative of the map G/Gx gGx → gx ∈ g1

(3.6)

at x may be identiﬁed with the following linear map g0 /gx0 y + gx0 → [y, x] ∈ [g0 , x].

(3.7)

Denote by J(x)

(x ∈ g1 )

the absolute value of the determinant of the map (3.7). We shall give a formula for the function J(x), for x ∈ g1 semisimple, in Corollary 6.9.

4

A G-equivariant localization in g1

In this section we state several theorems which shall be veriﬁed later. As in the previous section, let (G, g) be an irreducible ordinary classical Lie supergroup and let x ∈ g1 . The element x ∈ g1 is called semi-simple (or nilpotent) if and only if x is semi-simple (or nilpotent) as an endomorphism of V . Theorem 4.1. Let x ∈ g1 and let x = xs + xn be the Jordan decomposition of x, as an element of End(V ). (Here xs stands for the semisimple part of x, and xn for the nilpotent part of x.) Then xs and xn belong to g1 . Furthermore, an element y ∈ g1 anti-commutes with x if and only if y anti-commutes with xs and xn . Theorem 4.2. For any x ∈ g1 the semisimple part xs belongs to Cl(Gx), the closure of the orbit Gx. Theorem 4.3. For any x ∈ g1 , x is semisimple if and only if the orbit Gx is closed. 2

Notice that if x ∈ g1 , then x2 = 12 {x, x} ∈ g0 . Let Gx ⊆ G denote the centralizer of 2 2 2 x2 , and let gx , gx0 , gx1 denote the centralizer of x2 in g, g0 , g1 respectively. Theorem 4.4. Let x ∈ g1 be semisimple. 2 2 (a) Suppose ker(x) = 0. Then (Gx , gx ) is the direct product of the irreducible ordinary classical Lie supergroups, with the corresponding dual pairs isomorphic either to (Un , Un ) or to (GLn (D), GLn (D)), where the division algebra D may be diﬀerent than the division algebra over which the deﬁning module V for the supergroup (G, g) was deﬁned. 2 The restriction of the symplectic form , to gx1 is non-degenerate and 2

gx1 = x g1 ⊕ gx1

458

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

is a complete polarization. (Here gx1 = {y ∈ g1 ; xy = yx}.) (b) Let V 0 = ker(x) and let V + = xV . Then V =V0⊕V+ is a direct sum (orthogonal in the type I case) decomposition into graded non-zero subspaces preserved by x. Moreover, 2

Gx = G(V 0 ) × G(V + )x

2

and 2

2

g1 x = g1 (V 0 ) ⊕ g1 (V + )x , where the sum is orthogonal, with respect to the symplectic form , , and g1 (V 0 ) = 0 unless V 0 ∩ V0 = 0 and V 0 ∩ V1 = 0. Moreover, the double anticommutant of x in g1 coincides with the double commutant of x in g1 (V + ): (x g1 )

g1 = g1 (V + )(g1 (V

+ )x )

. x

(c) The maximal possible dimension of the real vector space ( g1 ) g1 is equal to the minimum of the rank of G(V0 ) and the rank of G(V1 ), viewed as real reductive Lie groups. For the x x such that the dimension of ( g1 ) g1 is maximal, we have V 0 ⊆ V0 or V 0 ⊆ V1 , so that g1 (V 0 ) = 0, and (gx ) (x g1 ) g1 = g1 1 . x

An explicit description of the double anticommutant ( g1 ) g1 will be given in the proof of the theorem (see (13.13.1), (13.22.1), (13.31.1), (13.42.1), (13.47.1), (13.47.2), and (13.53.1)). Theorem 4.5. Let x ∈ g1 be semisimple. Then gx1 has a basis for the Gx -invariant neighborhoods of x consisting of admissible slices Ux through x, such that for i = 0, 1, the map 2 Ux y → y 2 |Vi ∈ g0 (Vi )x is an (injective) immersion, (see [10] Vol 1, for the deﬁnition of an immersion.) Theorem 4.6. [3] The set of nilpotent G-orbits in g1 is ﬁnite. Theorem 4.7. Let x ∈ g1 be nilpotent and let W ⊆ g1 be a subspace such that g1 = [g0 , x] ⊕ W. Then the aﬃne space x + W ⊆ g1 has a basis for the neighborhoods of x consisting of weakly admissible slices through x. The following theorem has a substantial overlap with the Proposition 8.2 in [5]

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

459

Theorem 4.8. The map g1 ⊇ Gx → Gx2 ⊆ g0 is injective on the set of semisimple orbits. (Here, in order to simplify the notation we write Gx rather than Ad(G)x, and similarly for Gx2 .)

5

The G-orbits in g1

We retain the notation of the previous section. An element x ∈ g1 , or the pair (x, V ), is called decomposable if and only if there are two non-zero Z/2Z-graded subspaces V , V ⊆ V , (which are orthogonal if (G, g) is of type I), preserved by x and such that V = V ⊕ V . In this case we say that (x, V ) is the direct sum of the elements (x|V , V ) and (x|V , V ). The element (x, V ) is called indecomposable if and only if (x, V ) is not decomposable. Theorem 5.1. For any x ∈ g1 , (x, V ) is the direct sum of indecomposable elements. Proof (a reduction to the case when x is semisimple). Let x = xs + xn be the Jordan decomposition of x. Then, as we know from Theorem 4.1, xn ∈ g1 . Suppose xn = 0. There is a decomposition V = V 1 ⊕ V 2 ⊕ V 3 ⊕ ... into indecomposables with respect to xn , [3, sections 5 and 6]. Since x commutes with xn , each V j is x-invariant, and indecomposable with respect to x (because xn is a polynomial of x). Thus we may assume that x is semisimple. We shall consider this case and complete the argument in sections 8 and 9. Let (G, g), (G , g ) be two irreducible ordinary classical Lie supergroups with the deﬁning spaces V , V respectively. We’ll say that two elements x ∈ g1 , x ∈ g1 are similar if and only if the supergroups (G, g), (G , g ) are of the same type and there is a Z/2Z-graded linear bijection φ : V → V (an isometry in the type I case) such that x = φ−1 x φ. In particular if V = V and (G, g) = (G , g ) then x is similar to x if and only if x and x are in the same G-orbit. In that case we shall write x ≈ x . Theorem 5.2. Let (G, g) be of type I. The following is a complete list of all non-zero semisimple indecomposable elements (x, V ), x ∈ g1 , up to similarity. In each case we indicate which elements of the list are similar, describe an element g ∈ G which provides the similarity, and list the eigenvalues of x.

460

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

V0 = Dv0 ⊕ Dv0 , V1 = Dv1 ⊕ Dv1 ;

τ (v0 , v0 ) = τ (v0 , v0 ) = τ (v1 , v1 ) = τ (v1 , v1 ) = 0, τ (v0 , v0 ) = τ (v1 , v1 ) = 1;

if ι = 1 then let ξ ∈ D \ 0, if ι = 1 then let ξ ∈ C ⊆ D and ξ 2 ∈ / iR; x = x(ξ) : v0 → ξv1 , v1 → ξv0 , v0 → −ι(ξ)v1 , v1 → ι(ξ)v0 ; if D = R then x(ξ) ≈ x(−ξ) has eigenvalues ξ, iξ, −ξ, −iξ; if D = C and ι = 1 then x(ξ) ≈ x(iξ) ≈ x(−ξ) ≈ x(−iξ) has eigenvalues ξ, iξ, −ξ, −iξ; if D = C and ι = 1 then x(ξ) ≈ x(−ξ) has eigenvalues ξ, iι(ξ), −ξ, −iι(ξ);

(a)

if D = H then x(ξ) ≈ x(−ξ) ≈ x(ι(ξ)) ≈ x(−ι(ξ)) has eigenvalues ξ, ι(ξ), −ξ, −ι(ξ), iξ, iι(ξ), −iξ, −iι(ξ); for all D, gx(ξ)g −1 = x(−ξ) if g : v0 → −v0 , v1 → v1 , v0 → −v0 , v1 → v1 ; for D = C and ι = 1, gx(ξ)g −1 = x(iξ) if

g : v0 → −iv0 , v1 → −v1 , v0 → iv0 , v1 → v1 ;

for D = H, gx(ξ)g −1 = x(ι(ξ)) if

g : v0 → jv0 , v1 → jv1 , v0 → jv0 , v1 → jv1 ;

V0 = Dv0 , V1 = Dv1 , C ⊆ D, ι = 1; τ (v0 , v0 ) = = ±1, τ (v1 , v1 ) = δi = ±i; ξ 2 ∈ iR \ 0, sgn(im(ξ 2 )) = − δ; x = x(ξ) : v0 → ξv1 , v1 → ξv0 ;

(b)

x(ξ) ≈ x(−ξ) has eigenvalues ξ, −ξ; gx(ξ)g −1 = x(−ξ) if g : v0 → −v0 , v1 → v1 ;

V0 = Rv0 ⊕ Rv0 , V1 = Rv1 ⊕ Rv1 , D = R;

τ (v0 , v0 ) = τ (v0 , v0 ) = = ±1, τ (v1 , v1 ) = τ (v1 , v1 ) = 0, τ (v0 , v0 ) = 0, τ (v1 , v1 ) = 1; ξ ∈ R \ 0; x = x(ξ) : v0 → ξ(v1 − v1 ), v1 → ξ(v0 − v0 ),

v0 → ξ(v1 + v1 ), v1 → ξ(v0 + v0 );

x(ξ) ≈ x(−ξ) has eigenvalues ξ(1 − i), ξ(1 + i), −ξ(1 − i), −ξ(1 + i); gx(ξ)g −1 = x(−ξ) if g : v0 → −v0 , v1 → v1 , v0 → −v0 , v1 → v1 ;

(c)

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

461

V0 = (Ru0 ⊕ Rv0 ) ⊕ (Ru0 ⊕ Rv0 ),

V1 = (Ru1 ⊕ Rv1 ) ⊕ (Ru1 ⊕ Rv1 ), D = R,

the spaces in parenthesis are isotropic, and τ (u0 , u0 ) = τ (v0 , v0 ) = τ (u1 , u1 ) = τ (v1 , v1 ) = 1;

ξ, η ∈ R, ξ 2 = η 2 , ξη = 0; x = x(ξ, η) :

u0 → ξu1 + ηv1 , u1 → ξu0 + ηv0 , v0 → −ηu1 + ξv1 , v1 → −ηu0 + ξv0 , u0 → −ξu1 + ηv1 , u1 → ξu0 − ηv0 , v0 → −ηu1 − ξv1 , v1 → ηu0 + ξv0 ;

(d)

x(ξ, η) ≈ x(−ξ, η) ≈ x(ξ, −η) ≈ x(−ξ, −η) ≈ x(η, ξ) ≈ x(−η, ξ) ≈ x(η, −ξ) ≈ x(−η, −ξ) has eigenvalues ξ + iη, ξ − iη, −ξ + iη, −ξ − iη, η + iξ, −η + iξ, η − iξ, −η − iξ; gx(ξ, η)g −1 = x(−ξ, η) if g : u0 → −u0 , v0 → v0 , u0 → −u0 , v0 → v0 , u1 → u1 , v1 → −v1 , u1 → u1 , v1 → −v1 ;

gx(ξ, η)g −1 = x(η, ξ) if g : u0 → u0 , v0 → v0 , u0 → u0 , v0 → v0 , u1 → v1 , v1 → −u1 , u1 → −v1 , v1 → u1 ;

Theorem 5.3. Let (G, g) be of type II. The following is a complete list of all non-zero semisimple indecomposable elements (x, V ), x ∈ g1 , up to similarity. In each case we indicate which elements of the list are similar, describe an element g ∈ G which provides the similarity, and list the eigenvalues of x. V0 = Dv0 , V1 = Dv1 ; ξ ∈ D \ 0; x = x(ξ) : v0 → ξv1 , v1 → ξv0 ; if D = H then x(ξ) ≈ x(−ξ) has eigenvalues ξ, −ξ; if D = H then x(ξ) ≈ x(−ξ) ≈ x(ι(ξ)) ≈ x(−ι(ξ))

(a)

has eigenvalues ξ, −ξ, ι(ξ), −ι(ξ); gx(ξ)g −1 = x(−ξ) if g : v0 → −v0 , v1 → v1 ; gx(ξ)g −1 = x(ι(ξ)) if D = H and g : v0 → jv0 , v1 → jv1 ; V0 = Rv0 , V1 = Rv1 ; ξ ∈ R \ 0; x = x(ξ) : v0 → ξv1 , v1 → −ξv0 ; x(ξ) ≈ x(−ξ) has eigenvalues ± iξ; gx(ξ)g −1 = x(−ξ) if g : v0 → −v0 , v1 → v1 ;

(a’)

462

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

V0 = Ru0 ⊕ Rv0 , V1 = Ru1 ⊕ Rv1 , D = R; ξ, η ∈ R \ 0; x = x(ξ, η) : u0 → ξu1 + ηv1 , u1 → ξu0 + ηv0 , v0 → −ηu1 + ξv1 , v1 → −ηu0 + ξv0 ; x(ξ, η) ≈ x(−ξ, η) ≈ x(ξ, −η) ≈ x(−ξ, −η)

(b)

has eigenvalues ξ + iη, ξ − iη, −ξ + iη, −ξ − iη; gx(ξ, η)g −1 = x(−ξ, η) if g : u0 → −u0 , v0 → v0 , u1 → u1 , v1 → −v1 ; gx(ξ, η)g −1 = x(ξ, −η) if g : u0 → u0 , v0 → −v0 , u1 → u1 , v1 → −v1 . Proof (of Theorem 4.1). Let x ∈ g1 and let x = xs +xn be the Jordan decomposition of x, as an element End(V ). Suppose (G, g) is of type II. Notice that Sxs S −1 is semisimple and Sxn S −1 is nilpotent, and that these elements commute. Moreover, Sxs S −1 + Sxn S −1 = SxS −1 = −x = −xs − xn . Thus the uniqueness of the Jordan decomposition in End(V ) implies that Sxs S −1 = −xs and Sxn S −1 = −xn . In other words, xs , xn ∈ g1 . Suppose (G, g) is of type I. Then, as shown above, xs , xn ∈ End(V )1 . Consider the map End(V )1 y → y ∈ End(V )1 , τ (yu, v) = τ (u, y v)

(u, v ∈ V ).

Then y is semisimple if and only if y is semisimple, and y is nilpotent if and only if y is nilpotent. In particular, xs is semisimple and xn is nilpotent. The Theorem 5.1 (for semisimple elements) and Theorem 5.2 imply that Sxs is semisimple. It is easy to check that Sxn is nilpotent and that Sxs Sxn = Sxn Sxs . Since x ∈ g1 , we have, by the deﬁnition (2.8), x = Sx = Sxs + Sxn . Thus the uniqueness of the Jordan decomposition in End(V ) implies xs = Sxs and xn = Sxn . Hence, xs , xn ∈ g1 . Let y ∈ g1 . If y anticommutes with xs and xn then clearly y anticommutes with x = xs + xn . Conversely, suppose yx = −xy. Then, with S as in (2.2), the following computation holds in End(V ): x(Sy) = −Sxy = (Sy)x. Thus x commutes with Sy. Therefore xs and xn commute with Sy. Hence, for z = xs or xn , zy = zS 2 y = −Sz(Sy) = −S(Sy)z = −yz. In other words, xs and xn anti-commute with y.

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

Let

k(k−1)/2

δ(k) = (−1)

=

1 if k ∈ 4Z or 4Z + 1, −1 if k ∈ 4Z + 2 or 4Z + 3.

463

(5.1’)

Theorem 5.4. [3] Let (G, g) be of type I. The following is a complete list of all non-zero nilpotent indecomposable elements (x, V ), x ∈ g1 , up to similarity. m ∈ 4Z; m Dvk , veven ∈ V0 , vodd ∈ V1 ; V = k=0 k

vk = x v0 = 0, 0 ≤ k ≤ m, xvm = 0; τ (vk , vl ) = 0 if l = m − k, τ (vk , vm−k ) = δ(k)δ(

(a) m )sgn(τ0 ), 2

where sgn(τ0 ) = 1 if D = C and ι = 1; m ∈ 4Z, D = R, ι = 1; m+1 Dvk , veven ∈ V0 , vodd ∈ V1 ; V = k=1

vk+1 = xk v1 = 0, 0 ≤ k ≤ m, xvm+1 = 0;

(b)

τ (vk , vl ) = 0 if l = m + 2 − k, τ (vk , vm+2−k ) = δ(k)τ (v1 , vm+1 ), m τ (v1 , vm+1 ) = i sgn(−iτ1 )δ(1 + ) if D = C; 2 τ (v1 , vm+1 ) = j if D = H; m ∈ 4Z, D = H, ι = 1; m+1 V = (Dvk ⊕ Dvk ), veven , veven ∈ V0 , vodd , vodd ∈ V1 ; k=1 = xk v1 = 0, 0 ≤ k ≤ m, xvm+1 = 0, xvm+1 = 0; vk+1 = xk v1 = 0, vk+1

(c)

τ (vk , vl ) = τ (vk , vl ) = 0, 1 ≤ k, l ≤ m + 1,

τ (vk , vl ) = τ (vk , vl ) = 0, l = m + 2 − k,

τ (vk , vm+2−k ) = −τ (vk , vm+2−k ) = δ(k), 1 ≤ k ≤ m + 1;

m ∈ 2Z \ 4Z, D = R, ι = 1; m Dvk , veven ∈ V0 , vodd ∈ V1 ; V = k=0 k

vk = x v0 = 0, 0 ≤ k ≤ m, xvm = 0; τ (vk , vl ) = 0 if l = m − k, τ (vk , vm−k ) = δ(k)isgn(−iτ1 ), (here − iτ1 is hermitian);

(d)

464

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

m ∈ 2Z \ 4Z; m+1 V = Dvk , veven ∈ V0 , vodd ∈ V1 ; k=1

vk+1 = xk v1 = 0, 0 ≤ k ≤ m, xvm+1 = 0;

(e)

τ (vk , vl ) = 0 if l = m + 2 − k, τ (vk , vm+2−k ) = δ(k)τ (v1 , vm+1 ), m τ (v1 , vm+1 ) = δ(1 + )sgn(τ0 ); 2 m ∈ 2Z \ 4Z, D = H, ι = 1; m V = (Dvk ⊕ Dvk ), veven , veven ∈ V0 , vodd , vodd ∈ V1 ; k=0 k

= 0; vk = x v0 = 0, vk = xk v0 = 0, 0 ≤ k ≤ m, xvm = 0, xvm

τ (vk , vl ) = τ (vk , vl ) =

τ (vk , vl ) τ (vk , vl )

τ (vk , vm−k )

=

(f)

= 0, 0 ≤ k, l ≤ m, = 0, l = m − k,

−τ (vk , vm−k )

= δ(k), 0 ≤ k ≤ m;

m ∈ 2Z + 1; m V = (Dvk ⊕ Dvk+1 ), veven , veven ∈ V0 , vodd , vodd ∈ V1 ; k=0 k

vk = x v0 = 0, vk+1 = xk v1 = 0, 0 ≤ k ≤ m, xvm = 0, xvm+1 = 0;

(g)

, vl+1 ) = 0, 0 ≤ k, l ≤ m, τ (vk , vl ) = τ (vk+1

τ (vk , vl+1 ) = τ (vk+1 , vl ) = 0, l = m − k,

) = δ(k)(−1)k , τ (vk+1 , vm−k ) = δ(k)δ(m), 0 ≤ k ≤ m; τ (vk , vm+1−k

The following theorem is well known, and goes back to Jordan. See [3] for details. Theorem 5.5. Let (G, g) be of type II. The following is a complete list of all non-zero nilpotent indecomposable elements (x, V ), x ∈ g1 , up to similarity. V =

m

Dvk , veven ∈ V0 , vodd ∈ V1 ;

k=0 k

(a)

vk = x v0 = 0, 0 ≤ k ≤ m, xvm = 0; V =

m+1

Dvk , veven ∈ V0 , vodd ∈ V1 ;

k=1

(b)

vk+1 = x v1 = 0, 0 ≤ k ≤ m, xvm+1 = 0; k

For a nilpotent element x ∈ g1 the height of x, or the height of (x, V ), is the integer m ≥ 0 such that xm = 0 and xm+1 = 0. In particular x = 0 if and only if the height of x is 0. The pair (x, V ) is called uniform if ker(xm ) = im(x) (= x(V )). These notions are adopted from [1], and have been used in [3].

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

465

Let x ∈ g1 and let x = xs + xn be the Jordan decomposition of x. Let V = V (1) ⊕ V (2) ⊕ ...

(5.2)

be the decomposition of V into a direct (and orthogonal in the type I case) sum, such that each (xn , V (j) ) is uniform (see [3]). Then each V (j) is preserved by xs . As one can see from the proof of Theorems 5.12 and 6.1 in [3], there is a graded xs - invariant subspace F (j) such that V (j) = F (j) ⊕ xn F (j) ⊕ x2n F (j) ⊕ ..., where, in the type I case, xkn F (j) ⊥ xln F (j) , for k + l ≤ mj − 1, and mj is the height of (xn , V (j) ). Since the xn and xs commute, the action of xs on V (j) is determined by the action on the F (j) . This space is equipped with the form j τmj ,j (u, v) = τ (u, xm n v)

(u, v ∈ F (j) ),

(5.3)

in the type I case. As an endomorphism of F (j) , xs is of degree one (i.e. the restriction of xs to F (j) is in End(F (j) )1 ) and (u, v ∈ F (j) ),

τmj ,j (xs u, v) = τmj ,j (u, Sxs v)(−1)mj

(5.4)

If mj ∈ 4Z, then τmj ,j = τmj ,j |F (j) ⊕ τmj ,j |F (j) 0

(5.5)

1

with τmj ,j |F (j) hermitian and τmj ,j |F (j) skew-hermitian. 0 1 If mj ∈ 2Z\4Z, then (5.5) holds with τmj ,j |F (j) skew-hermitian and τmj ,j |F (j) hermitian. 0

1

Thus for mj even we know how to decompose (xs , F (j) ) into indecomposables. Recall the function δ, (5.1’). For mj ∈ 2Z + 1 τmj ,j (u, v) = −δ(mj )ι(τmj ,j (v, u))

(u, v ∈ F (j) ),

τmj ,j |F (j) = 0, and τmj ,j |F (j) = 0. 0

(5.6)

1

Let us write m = mj , τm = τmj ,j and F = F (j) in the last case. We may rewrite (5.6) and (5.4) as τm (u, v) = −δ(m)ι(τm (v, u)), (5.7) F0 , F1 are isotropic subspaces of F, τm (xs u, v) = −τm (u, Sxs v)

(u, v ∈ F ).

As an easy consequence of Theorem 5.5 we deduce the following fact. Theorem 5.6. Suppose (xs , F ), described in (5.7), is indecomposable and non-zero. Then ι = 1 and, up to similarity, F = Dv0 ⊕ Dv1

(v0 ∈ F0 , v1 ∈ F1 ),

x s : v 0 → a 1 v 1 , v 1 → a0 v 0 , where a0 = δ(m)ι(a0 ) and a1 = −δ(m)ι(a1 ) if D = R.

466

6

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

A Cartan subspace, the Weyl group and an integration formula

Definition 6.1. An element x ∈ g1 is regular if and only if the G-orbit through x is of maximal possible dimension. A Cartan subspace h1 ⊆ g1 is the double anticommutant x h1 = ( g1 ) g1 of a regular semisimple element of x ∈ g1 . The Weyl group W (G, h1 ) is the quotient of the stabilizer of h1 in G by the subgroup which acts trivially on h1 . (We shall identify the Weyl group W (G, h1 ) with its image in GL(h1 ).) Proposition 6.2. The following is a complete list of the Cartan subspaces h1 ⊆ g1 and the Weyl groups W (G, h1 ), up to conjugation by an element of G, such that h1 contains a non-zero regular semisimple indecomposable element: Type I V0 = Dv0 ⊕ Dv0 , V1 = Dv1 ⊕ Dv1 ;

τ (v0 , v0 ) = τ (v0 , v0 ) = τ (v1 , v1 ) = τ (v1 , v1 ) = 0, τ (v0 , v0 ) = τ (v1 , v1 ) = 1; x(a) : v0 → av1 , v1 → av0 , v0 → −ι(a)v1 , v1 → ι(a)v0 , a ∈ D; if D = R then h1 = {x(a); a ∈ R}, |W (G, h1 )| = 2, the non-trivial element of W (G, h1 ) maps x(a) to x(−a); if D = C and ι = 1 then h1 = {x(a); a ∈ C}, |W (G, h1 )| = 4,

(a)

the non-trivial elements of W (G, h1 ) map x(a) to x(−a), x(ia), x(−ia); if D = C and ι = 1 then h1 = {x(a); a ∈ C}, |W (G, h1 )| = 2, the non-trivial element of W (G, h1 ) maps x(a) to x(−a); if D = H then h1 = {x(a); a ∈ C}, |W (G, h1 )| = 4, the non-trivial elements of W (G, h1 ) map x(a) to x(−a), x(ι(a)), x(−ι(a)); V0 = Dv0 , V1 = Dv1 , C ⊆ D, ι = 1; τ (v0 , v0 ) = = ±1, τ (v1 , v1 ) = δi = ±i; x(a) : v0 → av1 , v1 → av0 , a ∈ C;

(b)

h1 = {x(a); a = − δiι(a) ∈ C}, |W (G, h1 )| = 2; the non-trivial element of W (G, h1 ) maps x(a) to x(−a); V0 = Rv0 ⊕ Rv0 , V1 = Rv1 ⊕ Rv1 , D = R;

τ (v0 , v0 ) = τ (v0 , v0 ) = = ±1, τ (v1 , v1 ) = τ (v1 , v1 ) = 0, τ (v0 , v0 ) = 0, τ (v1 , v1 ) = 1;

x = x(a) : v0 → a(v1 − v1 ), v1 → a(v0 − v0 ),

v0 → a(v1 + v1 ), v1 → a(v0 + v0 ), a ∈ R; h1 = {x(a); a ∈ R}, |W (G, h1 )| = 2;

the non-trivial element of W (G, h1 ) maps x(a) to x(−a);

(c)

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

467

V0 = (Ru0 ⊕ Rv0 ) ⊕ (Ru0 ⊕ Rv0 ),

V1 = (Ru1 ⊕ Rv1 ) ⊕ (Ru1 ⊕ Rv1 ), D = R,

the spaces in parenthesis are isotropic, and τ (u0 , u0 ) = τ (v0 , v0 ) = τ (u1 , u1 ) = τ (v1 , v1 ) = 1; x = x(a, b) : u0 → au1 + bv1 , u1 → au0 + bv0 , v0 → −bu1 + av1 , v1 → −bu0 + av0 ,

(d)

u0 → −au1 + bv1 , u1 → au0 − bv0 ,

v0 → −bu1 − av1 , v1 → bu0 + av0 , a, b ∈ R; h1 = {x(a, b); a, b ∈ R}, |W (G, h1 )| = 8; the non-trivial elements of W (G, h1 ) map x(a, b) to

x(−a, b), x(a, −b), x(−a, −b), x(b, a), x(−b, a), x(b, −a), x(−b, −a); Type II V0 = Dv0 , V1 = Dv1 ; x(a) : v0 → av1 , v1 → av0 , a ∈ D; if D = R or C, then h1 = {x(a); a ∈ D}, |W (G, h1 )| = 2; the non-trivial element of W (G, h1 ) maps x(a) to x(−a);

(e)

if D = H, then h1 = {x(a); a ∈ C}, |W (G, h1 )| = 4; the non-trivial elements of W (G, h1 ) map x(a) to x(−a), x(ι(a)), x(−ι(a));

V0 = Rv0 , V1 = Rv1 ; x(a) : v0 → av1 , v1 → −av0 ; h1 = {x(a); a ∈ R}, |W (G, h1 )| = 2;

(f)

the non-trivial element of W (G, h1 ) maps x(a) to x(−a);

V0 = Ru0 ⊕ Rv0 , V1 = Ru1 ⊕ Rv1 , D = R; x = x(a, b) : u0 → au1 + bv1 , u1 → au0 + bv0 , v0 → −bu1 + av1 , v1 → −bu0 + av0 ; h1 = {x(a, b); a, b ∈ R}, |W (G, h1 )| = 4;

(g)

the non-trivial elements of W (G, h1 ) map x(a, b) to x(−a, b), x(a, −b), x(−a, −b); Proof. This Proposition is a straightforward consequence of Theorems 5.2, 5.3 and the proof of Theorem 4.4, (see the section 13).

468

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

In general, a Cartan subspace h1 ⊆ g1 induces a direct sum decomposition V = V 0 ⊕V 1 ⊕ V 2 ⊕ ... ⊕ V i1 ⊕V i1 +1 ⊕ V i1 +2 ⊕ ... ⊕ V i2

(6.1)

... ⊕V ik−1 +1 ⊕ V ik−1 +2 ⊕ ... ⊕ V ik , orthogonal in the type I case, into graded subspaces preserved by h1 , such that (a) h1 (V 0 ) = 0, (b) for each 0 ≤ i ≤ ik there is x ∈ h1 such that (x, V i ) is indecomposable, (c) there is x ∈ h1 such that the elements (x, V j ), (x, V k )

(6.2)

are indecomposable and similar if and only if there is 1 ≤ l ≤ k − 1, with il < j ≤ il+1 and il < k ≤ il+1 . Then the Weyl group W (G, h1 ) = (Si1 (W (G(V 1 ), h1 (V 1 )) × W (G(V 2 ), h1 (V 2 )) × ... × W (G(V i1 ), h1 (V i1 )))) ×(Si2 −i1 (W (G(V i1 +1 ), h1 (V i1 +1 )) × W (G(V i1 +2 ), h1 (V i1 +2 )) × ... × W (G(V i2 ), h1 (V i2 ))))

(6.3)

... ×(Sik −ik−1 (W (G(V ik−1 +1 ), h1 (V ik−1 +1 )) × W (G(V ik−1 +2 ), h1 (V ik−1 +2 )) × ... × W (G(V ik ), h1 (V ik )))), with the action on h1 compatible with the decomposition (6.3). (Here Sm stands for the group of all permutations of m objects.) Proposition 6.2, together with (6.1), imply that there are ﬁnitely many conjugacy classes of Cartan subspaces in g1 and that any Cartan subspace consists of elements which commute in End(V ). Also, it is easy to see from Deﬁnition 6.1 that each semisimple element of g1 belongs to a Cartan subspace of g1 . Also, the set or regular elements coincides with the set where certain determinants don’t vanish, hence it is open and dense. We record these facts in the following proposition. Proposition 6.3. There are ﬁnitely many G-conjugacy classes of Cartan subspaces in g1 . Every semisimple element of g1 belongs to the G-orbit through an element of a Cartan subspace. The set of regular semisimple elements is dense in g1 . Any two elements of a Cartan subspace h1 ⊆ g1 commute as endomorphisms of V . The following lemma shall be veriﬁed at the end of section 13. Lemma 6.4. For any two commuting regular semisimple elements x, y ∈ g1 we have 2

2

2

2

(a) gx1 = gy1 ; (b) gx1 = gy1 ; (c) x g1 = y g1 ; (d) gx0 = gy0 ; (e) gx0 = gy0 .

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

469

Let h1 ⊆ g1 be a Cartan subspace and let hreg 1 ⊆ h1 be the subset of regular elements. 2 Denote by h1 = {h1 , h1 } the linear span of the elements {x, y}, where x, y ∈ h1 . Deﬁne y y h2 h1 y g1 = g1 , gh11 = g1 , gi 1 = gi , (i = 0, 1). (6.4) y∈h1

y∈h1

y∈h21

Lemma 6.5. For any x ∈ hreg 1 , (a)

h1

h2

g1 = x g1 , (b) gh11 = gx1 = h1 , (c) gi 1 = gxi

2

(i = 0, 1).

Proof. In the deﬁnition (6.4), it suﬃces to take the ﬁnite intersection over the y’s which form a basis of the corresponding linear space. For the equations (a), (b) we may choose a basis of h1 consisting of regular elements. Then the equalities follow from Lemma 6.4. Since the elements of h1 commute, the space h21 is spanned by the squares y 2 , y ∈ h1 . Thus we may choose a basis y12 , y22 ,..., of h21 such that each yi ∈ hreg 1 . Then the equality (c) also follows from Lemma 6.4. 0 + Proposition 6.6. Let x ∈ hreg = xV , as in Theorem 4.4(b). 1 . Set V = ker(x) and V Then

h1 = g1 (V + )h1 = g1 (V + )x ; h1

g1 = h21

h1

g1 (V ); h21

g1 = g1 (V + ) h2 g01

(a)

+

(b) x2

= g1 (V + ) ;

(c)

= g0 (V 0 ) ⊕ h0 ,

(d) + h21

+

where h0 = g0 (V )

is a Cartan subalgebra of g0 (V ),

and the sum is orthogonal; gh01

= g0 (V 0 ) ⊕ h21 ;

(e)

the restriction of the form , to h0 is non-degenerate and h0 =

Sh21

⊕

h21

(f)

is a complete polarization.

Proof. Parts (a), (b), (c) are immediate from Theorem 4.4 and Lemma 6.5. Similarly we have the orthogonal decomposition in (d). A straightforward computation based on Theorem 4.4(a) shows that 2

dim g0 (V + )x = 2 dim g1 (V + )x . By Theorem 4.4(a) and 4.4(c), dim g1 (V + )x = min{rank g0 (V0+ ), rank g0 (V1+ )}. Since g0 (V + ) = g0 (V0+ ) ⊕ g0 (V1+ ), we see that the restriction of x2 to V + is a regular element of g0 (V + ). Hence, h0 is a Cartan subalgebra of g0 (V + ) and (d) follows.

470

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

For (e) we may assume that V 0 = 0 and that (x, V ) is indecomposable. Then the equality follows from Proposition 6.2 via a case by case analysis. Similarly we check (f). Lemma 6.7. For any x ∈ hreg the following map is a linear bijection 1 (Sgh01 )⊥ y → [y, x] ∈ (h1 g1 )⊥ .

(a)

The map (a) intertwines the adjoint action of h21 on both spaces. The following are direct sum decompositions into the trivial and the non-trivial h21 -components: (Sgh01 )⊥ = Sh21 ⊕ g0 (V 0 )⊥ ∩ h⊥ 0, h2

(h1 g1 )⊥ = h1 g1 ⊕ (g11 )⊥ .

(b)

(c)

The map (a) restricts to bijections Sh21 y → [y, x] ∈ h1 g1 ,

(d) h2

1 ⊥ g0 (V 0 )⊥ ∩ h⊥ 0 y → [y, x] ∈ (g1 ) .

(e)

Proof. We see from Proposition 6.6(e) that (Sgh01 )⊥ = g0 (V 0 )⊥ ∩ (Sh21 )⊥ . The h21 -trivial component of this space is g0 (V 0 )⊥ ∩ (Sh21 )⊥ ∩ h0 = Sh21 . This veriﬁes (b). We see from (b), and from Proposition 6.6(e), that g0 = (Sgh01 )⊥ ⊕ gh01 . Since h1 g1 = x g1 , Lemma 3.1 implies that the map (a) is well deﬁned and surjective. Let z ∈ h1 and let x, y be as in (a). Then [z 2 , [y, x]] = [[z 2 , y], x] + [y, [z 2 , x]] = [[z 2 , y], x], because [z 2 , x] = 0. Hence, the map (a) is h21 -intertwining, and the proof of (a) is complete. Part (c) is clear from Theorem 4.4. Parts (d) and (e) follow from the intertwining property of the map (a). The derivative of the map h1 x → x2 ∈ h21

(6.5)

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

471

at x ∈ h1 , coincides with the following linear map h1 y → {y, x} ∈ h21 ,

(6.6)

which, by Proposition 2.4, is adjoint to the map Sh21 y → [x, y] ∈ Sh1 .

(6.7)

(Notice that the range of the map (6.7) is contained in Sh1 . Indeed, let x, z ∈ hreg 1 . Then 2 2 2 2 2 2 [Sz , x] ∈ [g0 , g1 ] ⊆ g1 = Sg1 and [Sz , x] = S(z x+xz ) = S2z x. Clearly z x commutes with x. Thus [Sz 2 , x] ∈ Sgx1 . Furthermore z 2 x|V 0 = 0. Therefore [Sz 2 , x] ∈ Sh1 .) Hence, |det(h1 y → {y, x} ∈ h21 )| = |det(Sh21 y → [y, x] ∈ Sh1 )|.

(6.8)

Deﬁne polynomials Dj (x), x ∈ g1 , by 2

det(tI − ad(x )|g1 ) =

R

(x ∈ g1 ),

tj Dj (x)

(6.9)

j=r

where R = dim(g1 ), DR = 1, and r ≥ 0 is the smallest integer such that Dr is not identically equal zero. Lemma 6.8. For x ∈ h1 we have |Dr (x)| = |det(ad(x2 )|

h2

(g1 1 )⊥

)| h2

1 ⊥ 2 = |det(g0 (V 0 )⊥ ∩ h⊥ 0 y → [y, x] ∈ (g1 ) )|

= |det(ad(x2 )|g0 (V 0 )⊥ ∩h⊥0 )|.

Proof. The ﬁrst equality is clear from (6.9). The map ad(x2 )|

h2 (g1 1 )⊥

h2

h2

: (g1 1 )⊥ → (g1 1 )⊥

coincides with (−1 times) the composition of the following two maps: h2

(g1 1 )⊥ y → {y, x} ∈ g0 (V 0 )⊥ ∩ h⊥ 0, and

h2

1 ⊥ g0 (V 0 )⊥ ∩ h⊥ 0 y → [y, x] ∈ (g1 ) ,

(6.10)

(6.11)

which, by Proposition 2.4, are adjoint to each other. Hence the second equality follows. Similarly the map 0 ⊥ ⊥ ad(x2 ) : g0 (V 0 )⊥ ∩ h⊥ 0 → g0 (V ) ∩ h0 is (−1 times) the composition of the maps (6.10) and (6.11), and the third equality follows.

472

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

Recall the function J(x), equal to the absolute value of the determinant of the map (3.7): J(x) = |det(g0 /gx0 y + gx0 → [y, x] ∈ [g0 , x])|, (x ∈ g1 ). (6.12) Corollary 6.9. For x ∈ h1 we have J(x) = |det(h1 y → {y, x} ∈ h21 )| · |Dr (x)|1/2 . Proof. Let x ∈ hreg 1 . By Proposition 6.6(e), gx0 = g0 (V 0 ) ⊕ h21 . Therefore Proposition 6.6(e) and Proposition 6.6(f)) imply (Sgx0 )⊥ = (g0 (V 0 ) ⊕ Sh21 )⊥ = g0 (V 0 )⊥ ∩ (Sh21 )⊥ 2 = (g0 (V 0 )⊥ ∩ h⊥ 0 ) ⊕ Sh1 .

But we see from Proposition 6.6(d) and Proposition 6.6(f) that 2 2 g0 = g0 (V 0 ) ⊕ (g0 (V 0 )⊥ ∩ h⊥ 0 ) ⊕ Sh1 ⊕ h1

2 = (g0 (V 0 ) ⊕ h21 ) ⊕ (g0 (V 0 )⊥ ∩ h⊥ 0 ⊕ Sh1 ),

where the middle direct sum is orthogonal. Thus, x g0 = gx0 ⊕ (gx0 )⊥ = (Sh21 ⊕ g0 (V 0 )⊥ ∩ h⊥ 0 ) ⊕ g0 .

(6.13)

Furthermore, by (3.1), (4.4) and Proposition 6.6(b), h2

[g0 , x] = (x g1 )⊥ = x g1 ⊕ g1 (V 0 ) = x g1 (V + ) ⊕ g1 (V 0 ) = h1 g1 ⊕ (g11 )⊥ . Hence, by Lemma 6.7, J(x) is the absolute value of the determinant of the map Lemma 6.7(d) times the absolute value of the determinant of the map Lemma 6.7(e): h2

1 ⊥ J(x) = |det(Sh21 y → {y, x} ∈ Sh1 )| · |det(g0 (V 0 )⊥ ∩ h⊥ 0 y → [y, x] ∈ (g1 ) )|.

Hence our formula for J(x) follows from (6.8) and Lemma 6.8.

Example 6.10. The dual pair O2n (C), Sp2n (C) For i = 0, 1 choose a basis vi1 , vi2 , ..., vin , vi1 , vi2 , ..., vin

of the vector space Vi such that τ (vik , vik )=1

(k = 1, 2, 3, ..., n)

and all the other pairings are zero. Let h1 be the Cartan subspace consisting of elements x(a), a ∈ Cn , such that x(a) :v0k → ak v1k , v1k → ak v0k , → −ak v1k , v1k → ak v0k , v0k

(k = 1, 2, 3, ..., n).

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

473

Then det(h1 y → {y, x} ∈

h21 )

=

(2ak ),

k=1

Dr (x(a)) =

n

2 n √ (a2j − a2k )(a2j + a2k ) · ( 2ak ) , and r = 4n. j=k

k=1

Example 6.11. The dual pair GLn (C), GLn (C). For i = 0, 1 choose a basis vi1 , vi2 , ..., vin of the vector space Vi . Let h1 be the Cartan subspace consisting of elements x(a), a ∈ Cn , such that x(a) : v0k → ak v1k , v1k → ak v0k ,

(k = 1, 2, 3, ..., n).

Then det(h1 y → {y, x} ∈ Dr (x(a)) =

h21 )

=

n

(2ak ),

k=1

2 (a2j − a2k ) , and r = 2n. j=k

reg Corollary 6.12. For a Cartan subspace h1 ⊆ g1 let h+ be a (measurable) funda1 ⊆ h1 mental domain for the action of the Weyl group W (G, h1 ). Let Q : g1 x → x2 ∈ g0 . Let greg,ss ⊂ g1 denote the subset of regular semisimple elements. Then for f ∈ Cc (greg,ss ) 1 1

1 f (x) dx = |W (G, h1 )| g1 h1 = J(x) h1

=

h+ 1

Qh+ 1

Qh+ 1

hreg 1

J(x)

.

G/Gh 1

f (gx) dg dx

.

G/Gh 1 −1

f (gx) dg dx 1/2

|Dr (Q (x))|

G/Gh 1

.

f (gQ−1 (x)) dg dx,

where the summation is over a maximal family of mutually non-conjugate Cartan subspaces h1 ⊆ g1 . Proof. The ﬁrst equality follows from the fact that the absolute value of Jacobian of the map reg,ss h1 G/Gh1 × hreg 1 (gG , x) → gx ∈ g1

at (gGh1 , x) is equal to J(x). The second equality is obvious and the third one follows from Corollary 6.21.

474

7

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

A canonical complementary subspace to the tangent space of an orbit

Here we adopt the view point of Harish-Chandra, [4, section 14], that the complementary subspace in Theorem 4.7 should be the orthogonal with respect to a natural positive deﬁnite form, (see (7.13) below). Recall the symmetric positive deﬁnite form ( , ) = − , θ on g deﬁned in (3.5), and the adjoint map: End(g) A → A† ∈ End(g), (A(x), y) = (x, A† (y)), x, y ∈ g.

(7.1)

Lemma 7.1. For the adjoint representation ad : g → End(g), we have −ad(θ(x)) if x ∈ g0 , ad(x)† = ad(θ(x)) if x ∈ g1 . Proof. Let x, y, z ∈ g0 . Then (ad(x)y, z) = −[x, y], θz = −y, −[x, θz] = −y, −θ[θx, z] = (y, −ad(θx)z). Let x ∈ g0 , and let y, z ∈ g1 . Then the above computation applies without any change. Hence the formula follows for x ∈ g0 (as is well known [4, Lemma 27]). If x ∈ g1 , and either y, z ∈ g0 or y, z ∈ g1 , then all the pairings in question are zero. Let x ∈ g1 , y ∈ g1 and z ∈ g0 . Then, by Proposition 2.4, (ad(x)y, z) = −{x, y}, θz = −y, [x, θz] = −y, θ[θx, z] = (y, ad(θx)z). Let x ∈ g1 , y ∈ g0 and z ∈ g1 . Then, by Proposition 2.4, (ad(x)y, z) = −[x, y], θz = θz, [x, y] = {x, θz}, y = −y, {x, θz} = −y, θ{θx, z} = (y, ad(θx)z).

This veriﬁes the second formula.

Let x ∈ g0 . Then the ( , )-orthogonal complement to [g0 , x] in g0 is equal to θ(gx0 ) = Thus (7.2) g0 = [g0 , x] ⊕ θ(gx0 ).

θ(x) g0 .

In particular, if h0 ⊆ g0 is a Cartan subalgebra, then g0 = [g0 , x] ⊕ θ(h0 )

(x ∈ hreg 0 ),

(7.3)

which provides a geometric interpretation for the notion of a θ-stable Cartan subalgebra. If x ∈ g0 is nilpotent and such that {x, y = −θ(x), h = [x, y]} is a Cayley triple, [2], then (7.2) may be rewritten as g0 = [g0 , x] ⊕ gy0 . (7.4)

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

475

Moreover, if l = Rx ⊕ Ry ⊕ Rh, then by Lemma 7.1, ad(l) ⊆ End(g0 ) is a self adjoint family of operators. Hence g0 decomposes into a direct, ( , )- orthogonal sum of irreducible components, which have managable structure because the Lie algebra l is isomorphic to sl(2, R). Furthermore, g0 = [g0 , x] ⊕ gx0 (7.5) is another ( , )-orthogonal decomposition, and the map [g0 , x] × gy0 (v, w) → [v, x] + w ∈ g0

(7.6)

is a linear bijection. Consider an element x ∈ g1 . Then θ(x g1 ) = θ(x) g1 is the ( , )-orthogonal complement of [g0 , x] in g1 . Thus g1 = [g0 , x] ⊕ θ(x g1 ). (7.7) In particular, if h1 ⊆ g1 is a Cartan subspace, then g1 = [g0 , x] ⊕ θ(h1 g1 )

(x ∈ hreg 1 ).

(7.8)

Moreover, if ( )⊥ denotes the orthogonal complement with respect to the form , , then θ(x) θ((gx0 )⊥ ) = (g0 )⊥ is the ( , )-orthogonal complement of gx0 , so that g0 = θ((gx0 )⊥ ) ⊕ gx0 ,

(7.9)

and the following map is a linear bijection θ((gx0 )⊥ ) × θ(x g1 ) (v, w) → [v, x] + w ∈ g1 .

(7.10)

As shown by Harish-Chandra, [4, sections 13 and 14], the Jacobson-Morozov theorem and a theorem of Mostow imply that for a nilpotent orbit O ⊆ g there is an element x ∈ O such that the Lie algebra generated by x and θ(x) is isomorphic to sl(2, R). This may be deduced directly from the classiﬁcation Theorems 5.4 and 5.5, and motivates the following problem. Problem 7.2. Let O ⊆ g∞ be a non-zero nilpotent orbit. For an element x ∈ O let s(x) ⊆ g be the Lie sub- superalgebra generated by x and θ(x). Let nO = min{dim s(x); x ∈ O}. Describe all the s(x) with dim s(x) = nO . Remark 7.3. With the notation of Problem 7.2, suppose (x, V ) is indecomposable. Then one can show, using the classiﬁcation Theorems 4.4 and 4.5, that for x = 0, the height of x is even if and only if there is (a possibly diﬀerent) x ∈ O such that [{x, θ(x)}, x] = x. Consequently s(x) is isomorphic to (o1 , sp2 (R)) as a dual pair, and (x2 , −θ(x)2 , [x2 , −θ(x)2 ]) is a Cayley triple.

476

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

Notice that the adjoint representation maps g into the ortho- symplectic Lie subsuperalgebra osp(g) ⊆ End(g)0 ⊕ End(g)1 , deﬁned as in (2.8) with the τ replaced by , . Let x ∈ g1 be nilpotent. Then ad(x) ∈ osp(g)1 is nilpotent. Hence the classiﬁcation Theorems 4.4 and 4.5, applied to osp(g) provide a decomposition of g into a direct orthogonal sum of ad(x)-indecomposables. Moreover θ((gx0 )⊥ ) = θ((g0 ∩ ker(ad(x))⊥ ) = θ((g0 ∩ ad(x)(g)) = ad(θ(x))(g1 ) = {θ(x), g1 }.

(7.11)

Therefore (7.11) may be rewritten as {θ(x), g1 } × θ(x g1 ) (v, w) → [v, x] + w ∈ g1 .

(7.12)

The space W of Theorem 4.7 may be taken to be θ(x g1 ) and the local coordinates around x are provided by the map {θ(x), g1 } × θ(x g1 ) (v, w) → [v, x + u] + w ∈ g1

8

Let

(u ∈ θ(x g1 )).

(7.13)

A proof of Theorem 5.1 for x semisimple and (G, g) of type II, and a proof of Theorem 5.3 ⎧ C ⎪ ⎨ V , the complexiﬁcation of V, if D = R, U = V, if D = C, ⎪ ⎩ V, viewed as a vector space over C, if D = H.

Since x is semisimple we have a direct sum decomposition into eigenspaces: U= U λ , xu = λu, u ∈ U λ .

(8.1)

λ

Let L be a set of eigenvalues of x such that L ∩ (−L) = ∅ and L ∪ (−L) is the set of all non-zero eigenvalues of x. Since SU λ = U −λ , (see (2.2) for S), U = U0 ⊕ (U λ ⊕ U −λ ) (8.2) λ∈L

is a direct sum decomposition into Z/2Z-graded subspaces preserved by x. The space U 0 is either zero or decomposes into a direct sum of one dimensional graded subspaces U= U 0,k . (8.3) k

For each λ ∈ L let Uλ =

l

U λ,l

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

be a direct sum decomposition into one-dimensional subspaces. Then U λ ⊕ SU λ = (U λ,l ⊕ SU λ,l )

477

(8.4)

l

is a direct sum decomposition into graded x-invariant subspaces, which does not admit any ﬁner decomposition of this type. Thus each term U λ,l ⊕ SU λ,l is indecomposable under the action of x and S. This veriﬁes Theorem 5.1 for (G, g) of type II and D = C. Fix λ and l as in (8.4), and let uλ ∈ U λ,l be a non-zero vector. Set v0 = uλ + Suλ and v1 = uλ − Suλ . Then Sv0 = v0 , Sv1 = −v1 , xv0 = λv1 , xv1 = λv0 .

(8.5)

Thus Theorem 5.3, for D = C, follows. Let D = R. Let U u → u ∈ U be the complex conjugation with respect to the real form V ⊆ U . Then for each eigenvalue λ of x we have U λ = U λ.

(8.6)

We may split the set of eigenvalues of x into a disjoint union {0} ∪ LR ∪ (−LR ) ∪ LC ∪ (−LC ), where the elements λ ∈ LR are such that λ2 ∈ R, and the elements λ ∈ LC are such that λ2 ∈ C \ R, so that the four complex numbers λ, −λ, λ, −λ are distinct. Thus U = U0 ⊕ (U λ ⊕ U −λ ) ⊕ (U λ ⊕ U −λ ⊕ U λ ⊕ U −λ ). (8.7) λ∈LR

λ∈LC

Each summand in (8.7) is invariant under x, S, and the complex conjugation. The terms U 0 and U λ ⊕ U −λ , with λ ∈ R \ 0, may be treated as in the case D = C. The indecomposable summands of U λ ⊕ U −λ are described in part (a) of Theorem 5.3. Suppose λ = iξ ∈ iR \ 0. The, with the notation (8.5), x : v0 + v 0 → ξi(v1 − v 1 ) → −ξ(v0 + v 0 ). Hence, in this case, the indecomposable summands of U λ ⊕ U −λ are described in part (a’) of Theorem 5.3. Consider λ ∈ LC . Let Uλ = U λ,l l

be a direct sum decomposition into one dimensional subspaces. Then U λ ⊕ U −λ ⊕ U λ ⊕ U −λ = (U λ,l ⊕ U −λ,l ⊕ U λ,l ⊕ U −λ,l )

(8.8)

l

is a direct sum decomposition into (x, S, u → u invariant) subspaces of minimal possible dimension (equal 4). This veriﬁes Theorem 5.1 for for (G, g) of type II and D = R.

478

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

Fix λ ∈ LC and l as in (8.8). Let uλ ∈ U λ,l be a non-zero vector. Set uλ = uλ , u−λ = Suλ , and u−λ = u−λ . Then xuλ = λuλ , xu−λ = −λu−λ , xuλ = λuλ , xu−λ = −λu−λ .

(8.9)

Let λ = ξ + iη, with ξ, η ∈ R. Then ξη = 0. We see from (8.9) that x :uλ + u−λ + uλ + u−λ → λ(uλ − u−λ ) + λ(uλ − u−λ ), uλ − u−λ + uλ − u−λ → λ(uλ + u−λ ) + λ(uλ + u−λ ), uλ + u−λ − uλ − u−λ → λ(uλ − u−λ ) − λ(uλ − u−λ ),

(8.10)

uλ − u−λ − uλ + u−λ → λ(uλ + u−λ ) − λ(uλ + u−λ ). Set

u0 = uλ + u−λ + uλ + u−λ , v0 = i(uλ + u−λ − uλ − u−λ ), u1 = uλ − u−λ + uλ − u−λ , v1 = i(uλ − u−λ − uλ + u−λ ).

We see from (8.10) that x :u0 → ξu1 + ηv1 , u1 → ξu0 + ηv0 , v0 → −ηu1 + ξv1 , v1 → −ηu0 + ξv0 .

(8.11)

The formulas (8.11) are consistent with the formulas of part (b) of Theorem 5.3. This veriﬁes Theorem 5.3 for D = R. Let D = H. Since jU λ = U λ , each summand in the decomposition (8.7) is a vector space over H. Let λ ∈ LR ∪ LC . Pick a non-zero vector uλ ∈ U λ . Let v0 = uλ + Suλ , v1 = uλ − Suλ . Then xv0 = λv1 , xv1 = λv0 ,

(8.12)

Hv0 + Hv1 ⊆ U

(8.13)

and is a graded x-invariant subspace over H. The non-zero part of the right hand side of (8.7) may be grouped into a direct sum of spaces of the form (8.13). This veriﬁes Theorems 5.1 and 5.3 for D = H.

9

A proof of Theorem 5.1 for x semisimple and (G, g) of type I, and a proof of Theorem 5.2

We consider four cases: (D = C, ι = 1), (D = C, ι = 1), (D = H, ι = 1) and (D = R, ι = 1). Case (D = C, ι = 1). Let V = Vλ (9.1) λ

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

479

be the decomposition of V into the eigenspaces for x (xv = λv for v ∈ V λ ). For two eigenvalues λ, μ and the corresponding eigenvectors v λ , v μ , we have μ2 τ (v μ , v λ ) = τ (x2 v μ , v λ ) = τ (v μ , −x2 v λ ) = τ (v μ , v λ )(−λ2 ). Thus V λ ⊥ V μ if μ2 + λ2 = 0.

(9.2)

Let us decompose the set of non-zero eigenvalues of x into a disjoint union L ∪ (−L) ∪ iL ∪ (−iL). Then, by (9.2), V =V0⊕

(V λ ⊕ V −λ ⊕ V iλ ⊕ V −iλ )

(9.3)

λ∈L

is a direct sum orthogonal decomposition into graded subspaces preserved by x. For λ ∈ L, pick a non-zero vector v λ ∈ V λ , and a non-zero vector v iλ ∈ V iλ . Let v0 = v λ + Sv λ , v1 = v λ − Sv λ , v0 = i(v iλ + Sv iλ ), v1 = v iλ − Sv iλ . Then x : v0 → λv1 , v1 → λv0 , v0 → −λv1 , v1 → λv0 .

(9.4)

Hence, λτ (v1 , v1 ) = τ (xv0 , v1 ) = τ (v0 , Sxv1 ) = τ (v0 , Sλv0 ) = τ (v0 , v0 )λ. Thus τ (v1 , v1 ) = τ (v0 , v0 ).

(9.5)

We may multiply v0 and v1 by the same complex number to ensure τ (v1 , v1 ) = τ (v0 , v0 ) = 1.

(9.6)

Notice that τ (v0 , v0 )λ = τ (v0 , xv1 ) = τ (−Sxv0 , v1 ) = τ (−Sλv1 , v1 ) = λτ (v1 , v1 ) and λτ (v0 , v0 ) = τ (xv1 , v0 ) = τ (v1 , Sxv0 ) = τ (v1 , Sλv1 ) = τ (v1 , v1 )(−λ). Hence τ (v0 , v0 ) = τ (v1 , v1 ) = 0. Similarly τ (v0 , v0 )λ = τ (v0 , xv1 ) = τ (−Sxv0 , v1 ) = τ (Sλv1 , v1 ) = −λτ (v1 , v1 ) and λτ (v0 , v0 ) = τ (xv1 , v0 ) = τ (v1 , Sxv0 ) = τ (v1 , −Sλv1 ) = τ (v1 , v1 )λ.

(9.7)

480

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

Hence τ (v0 , v0 ) = τ (v1 , v1 ) = 0.

(9.8)

Cv0 ⊕ Cv0 ⊕ Cv1 ⊕ Cv1 ⊆ V λ ⊕ V −λ ⊕ V iλ ⊕ V −iλ

(9.9)

The subspace is graded and x-invariant and the action of x on this subspace is consistent with the formulas of part (a) of Theorem 5.2. Moreover it is clear that we may decompose the right hand side of (9.9) into direct sum of such subspaces. This veriﬁes Theorems 5.1 and 5.2. Case (C, ι = 1). Here, instead of (9.2) we have 2

V λ ⊥ V μ if μ2 + λ = 0,

(9.10)

where λ = ι(λ). We decompose the set of non-zero eigenvalues of x into a disjoint union LiR ∪ (−LiR ) ∪ LC ∪ (−LC ) ∪ iLC ∪ (−iLC ), where λ2 ∈ iR \ 0 for λ ∈ LiR , and λ2 ∈ C \ iR for λ ∈ LC . Then by (9.10), V =V0⊕ (V λ ⊕ V −λ ) ⊕ (V λ ⊕ V −λ ⊕ V iλ ⊕ V −iλ ) λ∈LiR

(9.11)

λ∈LC

is a direct sum orthogonal decomposition into graded x-invariant subspaces. For λ ∈ LiR , pick a non-zero vector v λ ∈ V λ . Let v0 = v λ + Sv λ , v1 = v λ − Sv λ . Then x : v0 → λv1 , v1 → λv0 .

(9.12)

Moreover, λτ (v0 , v0 ) = τ (xv1 , v0 ) = τ (v1 , Sxv0 ) = τ (v1 , Sλv1 ) = τ (v1 , v1 )(−λ). Thus

λ τ (v1 , v1 ) = − τ (v0 , v0 ), λ

where −

λ λ2 = − 2 = −i sgn(im(λ2 )). |λ| λ

(9.13)

(9.14)

Since τ0 is hermitian, τ (v0 , v0 ) ∈ R \ 0. Thus we may multiply v0 and v1 by the same positive real number so that (by (9.13) and (9.14)) τ (v0 , v0 ) = = ±1, τ (v1 , v1 ) = δi = ±i, with − δ = sgn(im(λ2 )).

(9.15)

Clearly Cv0 ⊕ Cv1 ⊆ V λ ⊕ V −λ

(9.16)

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

481

is a graded, x-invariant subspace described in part (b) of Theorem 5.2. This subspace is indecomposable and the right hand side of (9.16) decomposes into an orthogonal direct sum of such subspaces. Let λ ∈ LC and let v λ ∈ V λ be a non-zero vector. Set v0 = v λ + Sv λ , v1 = v λ − Sv λ , v0 = i(v iλ + Sv iλ ), v1 = v iλ − Sv iλ . Then x : v0 → λv1 , v1 → λv0 , v0 → −λv1 , v1 → λv0 .

(9.17)

Furthermore λτ (v1 , v1 ) = τ (xv0 , v1 ) = τ (v0 , Sxv1 ) = τ (v0 , v0 )λ so that τ (v1 , v1 ) = τ (v0 , v0 ). As before we may scale the vectors v0 and v1 by the same number so that τ (v1 , v1 ) = τ (v0 , v0 ) = 1.

(9.18)

Moreover,

2

λ2 τ (v0 , v0 ) = τ (x2 v0 , v0 ) = τ (v0 , −x2 v0 ) = τ (v0 , v0 )(−λ ) and

2

λ2 τ (v1 , v1 ) = τ (x2 v1 , v1 ) = τ (v1 , −x2 v1 ) = τ (v1 , v1 )(−λ ), so that

2

2

(λ2 + λ )τ (v0 , v0 ) = (λ2 + λ )τ (v1 , v1 ) = 0, which implies τ (v0 , v0 ) = τ (v1 , v1 ) = 0.

(9.19)

τ (v0 , v0 ) = τ (v1 , v1 ) = 0.

(9.20)

Cv0 ⊕ Cv0 ⊕ Cv1 ⊕ Cv1 ⊆ V λ ⊕ V −λ ⊕ V iλ ⊕ V −iλ

(9.21)

Similarly, Clearly, is a graded, x-invariant subspace, as in part (a) of Theorem 5.2. This subspace is indecomposable and the right hand side of (9.21) decomposes into an orthogonal direct sum of such subspaces. Case (D = H, ι = 1). Here we view V as a vector space over C ⊆ H. Then the decomposition (9.1) holds and jV λ = V λ , (9.22) where λ = ι(λ). Furthermore, 2

V λ ⊥ V μ if μ2 + λ2 = 0 and μ2 + λ = 0.

(9.23)

482

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

Indeed, let v μ ∈ V μ , v λ ∈ V λ and let τ (v μ , v λ ) = α + jβ, with α, β ∈ C. Then μ2 (α + jβ) = μ2 τ (v μ , v λ ) = τ (x2 v μ , v λ ) = τ (v μ , −x2 v λ ) = τ (v μ , −λ2 v λ ) 2

2

= τ (v μ , v λ )(−λ ) = (α + jβ)(−λ ). Hence,

2

(μ2 + λ )α = 0 and (μ2 + λ2 )jβ = 0, and (9.23) follows. We decompose the set of non-zero eigenvalues of x into a disjoint union LiR ∪ (−LiR ) ∪ LR ∪ (−LR ) ∪ LC ∪ (−LC ),

(9.24)

where λ2 ∈ iR \ 0 for λ ∈ LiR , λ2 ∈ R \ 0 for λ ∈ LR , and λ2 ∈ C \ (iR ∪ R) for λ ∈ LC . Then by (9.22) and (9.23), (V λ ⊕ V −λ ⊕ V λ ⊕ V −λ ) V =V0⊕ λ∈LiR

⊕

(V λ ⊕ V −λ ⊕ V iλ ⊕ V −iλ )

(9.25)

λ∈LR

⊕

(V λ ⊕ V −λ ⊕ V iλ ⊕ V −iλ ⊕ V λ ⊕ V −λ ⊕ V −iλ ⊕ V iλ )

λ∈LC

is a direct sum orthogonal decomposition into graded x-invariant subspaces over H. Let λ ∈ LiR . Then (9.12) holds, and since τ (v0 , v0 ) ∈ R, (9.13) holds too. Therefore (9.15) holds and, instead of (9.16), we see that Hv0 ⊕ Hv1 ⊆ V λ ⊕ V −λ ⊕ V λ ⊕ V −λ

(9.26)

is a graded x-invariant subspace, as in part (c) of Theorem 5.2. This subspace is indecomposable and the right hand side of (9.26) decomposes into an orthogonal direct sum of such subspaces. For λ ∈ LR ∪ LC the argument (9.17)-(9.21) carries over. This completes the proof of both Theorems 5.1 and 5.2 in the case D = H. Case (D = R, ι = 1). Let VC = V C,λ (9.27) λ C

be the decomposition of V , the complexiﬁcation of V , into the eigenspaces for x. Let V C v → v ∈ V C be the complex conjugation with respect to the real form V ⊆ V C . The form τ extends uniquely to a complex linear form on V C . As before we check that V C,μ ⊥ V C,λ if μ2 + λ2 = 0.

(9.28)

Let LiR , LR , LC be as in (9.24). Then V C = V C,0 ⊕ (V C,λ ⊕ V C,−λ ⊕ V C,λ ⊕ V C,−λ ) ⊕

λ∈LiR

(V C,λ ⊕ V C,−λ ⊕ V C,iλ ⊕ V C,−iλ )

λ∈LR

⊕

λ∈LC

(V C,λ ⊕ V C,−λ ⊕ V C,iλ ⊕ V C,−iλ ⊕ V C,λ ⊕ V C,−λ ⊕ V C,−iλ ⊕ V C,iλ )

(9.29)

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

483

is an orthogonal direct sum decomposition into graded x-invariant subspaces invariant under the complex conjugation, V C u → u ∈ V C with respect to the real form V . Let λ ∈ LiR and let v λ ∈ V C,λ be a non-zero vector. Let u = v λ + Sv λ and v = v λ − Sv λ . Then x : u → λv, v → λu, u → λv, v → λu. Hence

(9.30)

λτ (u, u) = τ (xv, u) = τ (v, Sxu) = τ (v, −λv) = τ (v, v)(−λ), λτ (v, v) = τ (xu, v) = τ (u, Sxv) = τ (u, λu) = τ (u, u)λ.

Therefore τ (u, u) = τ (v, v) = τ (u, u) = τ (v, v) = 0.

(9.31)

Furthermore λτ (v, v) = τ (xu, v) = τ (u, Sxv) = τ (u, u)λ, so that τ (v, v) = τ (u, u)(−i)sgn(im(λ2 )).

(9.32)

Multiplying u and v by the same non-zero real number does not change (9.30), (9.31), (9.32). Thus we may assume τ (u, u) = /2, (9.33) where = ±1. Set v0 = u + u, v0 = i(u − u), v1 = v + v, v1 = i(v − v).

(9.34)

Then, by (9.31) and (9.33), τ (v0 , v0 ) = τ (u, u) + τ (u, u) = , τ (v0 , v0 ) = −τ (u, −u) − τ (−u, u) = ,

τ (v0 , v0 ) = τ (u, −iu) + τ (u, iu) = 0, τ (v1 , v1 ) = τ (v, v) + τ (v, v) = 0,

(9.35)

τ (v1 , v1 ) = τ (iv, −v) + τ (−v, iv) = 0,

τ (v1 , v1 ) = τ (v, −iv) + τ (v, iv) = −2iτ (v, v)

= −2τ (u, u)sgn(im(λ2 )) = −τ (v0 , v 0 )sgn(im(λ2 )).

We see from (9.30) and (9.34) that, with λ = ξ + iη, ξ, η ∈ R, x :v0 → ξv1 + ηv1 , v1 → ξv0 + ηv0 ,

v0 → −ηv1 + ξv1 , v1 → −ηv0 + ξv0 .

(9.36)

Notice that im(λ2 ) = 2ξη.

(9.37)

484

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

Thus the last formula in (9.35) may be rewritten as τ (v1 , v1 ) = δ, −δ = sgn(ξη).

(9.38)

Thus λ = ξ(1 − iδ) and if we replace v1 by δv1 then (9.36) coincides with x :v0 → ξ(v1 − v1 ), v1 → ξ(v0 − δv0 ),

v0 → δξ(v1 + v1 ), v1 → ξ(v0 + δv0 ).

(9.39)

Let g : v0 → v0 , v0 → δv0 , v1 → v1 , v1 → v1 . Then g ∈ G and gxg −1 acts according to the formula (c) of Theorem 5.2. The subspace Rv0 ⊕ Rv0 ⊕ Rv1 ⊕ Rv1 ⊆ V

(9.40)

is graded, x-invariant and indecomposable. Let λ ∈ LR . Then either λ ∈ R \ 0 or λ ∈ iR \ 0. Suppose λ ∈ R \ 0. Then the eigenspace V C,λ is closed under the complex conjugation. Hence we may chose a non-zero vector v λ ∈ V ∩ V C,λ . Let v iλ ∈ V ∩ V C,iλ be a non-zero vector such that v iλ = Sv iλ . Set v0 = v λ + Sv λ , v1 = v λ − Sv λ , v0 = v iλ + Sv iλ , v1 = −i(v iλ − Sv iλ ).

(9.41)

Then v0 = v0 , v1 = v1 , v0 = v0 , v1 = v1 ,

(9.42)

x : v0 → λv1 , v1 → λv0 , v0 → −λv1 , v1 → λv0 .

(9.43)

and Furthermore, λτ (v1 , v1 ) = τ (xv0 , v1 ) = τ (v0 , Sxv1 ) = τ (v0 , v0 )λ, so that τ (v1 , v1 ) = τ (v0 , v0 ).

(9.44)

Similarly we check that the vectors v0 , v0 , v1 , v1 are isotropic. Multiplying v0 and v1 by the same number we may assume that τ (v1 , v1 ) = τ (v0 , v0 ) = 1.

(9.45)

Rv0 ⊕ Rv0 ⊕ Rv1 ⊕ Rv1 ⊆ V

(9.46)

The subspace is graded, x-invariant and indecomposable. The formulas (9.43) are compatible with the formulas of part (a) of Theorem 5.2. Finally, let λ ∈ LC . Choose non-zero vectors v λ ∈ V C,λ , v iλ ∈ V C,iλ and let u = v λ + Sv λ , v = v λ − Sv λ , u = v iλ + Sv iλ , v = v iλ − Sv iλ .

(9.47)

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

Then

x :u → λv, v → λu, u → λv, v → λu, u → iλv , v → iλu , u → iλ v , v → iλ u .

We see from (9.28) and (9.24) that C,λ V + V C,−λ ⊥ V C,−iλ + V C,iλ , and C,λ C,λ C,−λ C,−λ V ⊥ V . +V +V

485

(9.48)

(9.49)

Thus by (9.29), the restriction of the form τ to V C,λ + V C,−λ + V C,iλ + V C,−iλ is non-degenerate. Hence we may choose the vectors v λ , v iλ so that τ (u, u ) = 0.

(9.50)

The following calculation λτ (v, v ) = τ (xu, v ) = τ (u, xv ) = τ (u, iλu ) = τ (u, u )iλ shows that τ (v, v ) = τ (u, u ). As before we may assume that 1 τ (v, v ) = τ (u, u ) = . (9.51) 2 Since τ is the complexiﬁcation of a real form the usual calculation using (9.24), (9.28) and (9.51) implies 1 τ (u, u ) = τ (u, u ) = , 2 τ (u, u) = τ (u, u) = τ (u, u ) = τ (u , u ) = 0, τ (u, u) = τ (u , u ) = 0, 1 τ (v, −iv ) = = , 2 τ (v, v) = τ (v, v) = τ (v, v ) = τ (v, v )

(9.52)

τ (v, iv )

= τ (v , v ) = τ (v , v ) = 0. Set

u0 = u + u, v0 = i(u − u), u0 = u + u , v0 = −i(u − u ),

u1 = v + v, v1 = i(v − v), u1 = −i(v − v ), v1 = −v − v .

(9.53)

Let λ = ξ + iη, ξ, η ∈ R, Then the formulas of part (d) of Theorem 5.2 hold. Furthermore the subspace Ru0 ⊕ Ru0 ⊕ Rv0 ⊕ Rv0 ⊕ Ru1 ⊕ Ru1 ⊕ Rv1 ⊕ Rv1 ⊆ V C,λ ⊕ V C,−λ ⊕ V C,iλ ⊕ V C,−iλ ⊕ V C,λ ⊕ V C,−λ ⊕ V C,−iλ ⊕ V C,iλ is graded, x-invariant and indecomposable, and the space on the right hand side of the inclusion decomposes into a direct sum of such subspaces. This completes our proof.

486

10

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

A proof of Theorem 4.2

We may assume that (xn , V ) is uniform and that V = F ⊕ xn F ⊕ x2n F ⊕ ...xm n F, where the subspace F is graded and xs -invariant. For each set of positive numbers a0 , a1 , ..., am (such that ak am−k = 1, 0 ≤ k ≤ m, in the type I case) the formula g(u0 + xn u1 + ... + xm n um ) (u0 , u1 , ..., um ∈ F )

= a0 u0 + a1 xn u1 + ... + am xm n um deﬁnes an element g ∈ Gxs . Furthermore, ak+1 k+1 x ui gxn g −1 : xkn ui → ak k

(ui ∈ F, 0 ≤ i, k ≤ m)

(10.1)

(10.2)

(l)

An elementary argument shows that there is a sequence ak , l = 1, 2, 3, ... of positive numbers such that for all 0 ≤ k ≤ m, (l)

→ lim

l→∞

ak+1 (l)

ak

= 0.

(10.3)

(l)

Let g (l) ∈ Gxs be as in (10.1), for the sequence ak . Then, by (10.2) and (10.3), g (l) xg (l)−1 = xs + g (l) xn (g (l) )−1 → xs as l → ∞.

11

A proof of Theorem 4.8

Let x1 ∈ g1 be semisimple and let V = V (0) ⊕ V (1) ⊕ V (2) ⊕ ... be such that each x1 |V (0) = 0 and for j ≥ 1, (x1 , V (j) ) is indecomposable, with x1 |V (j) = 0. We assume that the sum is orthogonal in the type I case. Then, by Theorems 5.2 and (j) (j) 5.3, (x21 , V0 ) and (x21 , V1 ), j ≥ 1, are indecomposable. Let x2 ∈ g1 be another semisimple elements such that Gx21 = Gx22 . Then, by the above argument, we may assume that x2 |V (0) = 0 and that for j ≥ 1 (x2 , V (j) ) is indecomposable with x2 |V (j) = 0. Hence we may assume that (x1 , V (j) ) and (x2 , V (j) ), j ≥ 1, are indecomposable. But then the Theorem 4.8 follows from Theorems 5.2 and 5.3 by inspection.

12

A proof of Theorem 4.3

Suppose X ∈ G1 is not semisimple. Then in the Jordan decomposition X = XS + XN , XN = 0. By Theorem 4.2, XS ∈ Cl(Gx). But XS ∈ / Gx. Thus the orbit Gx is not closed. Suppose X ∈ G1 is semisimple. Since the Gl(V )-orbit through X in End(V ) is closed, we see that Cl(Gx) is a union of semisimple orbits. Since Gx2 ⊆ G0 is closed, Theorem 4.8 implies that Cl(Gx) is a single orbit, hence is equal to Gx.

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

13

487

A proof of Theorem 4.4

Let x ∈ g1 be semisimple. We’ll say that V is isotypic for x, or that (x, V ) is isotypic, if (x, V ) decomposes into mutually similar indecomposable pieces. Two isotypic elements (x, V ) and (x , V ) are of diﬀerent types if the indecomposable pieces of (x, V ) are not similar to the indecomposable pieces of (x , V ). Lemma 13.1. Let (x, V ) and (x , V ) be two semisimple isotypic elements of diﬀerent types. Then the only y ∈ Hom(V , V ) such that xy + yx = 0 is y = 0. Proof. We may assume that the elements (x, V ), (x , V ) are indecomposable. We need to check that the map Hom(V , V ) y → xy + yx ∈ Hom(V , V ) is injective. The eigenvalues of this map are sums of the eigenvalues of x and x . By Theorems 5.2 and 5.3 these sums are not zero. Lemma 13.2. The anticommutant of g1 in g1 is zero:

g1

g1 = 0.

Proof. Suppose the Lie superalgebra g is of type II. Then g1 = Sg1 . Let x ∈ g1 g1 and let y ∈ g1 . Then Sy ∈ g1 and therefore {Sy, x} = 0. In particular, 0 = tr{Sy, x} = y, x . Thus x is orthogonal to g1 , and since the form , is non-degenerate, we see that x = 0. Suppose the Lie superalgebra g is of type I. In this case g1 ∩ Sg1 = 0, so we are forced to use a diﬀerent argument. We may assume that g is complex (and of type I). Then dim V0 ≥ 2 or dim V1 ≥ 2. Consider the case dim V0 ≥ 2. The second one is analogous. Let x ∈ g1 g1 , and let y ∈ g1 . Then, in terms of (2.12), we have (xy + yx)(v0 + v1 ) = (wx∗ wy + wy∗ wx )v0 + (wx wy∗ + wy wx∗ )v1 . Thus the condition xy +yx = 0 translates to wx∗ wy +wy∗ wx = 0. Hence, for any v0 , v0 ∈ V0 , τ0 (v0 , wx∗ wy v0 ) + τ0 (v0 , wy∗ wx v0 ) = 0, or equivalently, τ1 (wx v0 , wy v0 ) + τ1 (wy v0 , wx v0 ) = 0. Now ﬁx v0 ∈ V0 \ 0 and let v0 ∈ V0 \ 0 be such that the vectors v0 , v0 are linearly independent. Then {w(v0 ); w(v0 ) = 0, w ∈ Hom(V0 , V1 )} = V1 . Hence, wx (v0 ) is orthogonal to V1 with respect to the form τ1 . Since this form is nondegenerate, wx = 0, and therefore x = 0. Let V = V 0 ⊕ V 1 ⊕ V 2 ⊕ ...

488

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

be the isotypic decomposition of V , with respect to x, with V 0 = ker(x). Then xV = V 1 ⊕ V 2 ⊕ ... and the ﬁrst part of (b) follows. Let (G(V i ), g(V i )) be the restriction of (G, g) to V i . Then 2 2 2 2 Gx = G(V 0 )x × G(V 1 )x × G(V 2 )x × ..., 2

2

2

2

gx = g(V 0 )x ⊕ g(V 1 )x ⊕ g(V 2 )x ⊕ ..., where the summands are orthogonal with respect to the symplectic form , on g1 . 2 2 Moreover, G(V 0 )x = G(V 0 ) and g(V 0 )x = g(V 0 ). Hence, 2

2

2

Gx = G(V 0 ) × G(V 1 )x × G(V 2 )x × ..., 2

2

2

gx = g(V 0 ) ⊕ g(V 1 )x ⊕ g(V 2 )x ⊕ ... . This veriﬁes the second part of (b). We shall see in section 13 () that there is y ∈ x g1 such that for all i = j greater than or equal to 1, the restrictions (y, V i ), (y, V j ) are isotropic of diﬀerent types. Hence the lemmas 13.1 and 13.2 imply (x g1 )

g1 = (

x g (V 1 )) 1

g1 (V 1 ) ⊕ (

x g (V 2 )) 1

g1 (V 2 ) ⊕ ... .

Hence the proof of the last formula in (b) is reduced to the case when (x, V ) is isotypic and non-zero. Also, the above formula reduces the proof of (c), which we leave to the reader, to the case when (x, V ) is isotypic and non-zero. From now on, we assume that (x, V ) is isotypic and x = 0. We proceed via a case by case analysis according to Theorems 5.2 and 5.3. Case 5.2.a. Here D is arbitrary and the vector spaces Vi (i = 0, 1) have basis vi,1 , vi,2 , vi,3 , ..., vi,n , such that τ (v0,k , v0,k ) = τ (v1,k , v1,k )=1

(k = 1, 2, 3, ..., n)

(13.1)

and all the other pairings are zero. Further, → −ι(ξ)v1,k , v1,k → ι(ξ)v0,k . x = x(ξ) : v0,k → ξv1,k , v1,k → ξv0,k , v0,k

(13.2)

x2 : v0,k → ξ 2 v0,k , v1,k → ξ 2 v1,k , v0,k → −ι(ξ)2 v0,k , v1,k → −ι(ξ)2 v1,k .

(13.3)

Thus

2

Since ξ 2 = −ι(ξ)2 , the group Gx preserves each of the isotropic subspaces n

Dvi,k ⊆ Vi ,

k=1

n

Dvi,k ⊆ Vi

(i = 0, 1).

(13.4)

k=1

By Witt’s Theorem the restriction x2 n

G |

k=1

Dvi,k

n 2 = GL( Dvi,k )x k=1

(i = 0, 1).

(13.5)

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

489

Hence, G

x2

x2

x2

= G |V0 × G |V1 =

GLn (D) × GLn (D) if D = H or D = H and ξ 2 ∈ R, GLn (C) × GLn (C) if D = H and ξ 2 ∈ / R.

(13.6) Suppose w ∈ Hom(V0 , V1 ) commutes with x . Then by (13.3), there are elements ∗ ∗ , wkl , wkl ∈ D commuting with ξ 2 and such that wkl , wkl 2

n

w(v0,k ) = ∗

w (v1,k ) =

wkl v1,l ,

l=1 n

w(v0,k )

=

n

wkl v1,l ,

l=1 ∗ wkl v0,l ,

w

∗

(v1,k )

=

l=1

n

(13.7) ∗ wkl v0,l .

l=1

By (2.10) and (13.1) we have ∗ ) ι(wpk

= τ0 (v0,k ,

n

∗ wpl v0,l ) = τ0 (v0,k , w∗ (v1,p ))

l=1

=

τ1 (w(v0,k ), v1,p )

n = τ1 ( wkl v1,l , v1,p ) = wkp , l=1

and ∗ ) ι(wpk

=

∗ τ0 (v0,k , v0,k )ι(wpk )

=

τ0 (v0,k ,

n

∗ wpl v0,l ) = τ0 (v0,k , w∗ (v1,p ))

l=1

=

τ1 (w(v0,k ), v1,p )

=

wkp τ1 (v1,p , v1,p )

= −wkp .

Hence, ∗ ∗ wpk = −ι(wkp ), wpk = ι(wkp ).

(13.8)

2

For y, z ∈ gx1 let w = y|V0 and let u = z|V0 . Then w, u ∈ Hom(V0 , V1 ) commute with x2 and, by (2.13), (13.7) and (13.8), 1 y, z = trD/R (w∗ u) 4 n n ∗ ∗ = trD/R (ukl wlk + ukl wlk ) = trD/R (ukl ι(wkl ) − ukl ι(wkl )). k,l=1

(13.9)

k,l=1

The formula (2.12), (13.2), (13.7) and (13.8) imply yx : v0,k → −

n

ξι(wlk )v0,l ,

v0,k

l=1

xy : v0,k →

n l=1

Hence,

wkl ξv0,l , v0,k →

→−

n

n

ι(ξ)ι(wlk )v0,l ,

l=1

(13.10)

wkl ι(ξ)v0,l .

l=1

y ∈ gx1 if and only if wkl = −ι(ξ)ι(wlk )ι(ξ)−1 ,

y ∈ x g1 if and only if wkl = ι(ξ)ι(wlk )ι(ξ)−1 .

(13.11)

490

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

Suppose y, z ∈ gx1 . Then by (13.9) and (13.11), n 1 trD/R (−ι(ξ)ι(ulk )ι(ξ)−1 ι(wkl ) + ukl ξ −1 wlk ξ) y, z = 4 k,l=1

=− =−

n

k,l=1 n

−1

trD/R (wkl ξ ulk ξ) + trD/R (wkl ξulk ξ −1 ) +

k,l=1

n

k,l=1 n

trD/R (ulk ξ −1 wkl ξ)

(13.12)

trD/R (ulk ξ −1 wkl ξ) = 0,

k,l=1

where the equation ξ −1 ulk ξ = ξulk ξ −1 follows from the fact that ulk commutes with ξ 2 . The computation (13.12) shows that gx1 is an isotropic subspace of g1 . Suppose y ∈ gx1 and z ∈ x g1 . Then wkl = ι(ξ −1 wlk ξ) and, as in (13.12), we show that n 1 trD/R (ulk ξwkl ξ −1 ), y, z = −2 4 k,l=1

(13.13)

which implies that the symplectic form , provides a non-degenerate pairing between 2 2 2 gx1 and x g1 . The supergroup (Gx , gx ) is irreducible, of type II, and the ranks of Gx |V0 2 and Gx |V1 are equal. Furthermore a straightforward computation shows that (x(ξ) g1 )

x(ξ)

(g

g1 = g1 1

)

= {x(ζ); ζ 2 ∈ D(D

ξ2 )

}.

(13.13.1)

Case 5.2.b Here D ⊇ C and the vector spaces Vi (i = 0, 1) have basis vi,1 , vi,2 , vi,3 , ..., vi,n , such that τ (v0,k , v0,k ) = = ±1, τ (v1,k , v1,k ) = δi = ±i

(k = 1, 2, 3, ..., n)

(13.14)

and all the other pairings are zero. Furthermore, x = x(ξ) : v0,k → ξv1,k , v1,k → ξv0,k .

(13.15)

x2 : v0,k → ξ 2 v0,k , v1,k → ξ 2 v1,k .

(13.16)

Thus Since ξ 2 ∈ iR \ 0, 2

2

2

Gx = Gx |V0 × Gx |V1 = Un (C) × Un (C).

(13.17)

∗ Suppose w ∈ Hom(V0 , V1 ) commutes with x2 . Then, by (13.16), there are wkl , wkl ∈D 2 commuting with ξ and such that

w(v0,k ) =

n l=1

∗

wkl v1,l , w (v1,k ) =

n l=1

∗ wkl v0,l .

(13.18)

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

491

By (2.10) and (13.14) we have ∗ ) ι(wpk

= τ0 (v0,k ,

n

∗ wpl v0,l ) = τ0 (v0,k , w∗ (v1,p ))

l=1

= τ1 (w(v0,k ), v1,p ) = wkp τ1 (v1,p , v1,p ) = wkp δi. Thus, ∗ wpk = − δiι(wkp ).

(13.19)

2

For y, z ∈ gx1 let w = y|V0 and let u = z|V0 . Then w, u ∈ Hom(V0 , V1 ) commute with x2 and, by (2.13), 1 y, z = trD/R (w∗ u) 4 n n ∗ = trD/R (ukl wlk ) = trD/R (− δiι(wkl )ukl ). k,l=1

(13.20)

k,l=1

The formulas (2.12) and (13.15) imply yx : v0,k →

n

∗ ξwlk v0,l ,

l=1

xy : v0,k →

n

(13.20’) wkl ξv0,l .

l=1

Furthermore, since ξ 2 ∈ R \ 0, the centralizer of ξ 2 in D coincides with the the centralizer of ξ in D. In particular, wkl ξ = ξwkl . By combining this with (13.19) and (13.20’) we see that y ∈ gx1 if and only if wkl = − δiι(wlk ), (13.21) y ∈ x g1 if and only if wkl = δiι(wlk ). Suppose y, z ∈ x g1 . Then (13.20) and (13.21) imply n 1 trD/R (−ξ −1 wlk ξukl ) y, z = 4 k,l=1 n

1 = trD/R (− δiι(ulk )wkl ) = z, y . 4 k,l=1

(13.22)

Thus y, z = 0. Hence x g1 is an isotropic subspace of g1 . Similarly we check that gx1 is an isotropic subspace of g1 , and that the symplectic form provides a non-degenerate 2 2 pairing between x g1 and gx1 . The dual pair corresponding to the supergroup (Gx , gx ) is isomorphic to (Un , Un ). Furthermore a straightforward computation shows that (x(ξ) g1 )

x(ξ)

(g

g1 = g1 1

)

= {x(ζ); ζ 2 ∈ iR, sgn(im(ζ 2 )) = sgn(im(ξ 2 ))}.

(13.22.1)

492

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

Case 5.2.c. Here D = R and the vector spaces Vi (i = 0, 1) have basis vi,1 , vi,2 , vi,3 , ..., vi,n , vi,1 , vi,2 , vi,3 , ..., vi,n ,

such that with = ±1 τ (v0,k , v0,k ) =

(13.23)

τ (v0,k , v0,k )

τ (v1,k , v1,k )

= ,

=1

(k = 1, 2, 3, ..., n)

and all the other pairings are zero. Further, with ξ ∈ R \ 0, x = x(ξ, ) :v0,k → ξ(v1,k − v1,k ), v1,k → ξ(v0,k − v0,k ),

→ ξ(v1,k + v1,k ), v1,k → ξ(v0,k + v0,k ). v0,k

Therefore,

, v1,k → −2 ξ 2 v1,k , x2 :v0,k → −2ξ 2 v0,k → 2ξ 2 v0,k , v1,k → 2 ξ 2 v1,k . v0,k

(13.24)

(13.25)

Since ξ = 0, (13.25) implies 2

2

2

Gx = Gx |V0 × Gx |V1 = Un × Un .

(13.26)

Suppose w ∈ Hom(V0 , V1 ) commutes with x2 . Then, by (13.25), there are elements ∗ ∗ , wkl , wkl ∈ R such that wkl , wkl w(v0,k ) =

n

(wkl v1,l − wkl v1,l ),

l=1

w(v0,k )=

n

∗

w (v1,k ) =

(wkl v1,l + wkl v1,l ),

l=1 n

(13.27) ∗ (wkl v0,l

−

∗ wkl v0,l ),

l=1

w∗ (v1,k )=

n

∗ ∗ (wkl v0,l + wkl v0,l ).

l=1

From (13.23) and (13.27) we see that ∗ ∗ wpk = τ0 (v0,k , wpk v0,k ) = τ0 (v0,k , w∗ (v1,p )) = τ1 (w(v0,k ), v1,p ) = τ1 (wk,p v1,p , v1,p ) = wkp ,

and

∗ ∗ wpk = τ0 (v0,k , wpk v0,k ) = τ0 (v0,k , w∗ (v1,p ))

= τ1 (w(v0,k ), v1,p ) = τ1 (−wkp v1,p , v1,p ) = wkp .

Thus, ∗ ∗ wpk = wkp , wpk = wkp .

(13.28)

2

For y, z ∈ gx1 let w = y|V0 and let u = z|V0 . Then w, u ∈ Hom(V0 , V1 ) commute with x2 and, by (2.13), (13.27) and (13.28), 1 y, z = tr(w∗ u) 4 n n ∗ ∗ =2 (ukl wlk − ukl wlk ) = 2 (ukl wkl − ukl wkl ). k,l=1

k,l=1

(13.29)

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

493

We calculate using (13.24), (13.27) and (13.28): 1 ( wlk − wlk )v0,l − (wlk + wlk )v0,l , yx :v0,k → ξ l=1 l=1 n

v0,k

1 xy :v0,k ξ v0,k

Hence,

n

n n → ( wlk + wlk )v0,l − (wlk − wlk )v0,l , l=1

l=1

l=1

l=1

l=1

l=1

n n → (wkl − wkl )v0,l − (wkl + wkl )v0,l ,

(13.30)

n n → ( wkl + wkl )v0,l − ( wkl − wkl )v0,l

y ∈ gx1 if and only if wkl = wlk ,

y ∈ x g1 if and only if wkl = − wlk .

(13.31)

It is clear from (13.29) and (13.31) that the spaces x g1 , gx1 are isotropic, and that the sym2 2 plectic form provides a non-degenerate pairing between them. The supergroup (Gx , gx ) is irreducible, of type I, and the corresponding dual pair is isomorphic to (Un , Un ). Furthermore a straightforward computation shows that (x(ξ,) g1 )

x(ξ,)

(g

g1 = g1 1

)

= {x(ζ, ); ζ ∈ R}.

(13.31.1)

Case 5.2.d. Here D = R and the vector spaces Vi (i = 0, 1) have basis ui,1 , ui,2 , ui,3 , ..., ui,n , ui,1 , ui,2 , ui,3 , ..., ui,n , vi,1 , vi,2 , vi,3 , ..., vi,n , vi,1 , vi,2 , vi,3 , ..., vi,n ,

such that τ (ui,k , ui,k ) = τ (vi,k , vi,k ) = 1,

(13.32)

(i = 0, 1; k = 1, 2, 3, ..., n)

and all the other pairings are zero. Moreover, x = x(ξ, η) :u0,k → ξu1,k + ηv1,k , u1,k → ξu0,k + ηv0,k , v0,k → −ηu1,k + ξv1,k , v1,k → −ηu0,k + ξv0,k , u0,k → −ξu1,k + ηv1,k , u1,k → ξu0,k − ηv0,k ,

(13.33)

→ −ηu1,k − ξv1,k , v1,k → ηu0,k + ξv0,k , v0,k

Therefore, with α = ξ 2 − η 2 and β = 2ξη, x2 :u0,k → αu0,k + βv0,k , u1,k → αu1,k + βv1,k , v0,k → −βu0,k + αv0,k , v1,k → −βu1,k + αv1,k , u0,k → −αu0,k + βv0,k , u1,k → −αu1,k + βv1,k ,

(13.34)

→ −βu0,k − αv0,k , v1,k → −βu1,k − αv1,k . v0,k

Since α, β = 0, (13.34) and (13.32) imply 2

2

2

Gx = Gx |V0 × Gx |V1 = GLn (C) × GLn (C).

(13.35)

494

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

Suppose w ∈ Hom(V0 , V1 ) commutes with x2 . Then, by (13.34), w maps the span of the u0,k , v0,k to the span of the u1,k , v1,k and the span of the u0,k , v0,k to the span of the ∗ ∗ ∗ ∗ u1,k , v1,k . More precisely, there are numbers wkl , w˜kl , wkl , w˜kl , wkl , w˜kl , wkl , w˜kl ∈ R such that n w(u0,k ) = (wkl u1,l + w˜kl v1,l ), l=1

w(v0,k ) =

n

(−w˜kl u1,l + wkl v1,l ),

l=1

n w(u0,k ) = (wkl u1,l + w˜kl v1,l ),

(13.36)

l=1

)= w(v0,k

n

(−w˜kl u1,l + wkl v1,l ).

l=1

and ∗

w (u1,k ) =

n

∗ ∗ (wkl u0,l + w˜kl v0,l ),

l=1

n ∗ ∗ ∗ w (v1,k ) = (−w˜kl u0,l + wkl v0,l ), l=1

w

∗

(u1,k )

=

n

(13.37) ∗ (wkl u0,l

+

∗ w˜kl v0,l ),

l=1

n ∗ ∗ ∗ w (v1,k ) = (−w˜kl u0,l + wkl v0,l ). l=1

Using (13.32), (13.36) and (13.37) we show that ∗ ∗ ∗ ∗ = −wlk , wkl = wlk , w˜kl = w˜lk , w˜kl = −w˜lk . wkl

(13.38)

2

For y, z ∈ gx1 let w = y|V0 and let u = z|V0 . Then 1 y, z = tr(w∗ u) 4 n ∗ ∗ ∗ ∗ (ukl wlk − u˜kl w˜lk + ukl wlk − u˜kl w˜lk ) =2 =2

k,l=1 n k,l=1

(wkl ukl + w˜kl u˜kl − wkl ukl − w˜kl u˜kl )

(13.39)

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

495

Furthermore, yx :u0,k

n → ((−ξwlk − ηw ˜lk )u0,l + (ξ w˜lk − ηwlk )v0,l ),

v0,k →

l=1 n

((ηwlk − ξ w˜lk )u0,l + (−η w˜lk − ξwlk )v0,l ),

l=1

u0,k

n → ((−ξwlk + η w˜lk )u0,l + (ξ w˜lk + ηwlk )v0,l ),

(13.40)

l=1

v0,k →

n

((−ηwlk − ξ w˜lk )u0,l + (η w˜lk − ξwlk )v0,l ),

l=1

and xy :u0,k →

n ((ξwkl − η w ˜kl )u0,l + (ηwkl + ξ w˜kl )v0,l ), l=1

v0,k → u0,k

→

n

((−ξ w˜kl − ηwkl )u0,l + (−η w˜kl + ξwkl )v0,l ),

l=1 n

(13.41)

((ξwkl

+

ηw ˜kl )u0,l

+

(−ηwkl

+

ξ w˜kl )v0,l ),

l=1

v0,k →

n

((−ξ w˜kl + ηwkl )u0,l + (η w ˜kl + ξwkl )v0,l ).

l=1

Hence,

y ∈ gx1 if and only if ξwkl − η w˜kl + ξwlk + ηw ˜lk = 0,

ηwkl + ξ w˜kl + ηwlk − ξ w˜lk = 0;

− ηw ˜lk = 0, y ∈ x g1 if and only if ξwkl − η w˜kl − ξwlk ηwkl + ξ w˜kl − ηwlk + ξ w˜lk = 0.

Thus

y ∈ gx1 if and only if wkl = −wlk , w˜kl = w˜lk ;

y ∈ x g1 if and only if wkl = wlk , w˜kl = −w˜lk .

(13.42)

It is easy to see from (13.39) and (13.42) that the spaces x g1 , gx1 are isotropic, and that the symplectic form provides a non-degenerate pairing between them. The supergroup 2 2 (Gx , gx ) is irreducible, of type II, and as a dual pair is isomorphic to (GLn (C), GLn (C)). Furthermore a straightforward computation shows that (x(ξ,η) g1 )

x(ξ,η)

(g

g1 = g1 1

)

= {x(ζ, γ); ζ, γ ∈ R}.

(13.42.1)

Case 5.3.a The spaces V0 , V1 have basis vi,1 , vi,2 , vi,3 , ..., vi,n

(i = 0, 1)

such that x(ξ) : v0,k → ξv1,k , v1,k → ξv0,k .

(13.43)

496

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

Hence, x2 : v0,k → ξ 2 v0,k , v1,k → ξ 2 v1,k .

Therefore G

x2

x2

x2

= G |V0 × G |V1 =

(13.44)

GLn (D) × GLn (D) if ξ 2 ∈ R, GLn (C) × GLn (C) if ξ 2 ∈ / R.

(13.45)

Suppose A ∈ Hom(V0 , V1 ) and B ∈ Hom(V1 , V0 ) commute with x2 . Then there are elements ak,l , bk,l ∈ D commuting with ξ 2 such that A : v0,k →

n

ak,l v1,l , B : v1,k →

l=1

n

bk,l v0,l .

l=1

Furthermore the formula (v0 ∈ V0 , v1 ∈ V1 )

y(v0 + v1 ) = Bv1 + Av0 2

2

deﬁnes an element y ∈ gx1 and all elements of gx1 may be described as above. Suppose y ∈ 2 gx1 . Let A , B be the corresponding elements of Hom(V0 , V1 ), Hom(V1 , V0 ) respectively. Then, by (2.6), n 1 trD/R (bk,l al,k − bk,l al,k ). y, y = trD/R (BA − B A) = 2 k,l=1

(13.46)

We see from (13.43) that y ∈ gx1 if and only if bk,l = ξak,l ξ −1 ,

y ∈ x g1 if and only if bk,l = −ξak,l ξ −1 .

(13.47)

It is clear from (13.46) and (13.47) that x g1 and gx1 are isotropic subspaces of g1 and that the symplectic form , provides a non-degenerate pairing between them. The 2 2 Lie supergroup (Gx , gx ) is of type II, is irreducible and the corresponding dual pair is isomorphic to (GLn (C), GLn (C)) or (GLn (D), GLn (D)), as indicated in (13.45). Furthermore a straightforward computation shows that (x(ξ) g1 )

x(ξ)

(g

g1 = g1 1

)

= {x(ζ); ζ 2 ∈ D(D

ξ2 )

}.

(13.47.1)

Case 5.3.a’ Here the spaces V0 , V1 have basis vi,1 , vi,2 , vi,3 , ..., vi,n

(i = 0, 1),

x = x(ξ) : v0,k → ξv1,k , v1,k → −ξv0,k , (ξ ∈ R \ 0), (x(ξ) g1 )

x(ξ)

(g

g1 = g1 1

and the proof is as in the previous case.

)

= {x(ζ); ζ ∈ R}.

(13.47.2)

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

497

Case 5.3.b Here the spaces V0 , V1 have basis ui,1 , ui,2 , ui,3 , ..., ui,n ; vi,1 , vi,2 , vi,3 , ..., vi,n such that

(i = 0, 1)

x = x(ξ, η) :u0,k → ξu1,k + ηv1,k , u1,k → ξu0,k + ηv0,k , v0,k → −ηu1,k + ξv1,k , v1,k → −ηu0,k + ξv0,k

(13.48)

Therefore, with α = ξ 2 − η 2 and β = 2ξη, x2 :u0,k → αu0,k + βv0,k , u1,k → αu1,k + βv1,k , v0,k → −βu0,k + αv0,k , v1,k → −βu1,k + αv1,k

(13.49)

Hence, 2

2

2

Gx = Gx |V0 × Gx |V1 = GLn (C) × GLn (C).

(13.50)

2

Suppose y ∈ gx1 . Then there are numbers ak,l , a ˜k,l , bk,l , ˜bk,l in R such that y :u0,k →

n n (ak,l u1,l + a ˜k,l v1,l ), u1,k → (bk,l u0,l + ˜bk,l v0,l ), l=1

v0,k →

n

l=1 n

(−˜ ak,l u1,l + ak,l v1,l ), v1,k →

l=1

(13.51) (−˜bk,l u0,l + bk,l v0,l ).

l=1

2

If y ∈ gx1 , then (with the notation (13.51)), n 1 (ak,l bl,k − a ˜k,l˜bl,k − ak,l bl,k + a ˜k,l˜bl,k ). y, y = 2 k,l=1

(13.52)

Furthermore yx :u0,k →

n

((ξbk,l − η˜bk,l )u0,l + (ξ˜bk,l + ηbk,l )v0,l ),

l=1

v0,k →

n

((−ηbk,l − ξ˜bk,l )u0,l + (−η˜bk,l + ξbk,l )v0,l )

l=1

and xy :u0,k →

n

((ξak,l − η˜ ak,l )u0,l + (ξ˜ ak,l + ηak,l )v0,l ),

l=1

v0,k

n → ((−ξ˜ ak,l − ηak,l )u0,l + (ξak,l − η˜ ak,l )v0,l ) l=1

Therefore, y ∈ gx1 if ak,l = bk,l , a ˜k,l = ˜bk,l ; y ∈ x g1 if ak,l = −bk,l , a ˜k,l = −˜bk,l .

(13.53)

It is clear from (13.52) and (13.53) that x g1 and gx1 are isotropic subspaces of g1 and that the form , provides a non-degenerate pairing between them. The Lie supergroup

498

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506 2

2

(Gx , gx ) is irreducible, of type II and the corresponding dual pair is isomorphic to (GLn (C), GLn (C)). Furthermore a straightforward computation shows that (x(ξ,η) g1 )

x(ξ,η)

(g

g1 = g1 1

)

= {x(ζ, γ); ζ, γ ∈ R}.

(13.53.1)

This completes the proof of Theorem 4.4. Proof (of Lemma 6.4). Since the elements x and y commute, they preserve the same isotypic decomposition of V . For a ﬁxed isotypic component, all the sets which occur in Lemma 6.4 (a), (b) and (c) are described in (13.11), (13.21), (13.31), (13.42), (13.47) and (13.53). One checks the equalities (a), (b), (c) via a case by case analysis. Similarly one veriﬁes (d) and (e).

14

A proof of Theorem 4.5

Consider the map G × gx1 (g, y) → gy ∈ g1 .

(14.1)

The derivative of (14.1) at (g, y) coincides with the following linear map g0 × gx1 (A, B) → [A, gy] + gB ∈ g1 .

(14.2)

The range of the map (14.2) is equal to [g0 , gy] + g(gx1 ) = g([g0 , y] + gx1 ).

(14.3)

We see from Lemma 3.1 and Theorem 4.4 that [g0 , x] + gx1 = (x g1 )⊥ + gx1 = g1 .

(14.4)

U = {y ∈ gx1 ; [g0 , y] + gx1 = g1 }

(14.5)

Hence, the set is non-empty. The set U is open and Gx -invariant. Furthermore, (14.3) shows that the map (14.1) restricted to G × U is a submersion. The set U satisﬁes the conditions (1.0), (1.1), (1.2) and (1.5). Suppose we have a non-empty open subset U˜x ⊆ gx1 such that 2

2

(1.4) holds for the supergroup (Gx , gx ) : 2 if g ∈ Gx and y, y ∈ U˜x are such that gy = y , then g ∈ Gx . 2

(14.6)

Since x2 is semisimple, there is an admissible slice Ux2 through x2 in gx0 , with respect to the group G. We may assume that U˜x is contained in the preimage of Ux2 under the map 2 gx1 y → y 2 ∈ gx0 . (14.7)

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

499 2

Then for y, y ∈ U˜x and g ∈ G such that gy = y we have gy 2 = y 2 . Hence g ∈ Gx . But this together with (14.6) implies the g ∈ Gx . Thus U˜x satisﬁes (1.3). Hence, if we set Ux = U˜x ∩ U , where U is the set deﬁned in (14.5), then Ux satisﬁes all the conditions (1.0)-(1.5). Next we shall verify the statement (14.6). The Theorem 4.4 implies that we may 2 2 assume that either x = 0 or x = 0 and that the supergroup (Gx , gx ) corresponds either to the dual pair (Un , Un ) or to (GLn (D), GLn (D)). Since any G-invariant open neighborhood of 0 in g1 is an admissible slice through 0, we may assume that x = 0. We proceed via a case by case analysis performing the computations in terms of matrices. It will be clear from what follows that the sets constructed may be made arbitrarily small and thus form the desired basis for the neighborhoods of x in gx1 . Case (Un , Un ). Set W = Mn (C), V0 = V1 = Cn and τ0 (u0 , v0 ) = v T0 u0 , τ1 (u1 , v1 ) = v T1 iu1

(u0 , v0 ∈ V0 , u1 , v1 ∈ V1 ).

The space W is identiﬁed with Hom(V0 , V1 ) by w(v) = wv, w ∈ W , v ∈ V0 . Then w∗ = −iwT

(w ∈ W ).

The restriction of x to Hom(V0 , V1 ) coincides with ξI ∈ W , where ξ ∈ iR \ 0, im(ξ 2 ) < 0. π Thus ξ = re− 4 i , where r ∈ R \ 0. A straightforward calculation using the formula (2.12) shows that under the identiﬁcation g1 = g1 |V0 = Hom(V0 , V1 ) = W we have x

For 0 < ≤

T

T

g1 = {ξA; A = −A ∈ W } and gx1 = {ξH; H = H ∈ W }.

1 2

(14.8)

let U˜x, be the set of all points w ∈ gx1 such that

λ1 + λ2 = 0 and |λ − ξ| < |ξ| for all eigenvalues λ, λ1 , λ2 of w.

(14.9)

Let be the operator norm on W , viewed as the space of operators on the Hilbert space (V0 , τ0 ). Suppose g, h ∈ Un and ξH ∈ U˜x, are such that ξgHh−1 ∈ U˜x, . Then, by (14.9), H − I < and gHh−1 − I < . Hence H − g −1 h < , and by the triangle inequality g −1 h − I < 2 . Since 2 ≤ 1, we have A = log(g −1 h) ∈ un . Thus g −1 h = exp(A).

500

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

Since ξgHh−1 ∈ U˜x, , we have

T

gHh−1 = gHh−1 . Thus hHg −1 = gHh−1 , or equivalently Hg −1 h = h−1 gH. Therefore H exp(A) = exp(−A)H. Since, by (14.9), H is invertible and since log is injective, this last condition may be expressed as HAH −1 = −A, or equivalently as HA + AH = 0.

(14.10)

Conjugating both sides of (14.10) by an appropriate element of Un we may assume that H is diagonal. Then (14.9) shows that A = 0. Hence g −1 h = I, i.e. g = h. Since the diagonal subgroup {(g, g) ∈ Un × Un } coincides with Gx , we see that the set U˜x, satisﬁes (1.3). This also shows that the derivative of the map H → H 2 at H is an injective linear map. Furthermore, since H is close to the identity, it is positive deﬁnite. Thus H is the unique positive deﬁnite square root of H 2 . Hence the map H → H 2 is injective. Case (GLn (D), GLn (D)). For α > 0 let Mn (C)[α] = {A ∈ Mn (C); |Im(a)| < α for all eigenvalues a of A}, and let GLn (C)[α] = exp(Mn (C)[α]). Then, as is well known [11][part. II, p. 17], exp : Mn (C)[π] → GLn (C)[π]

(14.11)

is a bijective analytic diﬀeomorphism. Moreover, the closure, Cl(GLn (C)[α]) ⊆ GLn (C)[β]

(0 < α < β ≤ π).

(14.12)

Let Mn (C) ⊆ Mn (C) be the set of all matrices A such that the map Mn (C) B → AB + BA ∈ Mn (C)

(14.13)

is surjective. Clearly Mn (C) is a Zariski open, Ad(GLn (C))-invariant neighborhood of the identity I ∈ Mn (C). Lemma 14.1. For any 0 < α < β < π there is an open Ad(GLn (C))-invariant neighborhood of the identity −1 Vα,β = Vα,β ⊆ GLn (C)[α] ∩ Mn (C)a

such that GLn (C)[α] Vα,β ⊆ GLn (C)[β].b

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

501

Proof. The set of eigenvalues of a matrix A ∈ Mn (C) may be viewed as an orbit in Cn under the action of the permutation group. The family of all such orbits has a natural topology. In these terms the set of eigenvalues of a matrix A ∈ Mn (C), is a continuous function. Let S 1 ⊆ Mn (C) denote the unit sphere with respect to the operator norm . Since by Jordan’s Theorem GLn (C)[α] = {A ∈ Mn (C); a = 0, |Arg(a)| < α for all eigenvalues a of A}

(14.14)

the previous paragraph shows that there is an open neighborhood V (α,β) ⊆ GLn (C) of the identity I, such that (S 1 ∩ GLn (C)[α]) V (α,β) ⊆ GLn (C)[β].

(14.15)

Notice that the set (14.14) is closed under the dilations A → tA, t > 0. Hence, (14.15) implies (14.16) GLn (C)[α]V (α,β) ⊆ GLn (C)[β]. Let V α,β = Ad(GLn (C))V (α,β) . As the union of open sets, V α,β is open. Clearly, V α,β is Ad(GLn (C))-invariant and contains the identity. The inclusion (14.16) together with the Ad(GLn (C))-invariance of the sets GLn (C)[γ], γ = α, β, implies (b). Hence the Lemma holds for Vα,β = V α,β ∩ α,β −1 V . Corollary 14.2. Let Vα,β be as in the Lemma 14.1. Suppose A ∈ Vα,β and g, h ∈ GLn (C) are such that hAg −1 = gAh−1 ∈ Vα,β . Then g = h. In particular, if A, B ∈ Vα,β and A2 = B 2 , then A = B. Furthermore, the derivative of the map A → A2 at A ∈ Vα,β is injective. Proof. Set u = g −1 h. Then uA ∈ Vα,β . Since A−1 ∈ Vα,β , Lemma 14.1 implies u = (uA)A−1 ∈ Vα,β Vα,β ⊆ GLn (C)[α] Vα,β ⊆ GLn (C)[β] ⊆ GLn (C)[π]. Hence there is a unique B ∈ Mn (C)[π] such that u = exp(B). Furthermore, uA = g −1 hA = Ah−1 g = Au−1 . Hence exp(B)A = Aexp(−B), or equivalently exp(A−1 BA) = exp(−B).

(14.17)

502

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

Since A−1 BA and −B belong to Mn (C)[π], (14.17) implies A−1 BA = −B, or equivalently AB + BA = 0.

(14.18)

Since A ∈ Mn (C), (14.18) implies B = 0, which means that u = 1. Thus g = h. Let A, B ∈ Vα,β . Then A2 = B 2 is equivalent to AAB −1 = BAA−1 , which implies A = B. The injectivity of the derivative follows from the fact that A ⊆ Mn (C). Let D = R, C or H and let ξ ∈ D \ 0 be such that ξ 2 is in the center of D. Set ⎛ ⎞ ⎜ 0 ξI ⎟ x=⎝ (14.19) ⎠. ξI 0 Under the usual identiﬁcations we have

⎛

⎞

⎜ 0 A⎟ g1 = End(Dn ⊕ Dn )1 = {⎝ ⎠ ; A, B ∈ Mn (D)}, B 0 ⎛ ⎞ ⎜g 0 ⎟ G = GL(Dn ⊕ Dn )0 = {⎝ ⎠ ; g, h ∈ GL(Dn )}. 0h A straightforward calculation shows that ⎞ ⎛ −1 ⎜ 0 ξBξ ⎟ gx1 = {⎝ ⎠ ; B ∈ Mn (D)}, B 0 ⎞ ⎛ ⎜g 0 ⎟ n Gx = {⎝ ⎠ ; g ∈ GL(D )}. 0 ξgξ −1

(14.20)

(14.21)

Let Vα,β be the set constructed in Lemma 14.1 for the group GLn (D) if D = C, and for the complexiﬁcation of GLn (D) if D = C. Set Vα,β (D) = Vα,β ∩ GLn (D).

(14.22)

Then Vα,β (D) is an open Ad(GLn (D))-invariant neighborhood of the identity I ∈ GLn (D). Let ⎞ ⎛ −1

⎜ 0 ξBξ ⎟ Uα,β,ξ (D) ={⎝ ⎠ ; B ∈ ξVα,β (D)} B 0 ⎛ ⎞ ⎜ 0 Aξ ⎟ ={⎝ ⎠ ; A ∈ Vα,β (D)}. ξA 0

(14.23)

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

503

The set (14.23) is an open Gx -invariant neighborhood of x in gx1 . Lemma 14.3. Suppose y, z ∈ Uα,β,ξ (D) and s ∈ G are such that sys−1 = z. Then s ∈ Gx . Proof. Let

⎛ ⎜g s=⎝ 0

⎞

⎛

⎞

0⎟ ⎠ h

⎜ 0 Aξ ⎟ and y = ⎝ ⎠. ξA 0 ⎞

⎛

Then

⎜ sys−1 = ⎝

0 hξAg −1

−1

gAξh ⎟ ⎠ 0

Since sys−1 ∈ Uα,β,ξ (D), we have gAξh−1 = ξ(hξAg −1 )ξ −1 = ξhξ −1 Ag −1 ξ. Thus (ξhξ −1 )Ag −1 = gA(ξh−1 ξ −1 ).

(14.24)

Furthermore hξAg −1 ∈ ξUα,β,ξ (D). Thus (ξhξ −1 )Ag −1 = ξ −1 hξAg −1 ∈ Uα,β,ξ (D).

(14.25)

By combining (14.24), (14.25) and Corollary 14.2 we see that ξhξ −1 = g, so that h = ξ −1 gξ = ξgξ −1 . The Lemma 14.3 shows that the set (14.23) satisﬁes the condition (1.3). The rest is also clear from Corollary 14.2.

15

A proof of Theorem 4.7

The ideas presented below originate in [4, sec. 12]. Recall the following Lemma. Lemma 15.1. [12, 8.A.4.5] Let N be a complete metric space and let G be a σ-compact topological group acting on N . Suppose N is the union of a ﬁnite number of G-orbits. The we can label the orbits O1 , O2 , ..., Ok , so that for each 1 ≤ j ≤ k, the set Nj =

k

Ol is closed in N.

l=j

We apply the above Lemma to our ordinary classical Lie supergroup (G, g), with N ⊆ g1 equal to the set of nilpotent elements. As is well known, [3], N is the union of a ﬁnite number of G-orbits.

504

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

In particular, for each 1 ≤ j ≤ k there is an open G-invariant set Wj ⊆ g1 such that Wj ∩ Nj = Oj .

(15.1)

Fix 1 ≤ j ≤ k and an element x ∈ Oj . Let U ⊆ g1 be a subspace complementary to the tangent space to the orbit through x. Thus g1 = [g0 , x] ⊕ U.

(15.2)

For each z ∈ U we have a linear map Tz : g0 ⊕ U (y, z ) → [y, x + z] + z ∈ g1 .

(15.3)

Notice that, by (15.2), T0 is surjective. Further, the map U z → Tz ∈ Hom(g0 ⊕ U, g1 ) is aﬃne and hence continuous. Therefore the set of all z ∈ U such that rank(Tz ) ≥ rank(T0 ) (= dim g1 )

(15.4)

is an open neighborhood of 0 in U . Let us denote this neighborhood by U1 . Let Φ : G × U1 (g, z) → g(x + z)g −1 ∈ g1 .

(15.5)

The derivative of Φ at (g, z) coincides with the following linear map g0 ⊕ U (y, z ) → g([y, x + z] + z )g −1 ∈ g1 .

(15.6)

By (15.4), the map (15.6) is surjective. Thus Φ is a submersion. Recall the set Wj , (15.1). Let U2 = {z ∈ U1 ; x + z ∈ Wj }.

(15.7)

Φ(G × U2 ) ⊆ Wj .

(15.8)

Then Let W ⊆ g0 be a subspace complementary to the kernel of the map ad(x) : g0 y → [y, x] ∈ g1

(15.9)

W y → [y, x] ∈ [g0 , x]

(15.10)

so that the map is a linear bijection. By (15.6), the derivative of the map W × U2 (y, z) → Φ(exp(y), z) = exp(y)(x + z) exp(−y) ∈ g1

(15.11)

at (0, 0) coincides with the following linear map W ⊕ U (y, z ) → [y, x] + z ∈ g1 ,

(15.12)

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

505

which, by (15.2) and (15.10), is a linear bijection. Hence, there is an open neighborhood W1 of 0 in W and an open neighborhood U3 of 0 in U2 , such that the map W1 × U3 (y, z) → exp(y)(x + z) exp(−y) ∈ g1

(15.13)

is an diﬀeomorphism onto an open neighborhood of x in g1 . Let W2 ⊆ W1 be an open neighborhood of 0 such that exp(ad(W2 ))x ⊆ Nj is an open neighborhood of x in Nj . This is compatible with (15.8). Choose an open neighborhood W0 ⊆ W2 of 0 and an open neighborhood U0 ⊆ U3 of 0 such that Φ(exp(W0 ), U0 ) ∩ Nj ⊆ exp(ad(W2 ))x ∩ Nj .

(15.14)

Suppose z ∈ U0 is such that x + z ∈ Oj .

(15.15)

Since x + z = Φ(exp(0), z), (15.14) implies that there is y ∈ W2 such that x + z = exp(y) x exp(−y).

(15.16)

Φ(exp(0), z) = Φ(exp(y), 0).

(15.17)

Thus But then (15.13) implies that y = 0 and z = 0. Thus (x + U0 ) ∩ Oj = {x}.

(15.18)

References [1] N. Burgoyne and R. Cushman: “Conjugacy Classes in Linear Groups”, J. Algebra, Vol. 44, (1975), pp. 339–362. [2] D. Collingwood and W. McGovern: Nilpotent orbits in complex semisimple Lie algebras, Reinhold, Van Nostrand, New York, 1993. [3] A. Daszkiewicz, W. Kra´skiewicz and T. Przebinda: “Dual Pairs and KostantSekiguchi Correspondence. II. Classiﬁcation of Nilpotent Elements”, Centr. Eur. J. Math., Vol. 3, (2005), pp. 430-464. [4] Harish-Chandra: “Invariant Distributions on Lie algebras”, Amer. J. of Math., Vol. 86, (1964), pp. 271–309. [5] R. Howe: Analytic Preliminaries, preprint. [6] R. Howe: “Remarks on classical invariant theory”, Trans. Amer. Math. Soc., Vol. 313, (1989), pp. 539–570, [7] R. Howe: “Transcending Classical Invariant Theory”, J. Amer. Math. Soc., Vol. 2, (1989), pp. 535–552. [8] V. Kac: “Lie superalgebras”, Adv. Math., Vol. 26, (1977), pp. 8–96.

506

T. Przebinda / Central European Journal of Mathematics 4(3) 2006 449–506

[9] B. Kostant: Graded manifolds, graded Lie theory, and prequantization, Lecture Notes in Math., Vol. 570, Springer-Verlag, Berlin-New York, 1977, pp. 177–306. [10] M. Spivak: A comprehensive introduction to diﬀerential geometry, Brandeis University, Waltham, Massachusetts, 1970. [11] V.S. Varadarajan: Harmonic Analysis on Real Reductive Groups I and II,Lecture Notes in Math., Vol. 576, Springer Verlag, 1977. [12] N. Wallach: Real Reductive Groups I, Academic Press, INC, 1988.

DOI: 10.2478/s11533-006-0015-8 Research article CEJM 4(3) 2006 507–524

Duality triads of higher rank: Further properties and some examples Matthias Schork∗ Alexanderstr. 76, 60489 Frankfurt, Germany

Received 6 February 2006; accepted 27 February 2006 Abstract: It is shown that duality triads of higher rank are closely related to orthogonal matrix polynomials on the real line. Furthermore, some examples of duality triads of higher rank are discussed. In particular, it is shown that the generalized Stirling numbers of rank r give rise to a duality triad of rank r. c Versita Warsaw and Springer-Verlag Berlin Heidelberg. All rights reserved. Keywords: Duality triad, recurrence relation, orthogonal polynomials MSC (2000): 05Axx, 11B37, 11B83

1

Introduction

In [10, 11] the notion of duality triad was introduced. By this the following system is meant. Let three sequences i = {ik }k≥0 , q = {qk }k≥0 , and d = {dk }k≥0 of complex numbers with ik = 0 be given. For such a triple (i, q, d) we introduce the following “dynamical system”. The discrete time steps n start with 0 and at every time n we consider (i,q,d) the sequence cn = {cn,k }k≥0 ≡ {cn,k }k≥0 of complex numbers satisfying c0,k = δ0,k and the recursion relation cn+1,k = ik−1 cn,k−1 + qk cn,k + dk+1 cn,k+1 .

(1)

Given the initial values and the triple of sequences, the coeﬃcients cn,k are uniquely (i,q,d) determined. Let us consider the polynomial sequence {Φn (x)}n≥0 ≡ {Φn (x)}n≥0 (i.e., deg Φn (x) = n) satisfying Φ0 (x) = 1 and the recursion relation xΦn (x) = dn Φn−1 (x) + qn Φn (x) + in Φn+1 (x). ∗

E-mail: [email protected]

(2)

M. Schork / Central European Journal of Mathematics

This recursion relation for the sequence of triad polynomials Φn (x) is dual to the relation (1) in the sense that the following inversion relation holds [10, 11]: xn = cn,k Φk (x). (3) k≥0

This system of numbers cn,k and polynomials Φn (x) satisfying (1), (2), and (3) for the given triple of sequences (i, q, d) is called duality triad (associated to (i, q, d)) [10, 11]. As stressed in [10], these duality triads are dual recurrences satisfying (1) and (2) as used in dynamical data structure theory [8, 9] which furthermore satisfy a third relation, the inversion relation (3). In general, the coeﬃcients cn,k are thus the expansion coeﬃcients (or connection coeﬃcients) of the monomials xn in the basis of polynomials {Φk (x)}k≥0 . In [4, 10–12, 18] these triads were studied and several well-known sequences of combinatorial numbers cn,k and polynomials Φn (x) were shown to be special cases. Due to (2) the triad polynomials are very closely related to orthogonal polynomials on the real line. In [18] a generalization of duality triads to higher rank (where the rank one case corresponds to the usual duality triads) was suggested and it was conjectured that the Stirling numbers of higher rank introduced in [2, 3] should yield an interesting example of such a generalized duality triad. A precise deﬁnition of duality triads of higher rank was given in [19] and ﬁrst properties were derived. However, only toy examples were given in [19]. It is the aim of the present paper to present some concrete examples of duality triads of higher rank. In particular, it is shown that the generalized Stirling numbers of higher rank (and their q-deformed analogues introduced in [17]) give indeed rise to duality triads of higher rank. The main structural result of this paper is a description of the intimate connection between duality triads of higher rank and orthogonal matrix polynomials on the real line, generalizing the above-mentioned connection of the rank one case. The structure of the paper is as follows. In Section 2 some examples of duality triads (of rank one) and their relation to dynamical data structures and orthogonal polynomials are discussed. In Section 3 we recall the deﬁnition of duality triads of higher rank, introduce the important class of Hermitian duality triads and give a possible interpretation of the deﬁning recursion relation in terms of dynamical data structures. In Section 4 some properties of orthogonal matrix polynomials are recalled and the connection between Hermitian duality triads of rank r and orthogonal matrix polynomials of rank r on the real line is established. Some explicit examples of duality triads of higher rank are discussed in Section 5. Finally, in Section 6 some conclusions are presented.

2

Duality triads: Some examples and properties

In this section we describe a few examples of duality triads which will play a prominent role later on. Many more examples can be found in [4, 10–12, 18]. We also make some remarks concerning the relationship between duality triads and dynamical data structures as discussed in [8] and orthogonal polynomials on the real line.

M. Schork / Central European Journal of Mathematics

Example 2.1. (Pascal triad) ik = 1, qk = 1, dk = 0. The corresponding cn,k satisfy the recursion relation cn+1,k = cn,k−1 + cn,k and are given by the binomial coeﬃcients, i.e., cn,k = nk . The triad polynomials are given by Φk (x) = (x − 1)k so that the inversion relation (3) becomes in this case n

x =

n n

k

k=0

(x − 1)k .

(4)

Example 2.2. (Stirling triad) ik = 1, qk = k, dk = 0. The corresponding cn,k satisfy cn+1,k = cn,k−1 + kcn,k and are given by the Stirling numbers of second kind, i.e., cn,k = S(n, k) where (see [21]) k (−1)k p k S(n, k) = pn . (−1) p k! p=0

(5)

The triad polynomials are given by Φk (x) = x(x − 1) · · · (x − k + 1) =: xk and (3) becomes in this case n n S(n, k)xk . (6) x = k=0

Example 2.3. (q-deformed Stirling triad) ik = q k , qk = kq , dk = 0, where the basic n k nq ! q-numbers are given by kq = 1−q , mq ! = m k=1 kq and k q = kq !(n−k)q ! [1]. The corre1−q sponding cn,k satisfy cn+1,k = q k−1 cn,k−1 + kq cn,k and are given by the q-deformed Stirling numbers of second kind, i.e., cn,k = S(n, k|q) where (see [17] and the references therein) k (−1)k k p (k−p ) 2 S(n, k|q) = (−1) q pn . p q q kq ! p=0

(7)

As a reference for later discussions we denote the recursion relation explicitly, S(n + 1, k|q) = q k−1 S(n, k − 1|q) + kq S(n, k|q).

(8)

Introducing χk (x) := x(x − 1q ) · · · (x − (k − 1)q ), the triad polynomials are given by k Φk (x) = q −(2) χk (x) so that (3) becomes xn =

n

k

q −(2) S(n, k|q)χk (x).

(9)

k=0

Introducing xkq := xq (x − 1)q · · · (x − k + 1)q , a more natural relation generalizing (6) is xnq

=

n

S(n, k|q)xkq .

(10)

k=0

Choosing the special value q = 1 leads to Example 2.2. Note, however, that (10) is not an example of the inversion relation (3) in the strict sense since the occurring functions are

M. Schork / Central European Journal of Mathematics

polynomials in xq but not in x. For further discussions see Remark 5.8. As discussed in [18] it is possible to generalize this example to various diﬀerent generalizations of Stirling numbers, e.g., to the Stirling numbers occurring in “ψ-extended umbral calculus” (see [13] for an up-to-date account). Remark 2.4. (Dynamical data structures) Let us discuss brieﬂy the relation between duality triads and dynamical data structures as discussed in [8]. As a concrete example of a data structure we think of a ﬁle containing some items (representing, e.g., a list or a stack). Let us denote by cn,k the number of possible paths, i.e., histories, of the given ﬁle starting from height 0 (i.e., the empty ﬁle) that are at height (= ﬁle size) k at time n. We think of the the height as the location of a particle on the line. It jumps left or right or sits at each time step according to whether the operation Insertion I, Deletion D or a Query Q is applied to the ﬁle (here we assume that that only these operations are permitted for the corresponding data structure). Let us denote by N pos(O, k) the number of possibilities for performing the operation O (where O ∈ {I, D, Q}) on the ﬁle of size k. Abbreviating ik := N pos(I, k), dk := N pos(D, k), qk := N pos(Q, k), it is then clear that cn,k satisﬁes the recursion relation (1). In order to interpret (1) in this setting the coeﬃcients ik , dk , qk should be non-negative integers. Remark 2.5. (Orthogonal polynomials on the real line) We brieﬂy review some basic facts about orthogonal polynomials following the standard references [5, 22] (see also the recent survey [23] discussing many new developments). Let μ be a positive Borel measure (with an inﬁnite number of points in its support) with support on the real line such that all moments μn := R xn dμ(x) exist. Then there exist unique polynomials pn (x) = κn xn + · · · with κn > 0 that form an orthonormal system in L2 (R, μ), i.e., pm (x) dμ(x) = δn,m . The pn ’s are called orthonormal polynomials (pn , pm ) := R pn (x)¯ corresponding to μ. One of the most remarkable consequences of the fact that μ is supported on the real line is that the pn ’s obey a three-term recurrence relation xpn (x) = an+1 pn+1 (x) + bn pn (x) + an pn−1 (x)

(11)

n where the coeﬃcients can be determined by an = κκn−1 > 0 and bn = R xp2n (x) dμ(x). Conversely, any system of polynomials satisfying (11) with real an > 0, bn is an orthonormal system with respect to a (not necessarily unique) measure μ on the real line (Favard’s theorem). Comparing (11) with (2) shows the close connection of the triad polynomials Φn (x) to orthogonal polynomials. The main diﬀerence is that the tridiagonal transfer (l) matrix T of a duality triad (15) is not necessarily symmetric (depending on μn ), see Theorem 3.5.

3

Duality triads of higher rank

Let us introduce a concise notation following [18, 19]. Thus, (−1)

μk

≡ ik ,

(0)

μk ≡ qk ,

(1)

μk ≡ dk .

(12)

M. Schork / Central European Journal of Mathematics

Remark 3.1. With this notation the basic sequences of the Stirling triad of Example (l) (l) 2.2 can be written as μk = k 1+l if l ≤ 0 and μk = 0 if l > 0. The q-deformed versions (l) (l) of Example 2.3 are given by μk = q −lk kq1+l if l ≤ 0 and μk = 0 if l > 0. The basic recursion relation (1) can then be written as cn+1,k =

1

(l)

μk+l cn,k+l

(13)

l=−1

and the dual recursion relation (2) is given by xΦn (x) =

1

μ(−l) n Φn+l (x).

(14)

l=−1

Deﬁning the tridiagonal transfer matrix T by Tkl :=

1

(σ)

μk+σ δk+σ,l ,

(15)

σ=−1

one can write (13) as cn+1 = T cn with the interpretation as a dynamical system on the space of sequences. The main idea of a duality triad of rank r ≥ 1 consists in generalizing the basic three-term recursion (13) to a (2r + 1)-term recursion relation for the cn,k . Now, let r be a natural number and assume that we are given a (2r + 1)-tuple of sequences μ = (μ(−r) , . . . , μ(r) ). Then the straightforward generalization of (13) is the (2r + 1)-term recursion relation r (l) cn+1,k = μk+l cn,k+l (16) l=−r

with initial condition c0,k = δ0,k . Deﬁning the linear spaces t C∞ f in := {c = (c0 , c1 , . . .) | ck ∈ C, only ﬁnitely many ck = 0}, k akj xj , akj ∈ C, akk = 0}, PC := {Ψ(x) = (Ψ0 (x), Ψ1 (x), . . .) | Ψk (x) = j=0

and the bilinear pairing ·|· : PC × C∞ f in → C[x], given by Ψ(x)|c := may now recall the deﬁnition of duality triads of rank r given in [19].

∞

k=0 ck Ψk (x),

we

Definition 3.2. (Duality triad of rank r) Let a (2r + 1)-tuple of sequences μ = (−r) (μ(−r) , . . . , μ(r) ) with μk = 0 for all k and a polynomial pr (x) of degree r be given. Furthermore, let an r-tuple Ψ(x) = (Ψ0 (x), . . . , Ψr−1 (x)) of polynomials satisfying Ψ0 (x) = 1 and deg Ψk (x) = k be given. The associated duality triad or rank r is deﬁned by the trans (σ) fer matrix T ≡ T (μ) with Tkl := rσ=−r μk+σ δk+σ,l , a sequence {cn }n≥0 with cn ∈ C∞ f in , and a Φ(x) ∈ PC such that: (i) cn+1 = T cn (together with c0,k = δ0,k ), (ii) pr (x)Φ(x) = Φ(x)T (together with Φk (x) = Ψk (x) for 0 ≤ k ≤ r − 1), and

M. Schork / Central European Journal of Mathematics

(iii) Φ(x)T |cn = Φ(x)|T cn for every n ≥ 0. It was shown in [19] that every duality triad of rank one with p1 (x) = x is a duality triad in the original sense of [10, 11] and vice versa. To be more explicit, (i) reproduces the recursion relation (16) whereas (ii) yields the dual recursion relation for the triad polynomials r pr (x)Φn (x) = μ(−l) (17) n Φn+l (x) l=−r

and is the generalization of (14). The inversion relation (iii) is given explicitly by n

[pr (x)] =

rn

cn,k Φk (x)

(18)

k=0

and is the generalization of (3). The transfer matrix T = (Tkl )k,l∈N is a (2r + 1)-diagonal (0) matrix and has around the element Tkk = μk the structure ⎞ ⎛ .. .. .. . . . ⎟ ⎜ ⎟ ⎜ ⎜ . . . μ(0) μ(1) μ(2) · · · ⎟ ⎟ ⎜ k−1 k k+1 ⎟ ⎜ ⎟ ⎜ (0) (1) T = ⎜ · · · μ(−1) . (19) μk+1 · · · ⎟ k−1 μk ⎟ ⎜ ⎟ ⎜ ⎜ · · · μ(−2) μ(−1) μ(0) · · · ⎟ k−1 k k+1 ⎟ ⎜ ⎠ ⎝ .. .. .. . . . Definition 3.3. (Hermitian duality triad of rank r) A duality triad of rank r associated to the (2r + 1)-tuple of sequences μ = (μ(−r) , . . . , μ(r) ) is called Hermitian if the (σ) associated transfer matrix T with Tkl := rσ=−r μk+σ δk+σ,l is Hermitian. Proposition 3.4. For an Hermitian duality triad of rank r the sequence μ(0) is real and the sequences μ(−l) with 1 ≤ l ≤ r can be expressed through the sequences μ(l) with positive index by (−l) (l) μk = μk+l . (20) Thus, only the (r + 1) sequences μ(l) with 0 ≤ l ≤ r are independent. Proof. This follows immediately from (19).

Note that the rather innocent looking property of being Hermitian is in fact a rather strong restriction on a duality triad of rank r. In particular, the explicit examples of duality triads of rank r discussed in Section 5 are not Hermitian. In the rank one case (with notations from the introduction) being Hermitian means that the qk are real and that one has the relation ik−1 = dk . Assuming in particular that the sequence d = (dk )k∈N is real and positive, the recursion relation (2) of the triad polynomials Φn (x) becomes xΦn (x) = dn Φn−1 (x) + qn Φn (x) + dn+1 Φn+1 (x)

(21)

M. Schork / Central European Journal of Mathematics

which is (upon identifying the sequences an ≡ dn and bn ≡ qn ) exactly the recursion relation (11) of orthogonal polynomials on the real line. Let us collect the above observations in the following theorem. Theorem 3.5. Let an Hermitian duality triad of rank one (with polynomial p1 (x) = x) associated to the pair of real sequences d ≡ μ(1) and q = μ(0) with dk > 0 be given. Then the triad polynomials Φn (x) are orthogonal polynomials on the real line in the sense of (11) and determine a positive measure on the real line (Favard’s theorem). Conversely, given a sequence of orthogonal polynomials Φn (x) on the real line satisfying (11) there exists an Hermitian duality triad of rank one (with polynomial p1 (x) = x) associated to (1) (0) (−1) the sequences μn ≡ an , μn ≡ bn and μn ≡ an+1 such that the triad polynomials are equal to the Φn (x). The generalization of this result to duality triads of higher rank will be discussed in the next section (see Theorem 4.2). Remark 3.6. (Dynamical data structures revisited) Let us try to connect duality triads of rank r ≥ 1 to dynamical data structures which were brieﬂy recalled in Remark 2.4 with their connection to duality triads (of rank one). The coeﬃcient cn,k has an interpretation as the number of possible histories a ﬁle (empty at the beginning) with k items has after n time steps where at each time step one of the operations I, D or Q is performed and where exist, e.g., ik possibilities to perform an insertion I if the ﬁle contains k items. This interpretation led directly to (1). Having this interpretation in mind, one should write the recursion relation (16) of a duality triad of rank r as (r)

(1)

(1)

(r)

cn+1,k = ik−r cn,k−r + · · · + ik−1 cn,k−1 + qk cn,k + dk+1 cn,k+1 + · · · + dk+r cn,k+r .

(22)

(σ)

The interpretation is then that one has ik−σ (with 1 ≤ σ ≤ r) possibilities of inserting (τ ) σ items simultaneously in the ﬁle containing k − σ items and having dk+τ (with 1 ≤ τ ≤ r) possibilities of deleting τ items simultaneously in the ﬁle containing k + τ items (σ) (τ ) (of course, for this interpretation all coeﬃcients ik−σ , qk , dk+τ have to be non-negative integers). Thus, if the data structure is accessed sequentially (i.e., one item after another), (σ) (τ ) the coeﬃcients ik−σ with σ > 1 as well as dk+τ with τ > 1 should vanish. In other words, the case r > 1 is not a good model. However, if one imagines that the ﬁle can be accessed in a parallel fashion (e.g., by r independent “operators”) then a possible application of duality triads of higher rank to the study of dynamical data structures seems not to be too far fetched.

4

Duality triads of higher rank and orthogonal matrix polynomials on the real line

In this section we draw a connection between duality triads of rank r and orthogonal matrix polynomials of rank r on the real line. This generalizes the connection between

M. Schork / Central European Journal of Mathematics

duality triads of rank one and scalar orthogonal polynomials discussed in Theorem 3.5. Before we do this we ﬁrst recall some basic facts about orthogonal matrix polynomials (following [7, 15, 20]) and their relation to higher order scalar recurrence relations [7]. An r × r matrix P (x) = (pij (x))1≤i,j≤r with polynomial entries pij (x) of degree at most n is called a matrix polynomial of degree at most n. Alternatively, one can write P (x) = Cn xn + · · · + C0 with some numerical matrices Ck of size r × r. A matrix μ(r) = (μij )1≤i,j≤r of complex Borel measures deﬁned on the real line is positive deﬁnite if for any Borel set E the matrix μ(r) (E) is positive semideﬁnite. We assume that all moments of μ(r) are ﬁnite. With such a matrix μ(r) one can deﬁne a matrix inner product on the space of r×r matrix polynomials via (P, Q) := R P (x) dμ(r) (x) Q∗ (x), and if (P, P ) is nonsingular for any P with nonsingular leading coeﬃcient, then just as in the scalar case one can generate a sequence {Pn }n≥0 of matrix polynomials which is orthonormal with respect to μ(r) , i.e., (Pn , Pm ) = R Pn (x) dμ(r) (x) Pm∗ (x) = δn,m Ir . The sequence {Pn }n≥0 is determined only up to left multiplication by unitary matrices, i.e., if Un are unitary matrices then {Un Pn }n≥0 also forms an orthonormal system with respect to μ(r) . Just as in the scalar case the orthogonal matrix polynomials satisfy a three-term recurrence relation xPn (x) = An+1 Pn+1 (x) + Bn Pn (x) + A∗n Pn−1 (x) (23) where An are nonsingular and Bn Hermitian. Conversely, the analogue of Favard’s theorem is also true: If a sequence of matrix polynomials {Pn }n≥0 satisﬁes (23) with nonsingular An and Hermitian Bn then there exists a positive deﬁnite measure matrix μ(r) such that Pn are orthogonal with respect to μ(r) . Orthogonal matrix polynomials of rank r are closely related to (2r + 1)-term recurrence relations for scalar polynomials. Let a polynomial h(x) of degree r be given. Instead of using the basis of monomials {1, x, x2 , . . .} to span the linear space of polynomials, one can use the basis {xj hi (x) | 0 ≤ j < r, i = 0, 1, . . .}, i.e., {1, x, . . . , xr−1 , h(x), xh(x), . . . , xr−1 h(x), h2 (x), xh2 (x), . . .}. A polynomial p(x) of degree nr + m (0 ≤ m < r) can then expanded in this basis as p(x) =

n r−1

ai,j xj hi (x).

(24)

i=0 j=0

Let us deﬁne operators Rh,r,j (for 0 ≤ j ≤ r − 1) which take from p just those terms of the form ai,j xj hi (x) and then remove the common factor xj and change h(x) to x, i.e., Rh,r,j [p](x) =

n

ai,j xi .

(25)

i=0

Now, we can state the following theorem due to Duran and Van Assche [7]. Theorem 4.1 (Duran, Van Assche). Suppose {pn (x)}n≥0 is a sequence of polynomials satisfying the following (2r + 1)-term recurrence relation r d¯n,k pn−k (x) + dn+k,k pn+k (x) h(x)pn (x) = dn,0 pn (x) + k=1

(26)

M. Schork / Central European Journal of Mathematics

where dn,0 (n = 0, 1, . . .) is a real sequence and dn,k (n = 0, 1, 2, . . .) are complex sequences for k = 1, . . . , r with dn,r = 0 for every n and with the initial conditions pk (x) = 0 for k < 0 and pk given polynomials of degree k, for k = 0, . . . , r − 1. Deﬁne the sequence of matrix polynomials {Pn (x)}n≥0 of rank r by ⎛ ⎞ Rh,r,r−1 [pnr ](x) Rh,r,0 [pnr ](x) · · · ⎜ ⎟ ⎜ ⎟ ⎜ Rh,r,0 [pnr+1 ](x) · · · Rh,r,r−1 [pnr+1 ](x) ⎟ ⎜ ⎟ Pn (x) := ⎜ (27) ⎟. . . ⎜ ⎟ .. .. ⎜ ⎟ ⎝ ⎠ Rh,r,0 [pnr+r−1 ](x) · · · Rh,r,r−1 [pnr+r−1 ](x) Then this sequence of matrix polynomials is orthonormal on the real line with respect to a positive deﬁnite matrix measure and satisﬁes a three-term recurrence relation of the form (23) where the coeﬃcients An , Bn can be described explicitly in terms of dn,k . Conversely, given a sequence Pn (x) = (Pn;k,l (x))1≤k,l≤r of orthonormal matrix polynomials of rank r on the real line (satisfying a three-term recurrence (23)), the sequence pn (x) of scalar polynomials deﬁned for n ∈ N and 0 ≤ m ≤ r − 1 by pnr+m (x) :=

r

xj Pn;m,j (h(x))

(28)

j=0

satisﬁes a (2r + 1)-term recursion relation of the form (26). We now come to the analogue of Theorem 3.5 and show the intimate connection between duality triads of higher rank and orthogonal matrix polynomials. Theorem 4.2. Let an Hermitian duality triad of rank r associated to the polynomial pr (x) and the (r + 1) sequences μ(l) with 0 ≤ l ≤ r be given (the r polynomials Ψk (x) with 0 ≤ k ≤ r − 1 have also to be ﬁxed). Then there exists a sequence of matrix polynomials Pn (x) (determined up to multiplication by unitary matrices Un ) which satisfy a three-term recursion of the form (23) - where the coeﬃcients An , Bn can be described explicitly in (l) terms of the μk - and are orthogonal with respect to a positive deﬁnite matrix measure (Favard’s theorem for matrix polynomials). Conversely, given a sequence Pn (x) of orthogonal matrix polynomials of rank r on the real line and a polynomial h(x) of degree r, there exists an associated Hermitian duality triad of rank r (with polynomial pr (x) = h(x)) whose triad polynomials are completely determined by the entries of the Pn (x). Proof. Let an Hermitian duality triad of rank r be given. The triad polynomials Φn (x) satisfy the recursion relation (17). Writing this more explicitly and using the relation (20) valid for an Hermitian duality triad, we obtain pr (x)Φn (x) =

μ(0) n Φn (x)

+

r

(k) μ(k) Φ (x) + μ Φ (x) . n−k n n+k n+k

k=1

(29)

M. Schork / Central European Journal of Mathematics (0)

(k)

Deﬁning the polynomial h(x) := pr (x) and the sequences dn,0 := μn and dn,k := μn for 1 ≤ k ≤ r, this equation becomes r h(x)Φn (x) = dn,0 Φn (x) + dn,k Φn−k (x) + dn+k,k Φn+k (x) .

(30)

k=1

Thus, the triad polynomials Φn (x) satisfy (26) where the sequence dn,0 is real and where (r)

(−r)

dn,r ≡ μn = μn−r = 0 by deﬁnition of the duality triad of rank r (and the ﬁrst r polynomials are given explicitly by Φk (x) = Ψk (x) for 0 ≤ k ≤ r − 1). Thus, all the assumptions of Theorem 4.1 are satisﬁed. Deﬁning Rh,r,j [Φn ](x) as in (25) and with their help the corresponding matrix polynomials Pn (x) as in (27), the theorem of Duran and Van Assche (Theorem 4.1) assures us that the Pn (x) satisfy the asserted three-term recursion relation and are orthogonal with respect to a positive deﬁnite measure matrix on the real line, thereby showing the ﬁrst assertion. Now, let us show the converse, i.e., the polynomial h(x) and a sequence Pn (x) is given. The second part of the theorem of Duran and van Assche (Theorem 4.1) assures us that the polynomials pn (x) deﬁned by (28) satisfy a (2r + 1)-term recursion relation of the form (26) with certain coeﬃcients (k) dn,k . Deﬁning the sequences μn := d¯n,k for 0 ≤ k ≤ r yields the basic input of an Hermitian duality triad of rank r. Example 4.3. (Discrete Sobolev orthogonal polynomials) An important class of polynomials satisfying a higher order recurrence relation is obtained by taking polynomials orthogonal with respect to an inner product of (discrete) Sobolev type, (p, q) =

p(x)q(x) dμ(x) + R

Mi M

λi,j p(j) (ci )q (j) (ci ),

(31)

i=1 j=0

where p, q are real polynomials on the real line and λi,j ≥ 0. Here derivatives are taken at M points ci ∈ R, and at the point ci the highest derivative is of order Mi [7]. Introducing Mi +1 of degree M + M the polynomial h(x) := M i=1 (x − ci ) i=1 Mi =: r, it can be shown that the corresponding orthogonal polynomials pn with (pn , pm ) = δn,m satisfy a recursion relation of the form h(x)pn (x) = rk=−r an,k pn+k (x), i.e., a (2r+1)-term recursion relation. It is clear that by deﬁning the sequences μ(l) appropriately in terms of the an,k one obtains a duality triad of rank r. Considering the particular case M = 1 and denoting the highest derivative M1 ≡ s, one ﬁnds r = s + 1 and, therefore, a (2s + 3)-term recursion relation for the polynomials. This is the original observation of Marcell´an and Ronveaux [14]. Before closing this section let us mention that there exists a close connection between orthogonal polynomials on an algebraic harmonic curve and orthogonal matrix polynomials on the real line (due to Marcell´an and Sansigre [15]). The orthogonal polynomials on the curve satisfy a (2r + 1)-term recursion relation where r is given by the degree of the polynomial h(z) deﬁning the algebraic curve.

M. Schork / Central European Journal of Mathematics

5

Examples of duality triads of higher rank

In this section some examples of duality triads of rank r ≥ 1 generalizing the duality triads (of rank one) considered in Section 2 will be discussed. Note that none of the examples considered here is Hermitian (and, therefore, not corresponding to orthogonal matrix polynomials by Theorem 4.2). Example 5.1. (“Embedded” Pascal triad) This is an example which was treated brieﬂy in [19]. Let the (2r + 1)-tuple μ = (μ(−r) , 0, . . . , 0, μ(0) , 0, . . . , 0) of sequences be given (−r) (0) := 1 and μk := 1. Furthermore, a polynomial pr (x) and an r-tuple of where μk polynomials Ψ(x) = (Ψ0 (x), . . . , Ψr−1 (x)) has to be given. If we write n = r nr + n∗ with 0 ≤ n∗ ≤ r − 1 then the triad polynomials Φn are given for n ≥ r by n Φn (x) = (pr (x) − 1)[ r ] Ψn∗ (x).

(32)

Let us turn to the coeﬃcients cn,k satisfying cn+1,k = cn,k +cn,k−r . It is clear that cn,k = 0 if n ¯ and obtain cn,kr k is not a multiple of r. If k is a multiple of r we may write k = kr ¯ = k ¯ . Drawing a table of the coeﬃcients cn,k shows why we have called this example “embedded” ∗ Pascal triad (see Example 2.1). Choosing Ψn∗ (x) = xn (for 0 ≤ n∗ ≤ r − 1) it follows ¯ k that Φkr ¯ (x) = (pr (x) − 1) . Since cn,k vanishes if k is not a multiple of r the inversion relation (18) is in this case given by n n ¯ (pr (x) − 1)k . [pr (x)] = ¯ k n

(33)

¯ k=0

By choosing diﬀerent pr (x) (e.g., pr (x) = xr or pr (x) = xr ) one obtains diﬀerent generalizations of (4). (l)

(l)

Example 5.2. (Pascal triad of rank r) Let μk := 1 if l ≤ 0 and μk := 0 if l > 0. Thus, the coeﬃcients cn,k satisfy the recursion relation cn+1,k = cn,k + cn,k−1 + · · · + cn,k−r .

(34)

Clearly, the case r = 1 corresponds to the usual Pascal triad treated in Example 2.1. A small induction shows that the coeﬃcients (for ﬁxed n) are symmetric around the one in the middle, i.e., cn,rn−k = cn,k . Following [6], p. 77 (see also [16], p. 167) we introduce (r) in [6]) by the identity the polynomial coeﬃcients An,k (denoted by n,r+1 k (1 + x + · · · + x ) = r n

rn

(r)

An,k xk .

(35)

k=0 (r)

From this it follows immediately that A0,k = δk,0 as well as (r)

(r)

(r)

(r)

An+1,k = An,k + An,k−1 + · · · + An,k−r

(36)

M. Schork / Central European Journal of Mathematics (r)

so that An,k ≡ cn,k from (34). The case r = 1 leads via the binomial formula directly to (1) the binomial coeﬃcient An,k = nk , see Example 2.1. In the general case one may use the multinomial formula for the left-hand side of (35) to obtain n! (r) An,k = . (37) (n − l1 − · · · − lr )!l1 ! · · · lr ! 0≤l ,...,lr ≤n 1 l1 +2l2 +···+rlr =k

(2)

(2)

Specializing (37) to the case r = 2 one obtains for 0 ≤ k ≤ n (recall that An,2n−k = An,k ) the following explicit values of the trinomial coeﬃcients k

(2) An,k

=

2 l=0

k2 n k−l n! = k−l l (n − k + l)!(k − 2l)!l! l=0

(38)

(where we have denoted by x the greatest integer less than or equal to x). The maximum n2 nn−l (2) (2) value of the An,k for ﬁxed n is attained if k = n and equals An,n = l=0 . This l l particular value for the middle trinomial coeﬃcient is also mentioned in [16]. A table (2) for the ﬁrst few trinomial coeﬃcients An,k ≡ n,3 as well as quadrinomial coeﬃcients k n,4 (3) A ≡ k can be found on p. 78 in [6]. Note that taking x → 1 in (35) shows that n,k (r) rn n k=0 An,k = (1 + r) . Now, let us determine the remaining properties of a duality triad of rank r. Deﬁning pr (x) := 1 + x + · · · + xr as well as Φk (x) := xk for k ≥ r (together with Ψk (x) := xk for 0 ≤ k ≤ r − 1) shows that (35) is the inversion relation (18) in this case. What remains to be checked is the dual recursion relation (17) for the triad polynomials Φk (x) = xk which is given in this case by (1 + x + · · · + x )x = r

n

r

xn+l

(39)

l=0

and which evidently holds. It is interesting to note that in the case r = 1 one obtains p1 (x) = 1 + x and Φk (x) = xk , thus reproducing Example 2.1 only up to a shift x x − 1 (in Example 2.1 one has p1 (x) = x and Φk (x) = (x − 1)k ), suggesting that the deﬁnition of the Pascal triad in the rank one case is not the natural choice. However, we may now summarize the above observations in the following theorem. Theorem 5.3. (Pascal triad of rank r) For a given r ≥ 1 let pr (x) := 1 + x + · · · + xr , (l) Ψk (x) := xk for k = 0, . . . , r − 1 and the (2r + 1) sequences μ(l) deﬁned by μk := 1 if (l) l ≤ 0 and μk := 0 if l > 0 be given. Then the associated duality triad of rank r consists (r) of the coeﬃcients cn,k ≡ An,k of (37) and the triad polynomials Φk (x) ≡ xk satisfying the recursion relation (36), the dual recursion relation (39) and the inversion relation (35). Choosing r = 1 yields the Pascal triad treated in Example 2.1 (up to a shift x x − 1). Example 5.4. (Generalized Pascal triad of rank r) We will now generalize (35) slightly (r) by introducing a vector λ = (λ1 , . . . , λr ) ∈ Zr and deﬁning coeﬃcients An,k (λ) by (1 + λ1 x + · · · + λr x ) = r n

rn k=0

(r)

An,k (λ)xk ;

(40)

M. Schork / Central European Journal of Mathematics (r)

(r)

(r)

if λ = (1, . . . , 1) then An,k (λ) = An,k from above. The deﬁnition implies A0,k (λ) = δk,0 and the recursion relation (r)

(r)

(r)

(r)

An+1,k (λ) = An,k (λ) + λ1 An,k−1 (λ) + · · · + λr An,k−r (λ).

(41) (l)

Thus, the sequences μ(l) of the corresponding duality triad of rank r are given by μk = λ−l (0) (l) if l < 0, μk = 1 and μk = 0 if l > 0. In view of (40) one deﬁnes pr (x) := 1 + λ1 x + · · · + λr xr , Φk (x) := xk as well as Ψk (x) := xk so that (40) is already the inversion relation. The (r) data {μ(l) , pr (x), Ψk (x), Φk (x), cn,k ≡ An,k (λ)} comprise the generalized Pascal triad of rank r (the dual recursion relation for the triad polynomials remains to be checked). Using the multinomial formula for the left-hand side of (40) yields a formula for the coeﬃcients (r) An,k (λ) similar to (37). Note that by choosing the particular vector λ = (0, . . . , 0, 1) (r)

(r)

(r)

one obtains the recursion relation An+1,k (λ) = An,k (λ) + An,k−r (λ) of Example 5.1. As another example, let us consider the case r = 2 with the vector λ = (−1, 1). The (2) (2) (2) coeﬃcients An,k (−1, 1) ≡ An,k ((−1, 1)) satisfy the recursion relation An+1,k (−1, 1) = (2)

(2)

(2)

An,k (−1, 1) − An,k−1 (−1, 1) + An,k−r (−1, 1) and are related to the trinomial coeﬃcients (2)

(2)

(2)

An,k corresponding to the vector λ = (1, 1) by An,k (−1, 1) = (−1)k An,k . Remark 5.5. It is tempting to consider r → ∞ in (35) (see also [16], p. 189, for a related discussion). Summing the geometric series yields as deﬁnition of the cor ∞ (∞) k responding coeﬃcients (1 − x)−n = k=0 An,k x . Thus, one has the explicit values (∞) (∞) (∞) (∞) (∞) = n+k−1 and the recursion relation An+1,k = An,k +An,k−1 +· · ·+An,0 An,k = (−1)k −n k k which one may consider as the limit r → ∞ of (36). Note, however, that this will not give rise to a duality triad of higher rank since the order of the recursion relation is not ﬁxed, but depends on the second index. Example 5.6. (Stirling triad of rank r) The generalized Stirling numbers of second kind Sr,s (n, k) were introduced in [2, 3]. In the special case r = s = 1 they reduce to the conventional Stirling numbers of Example 2.2, i.e., S1,1 (n, k) ≡ S(n, k), and in the case r = 2, s = 1 they reduce to Lah numbers. In the most important special case s = r (which we will call Stirling numbers of rank r) it was shown in [2, 3] that k (−1)k p k Sr,r (n, k) = [pr ]n (−1) (42) p k! p=r for r ≤ k ≤ rn (and Sr,r (n, k) = 0 otherwise) and that the recursion relation is given written in a form particularly suited for our purpose - by 0 r (k + l)! Sr,r (n + 1, k) = (43) Sr,r (n, k + l) −l (k − r)! l=−r (the recursion relation for the general case Sr,s (n, k) can be found in [17]). Furthermore, these Stirling numbers of rank r were also shown to be connection coeﬃcients, i.e., rn r n [x ] = Sr,r (n, k)xk . (44) k=r

M. Schork / Central European Journal of Mathematics

Now, we want to show that the Sr,r (n, k) lead to a duality triad of rank r which generalizes Example 2.2 in a beautiful way. Let us, therefore, deﬁne the (2r + 1) sequences μ(l) by ⎧ ⎪ ⎨ 0 if 0 < l ≤ r, (l) μk := (45) ⎪ ⎩ r k r+l if −r ≤ l ≤ 0. −l (−r)

Note that μk = 1 = 0. Furthermore, let pr (x) := xr and Ψk (x) := xk for k = 0, . . . , r−1. The coeﬃcients cn,k satisfy c0,k = δ0,k as well as the recursion relation (16) which is given with the above choice (45) by cn+1,k

0 0 r r (k + l)! r+l (k + l) cn,k+l = = cn,k+l . −l −l (k − r)! l=−r l=−r

(46)

Comparing this with (43) shows that the cn,k are given by the Stirling numbers of rank r, i.e., cn,k = Sr,r (n, k). Furthermore, comparing the inversion relation (18) with (44) (and recalling pr (x) = xr ) shows that the triad polynomials are given by Φk (x) = xk . It remains to be shown that these triad polynomials satisfy the dual recursion relation (17) (−l) which is in this case given by xr Φn (x) = rl=−r μn Φn+l (x), or, more explicitly, by r n

xx =

r r l=0

l

nr−l xn+l .

(47)

We will not check this explicitly but just remark that the identity xn+1 = xn (x − n) (and its iterations xn+l = xn lj=1 (x − (n + l) + j)) is particularly helpful. Let us summarize the above observations in the following theorem. Theorem 5.7. (Stirling triad of rank r) For a given r ≥ 1 let pr (x) := xr , Ψk (x) := xk for k = 0, . . . , r − 1 and the (2r + 1) sequences μ(l) deﬁned by (45) be given. Then the associated duality triad of rank r consists of the coeﬃcients cn,k ≡ Sr,r (n, k) of (42) and the triad polynomials Φk (x) ≡ xk satisfying the recursion relation (43), the dual recursion relation (47) and the inversion relation (44). Choosing r = 1 yields the Stirling triad treated in Example 2.2. Remark 5.8. Before we discuss the next example we want to give some comments concerning the q-deformed situation. As already mentioned at the end of Example 2.3 the inversion relation (9) does not look natural while the much more natural looking equation (10) is not an example of the inversion relation (3) since the occurring functions are polynomials in xq but not in x. To remedy this situation one has the following way out: One should make the domain of the “variable” explicit in Deﬁnition 3.2. This means that one should call the “duality triad of rank r” according to this deﬁnition more precisely a “duality triad of rank r over C[x]”. In the q-deformed situation one should replace the ring C[x] by C[xq ] where xq is the new variable. Thus, one should replace PC from above by PCq := {Ψ(xq ) = (Ψ0 (xq ), Ψ1 (xq ), . . .) | Ψk (xq ) = kj=0 akj xjq , akj ∈ C, akk = 0} and

M. Schork / Central European Journal of Mathematics

the bilinear pairing ·|· : PC × C∞ the corresponding q-deformed version f in → C[x] by q ∞ ·|· q : PC × Cf in → C[xq ] given by Ψ(xq )|c q := ∞ k=0 ck Ψk (xq ). Then one should give a deﬁnition of “duality triad of rank r over C[xq ]” in complete analogy to Deﬁnition 3.2 but where now the variable xq replaces x (and pr (xq ) is a polynomial of degree r in xq , etc.). Since all the manipulations are only algebraic (no considerations of convergence, limits, etc.) one is tempted to consider even more general situations where more or less arbitrary “variables” instead of x or xq are allowed. However, we will not do this but consider the q-deformed situation in a more formal way, mimicking as close as possible the undeformed case. Example 5.9. (q-deformed Stirling triad of rank r) In this example we consider the q-deformed version of Example 5.6. It is the common generalization of Example 5.6 (to the case where q = 1) and Example 2.3 (to the case r ≥ 1). Let us ﬁrst recall some properties of the q-deformed Stirling numbers of rank r Sr,r (n, k|q); these generalized Stirling numbers were introduced in [17] and all the properties mentioned can be found therein. First of all, they are explicitly given by k k−p (−1)k p ( 2 ) k Sr,r (n, k|q) = (−1) q (pr )n p q q kq ! p=r

(48)

(where r ≤ k ≤ rn) and are the obvious common generalization of (42) and (7). They satisfy the recursion relation Sr,r (n + 1, k|q) =

0

q

−l(k−r)

l=−r

r −l

(k + l)q ! Sr,r (n, k + l|q) q (k − r)q !

(49)

(which is the common generalization of (43) and (8)) and are also connection coeﬃcients [xrq ]n

=

rn

Sr,r (n, k|q)xkq

(50)

k=r

(which is the common generalization of (44) and (10)). Since the procedure is very similar to the one given in Example 5.6 we will be brief. The decisive step is to deﬁne the sequences μ(l) correctly. In analogy to (45) we deﬁne ⎧ ⎪ ⎨ 0 if 0 < l ≤ r, (l) (51) μk := ⎪ ⎩ q −l(k−l−r) r kqr+l if −r ≤ l ≤ 0. −l q

Since (l) μk+l

=q

−l(k−r)

r −l

(k +

q

r+l l)q

=q

−l(k−r)

r −l

(k + l)q ! q (k − r)q !

(52)

one sees immediately that the recursion relation (16) for the cn,k has the same form as (49) so that the coeﬃcients cn,k are given by the q-deformed Stirling numbers of rank r, i.e., cn,k = Sr,r (n, k|q). The inversion relation (50) shows that one has to choose pr (xq ) := xrq

M. Schork / Central European Journal of Mathematics

and that the triad polynomials are given by Φk (xq ) := xkq . The dual recursion relation (17) for the triad polynomials is in this case given by r r−l n+l r n l(n+l−r) r xq x q = q nq x q . (53) l q l=0 As above, we will not check this identity but immediately summarize the above observations in the following theorem. Theorem 5.10. (q-deformed Stirling triad of rank r) For a given r ≥ 1 let pr (xq ) := xrq , Ψk (xq ) := xkq for k = 0, . . . , r−1 and the (2r+1) sequences μ(l) deﬁned by (51) be given. Then the associated duality triad of rank r consists of the coeﬃcients cn,k ≡ Sr,r (n, k|q) of (48) and the triad polynomials Φk (xq ) ≡ xkq satisfying the recursion relation (49), the dual recursion relation (53) and the inversion relation (50). Choosing q = 1 yields the Stirling triad of rank r treated in Example 5.6 while choosing r = 1 yields the q-deformed Stirling triad treated in Example 2.3. In (54) the connections between the sequences μ(l) (with −r ≤ l ≤ 0) of the various (generalized) Stirling triads are sketched. In the upper left corner one has the usual (l) Stirling triad with μk = k 1+l (see Remark 3.1). The arrow down means going to the q-deformed situation so that one has in the lower left corner the q-deformed Stirling triad (see Remark 3.1). The arrow to the right means going from r = 1 to arbitrary r ≥ 1. Therefore, one has in the upper right corner the Stirling triad of rank r (Example 5.6) and in the lower right corner the q-deformed Stirling triad of rank r (Example 5.9). k 1+l ⏐ ⏐ ⏐ ⏐

−→

q −lk kq1+l −→ q

r r+l k −l ⏐ ⏐ ⏐ ⏐ −l(k−l−r) r

(54)

r+l k . −l q q

Recall that in the rank one case it is possible to construct duality triads associated to various extensions of the conventional Stirling numbers (of which the q-deformed versions are only a particular example), see Example 2.3. Here one should have in mind the Stirling numbers S(n, k|ψ) of ψ-extended umbral calculus [13]. It seems to be interesting to ﬁnd out whether there exists a natural extension of this ψ-extended Stirling triad to higher rank r ≥ 1.

6

Conclusions

In this paper we have discussed some properties of duality triads of rank r ≥ 1, in particular the connection to orthogonal polynomials. It was shown that a Hermitian duality triad of rank r gives rise to a sequence of orthogonal matrix polynomials (with matrices of size r × r), generalizing the scalar rank one case. Conversely, given a (2r + 1)-term recurrence

M. Schork / Central European Journal of Mathematics

relation for a sequence of polynomials, one obtains an associated duality triad of rank r by choosing the sequences μ(l) deﬁning the duality triad appropriately. This shows that one may associate a duality triad of higher rank to sequences of polynomials which satisfy a higher order recurrence relation. As examples for such sequences discrete Sobolev orthogonal polynomials and orthogonal polynomials on algebraic curves were mentioned. Some concrete examples of duality triads of higher rank were discussed explicitly. In particular, it was shown that the generalized Stirling numbers of higher rank give rise to a duality triad of higher rank. Its q-deformed analogue was also discussed in a slightly formal way and it was stressed that one could make this discussion completely rigorous by extending the notion of duality triad (of higher rank) slightly. Another example of a duality triad of higher rank associated to the polynomial coeﬃcients was discussed and it was shown that this represents a natural generalization of the Pascal triad. Turning to the relation between duality triads and dynamical data structures, it was shown that the deﬁning recursion relation of a duality triad of rank r can be interpreted in terms of histories of a ﬁle where up to r items can be inserted or deleted simultaneously (i.e., in one time step). Thus, one may hope that duality triads of higher rank might prove to be useful for the study of dynamical data structures where parallel access is allowed.

References [1] G.E. Andrews: The Theory of Partitions, Addison Wesley, Reading, 1976. [2] P. Blasiak, K.A. Penson and A.I. Solomon: “The Boson Normal Ordering Problem and Generalized Bell Numbers”, Ann. Comb., Vol. 7, (2003), pp. 127–139. [3] P. Blasiak, K.A. Penson and A.I. Solomon: “The general boson normal ordering problem”, Phys. Lett. A, Vol. 309, (2003), pp. 198–205. [4] E. Borak: “A note on special duality triads and their operator valued counterparts”, Preprint arXiv:math.CO/0411041. [5] T.S. Chihara: An Introduction to Orthogonal Polynomials, Gordon & Breach, New York, 1978. [6] L. Comtet: Advanced Combinatorics, Reidel, Dordrecht, 1974. [7] A.J. Duran and W. Van Assche: “Orthogonal matrix polynomials and higher order recurrence relations”, Linear Algebra Appl., Vol. 219, (1995), pp. 261–280. [8] P. Feinsilver and R. Schott: Algebraic structures and operator calculus. Vol. II: Special functions and computer science, Kluwer Academic Publishers, Dordrecht, 1994. [9] I. Jaroszewski and A.K. Kwa´sniewski: “On the principal recurrence of data structures organization and orthogonal polynomials”, Integral Transforms Spec. Funct., Vol. 11, (2001), pp. 1–12. [10] A.K. Kwa´sniewski: “On duality triads”, Bull. Soc. Sci. Lettres L ´od´z, Vol. A 53, Ser. Rech. D´eform. 42, (2003), pp. 11–25. [11] A.K. Kwa´sniewski: “On Fibonomial and other triangles versus duality triads”, Bull. Soc. Sci. Lettres L ´od´z, Vol. A 53, Ser. Rech. D´eform. 42, (2003), pp. 27–37. [12] A.K. Kwa´sniewski: “Fibonomial Cumulative Connection Constants”, Bulletin of the

M. Schork / Central European Journal of Mathematics

[13] [14] [15] [16] [17] [18] [19] [20] [21] [22] [23]

ICA, Vol. 44, (2005), pp. 81–92. A.K. Kwa´sniewski: “On umbral extensions of Stirling numbers and Dobinski-like formulas”, Adv. Stud. Contemp. Math., Vol. 12, (2006), pp. 73–100. F. Marcell´an and A. Ronveaux: “On a class of polynomials orthogonal with respect to a discrete Sobolev inner product”, Indag. Math., Vol. 1, (1990), pp. 451–464. F. Marcell´an and G. Sansigre: “On a Class of Matrix Orthogonal Polynomials on the Real Line”, Linear Algebra Appl., Vol. 181, (1993), pp. 97–109. J. Riordan: Combinatorial Identities, Wiley, New York, 1968. M. Schork: “On the combinatorics of normal-ordering bosonic operators and deformations of it”, J. Phys. A: Math. Gen., Vol. 36, (2003), pp. 4651–4665. M. Schork: “Some remarks on duality triads”, Adv. Stud. Contemp. Math., Vol. 12, (2006), pp. 101–110. M. Schork: “On a generalization of duality triads”, Cent. Eur. J. Math., Vol 4(2), (2006), pp. 304–318. A. Sinap and W. Van Assche: “Orthogonal matrix polynomials and applications”, J. Comput. Appl. Math., Vol. 66, (1996), pp. 27–52. R.P. Stanley: Enumerative Combinatorics, Vol. 2, Cambridge University Press, Cambridge, 1999. G. Szeg¨o: Orthogonal Polynomials, American Mathematical Society, 1948. V. Totik: “Orthogonal Polynomials”, Surv. Approximation Theory, Vol. 1, (2005), pp. 70–125.

DOI: 10.2478/s11533-006-0012-y Research article CEJM 4(3) 2006 525–530

Remarks on aﬃne complete distributive lattices Dominic van der Zypen∗ Allianz Suisse, CH-3001 Berne, Switzerland

Received 20 September 2005; accepted 14 MArch 2006 Abstract: We characterise the Priestley spaces corresponding to aﬃne complete bounded distributive lattices. Moreover we prove that the class of aﬃne complete bounded distributive lattices is closed under products and free products. We show that every (not necessarily bounded) distributive lattice can be embedded in an aﬃne complete one and that Q ∩ [0, 1] is initial in the class of aﬃne complete lattices. c Versita Warsaw and Springer-Verlag Berlin Heidelberg. All rights reserved. Keywords: Distributive lattice, aﬃne complete, Priestley spaces MSC (2000): 06D50, 06D99

1

Aﬃne complete lattices

A k-ary function f on a bounded distributive lattice L is called compatible if for any congruence θ on L and (ai , bi ) ∈ θ, (i = 1, ..., k) we always have (f (a1 , ..., ak ), f (b1 , ....bk )) ∈ θ. It is easy to see that the projections pri : Lk → L are compatible. With induction on polynomial complexity one shows that every polynomial function is compatible (see [4]). A lattice L is called aﬃne complete, if conversely every compatible function on L is a polynomial and if it is bounded and distributive. G. Gr¨atzer [2] gave an intrinsic characterization of bounded distributive lattices that are aﬃne complete: Theorem 1.1. [2] A bounded distributive lattice is aﬃne complete if and only if it does not contain a proper interval that is a Boolean lattice in the induced order. Note that in particular, no ﬁnite bounded distributive lattice L is aﬃne complete: ∗

E-mail: [email protected]

526

D. van der Zypen / Central European Journal of Mathematics 4(3) 2006 525–530

Let x ∈ L be an element distinct from 1. Then x has an upper neighbor, ie, there exists y ∈ L such that [x, y] = {x, y} which is isomorphic to the 2-element Boolean lattice. Example 1.2. The bounded distributive lattices [0, 1] and [0, 1]×[0, 1] are aﬃne complete. NOTE: From now on, all lattices considered are assumed to be bounded and distributive, unless otherwise stated.

2

Priestley duality

In [5], Priestley proved that the category D of bounded distributive lattices with (0, 1)preserving lattice homomorphisms and the category P of compact totally order-disconnected spaces (henceforth referred to as Priestley spaces) with order-preserving continuous maps are dually equivalent. A compact totally order-disconnected space (X; τ, ) is a poset (X; ) endowed with a compact topology τ such that, for x, y ∈ X, whenever x y, then there exists a clopen decreasing set U such that x ∈ U and y ∈ U . (A decreasing set or a down-set is a subset D of a partially ordered set P such that x ≤ y in P and y ∈ D imply x ∈ D.) The functor D : D → P assigns to each object L of D a Priestley space (D(L); τ (L), ⊆), where D(L) is the set of all prime ideals of L and τ (L) is a suitably deﬁned topology (the details of which will not be required here). The functor E : P → D assigns to each Priestley space X the lattice (E(X); ∪, ∩, ∅, X), where E(X) is the set of all clopen decreasing sets of X. Priestley duality therefore provides us with a“dictionary”between the world of bounded distributive lattices and a certain category of ordered topological spaces. This is interesting in particular because free products of lattices are “translated” into products of Priestley spaces. We will use this fact for showing that the class of aﬃne complete bounded distributive lattices is closed under free products.

3

Aﬃne complete Priestley spaces

The aim of this section is to characterize the Priestley spaces corresponding to aﬃne complete distributive (0,1)-lattices. Such spaces will be called aﬃne complete Priestley spaces. In other words, a Priestley space X is aﬃne complete iﬀ E(X) is aﬃne complete. The following theorem provides a rather straightforward translation of the algebraic concept of aﬃne completeness in order-topological terms. Theorem 3.1. Let X be a Priestley space. Then the following statements are equivalent: (1) E(X) is aﬃne complete. (2) If U ⊆ V are clopen down-sets and U = V , then the subposet V \ U of X is not an antichain, i.e. V \ U contains a pair of distinct comparable elements. Proof. (1) =⇒ (2). Suppose V \U is an antichain. Let C ∈ [U, V ] ⊆ E(X). Take

D. van der Zypen / Central European Journal of Mathematics 4(3) 2006 525–530

527

C = U ∪ (V \C). Claim: C is a clopen down-set of X. It is clear that C is a clopen subset of X since V \ C = V ∩ (X \ C). Now, let c ∈ C and assume x < c. Then if c ∈ U , we are done, since U is a down-set. Assume c ∈ V \ U . Since V is a down-set, we get x ∈ V , and the fact that V \ U is an antichain tells us that x cannot be a member of V \ U . Therefore x ∈ U ⊆ C which proves that C is indeed a (clopen) down-set. Moreover, C is the complement of C in [U, V ], i.e. C ∩ C = U and C ∪ C = V . Because C was arbitrary, we see that [U, V ] is a proper Boolean interval of E(X), whence E(X) is not aﬃne complete. (2) =⇒ (1). Suppose U ⊆ V are distinct clopen down-sets. By assumption, there are elements x, y ∈ V \U such that x < y. There is a clopen down-set A with x ∈ A and y∈ / A. Consider B = (A ∩ V ) ∪ U . So B ∈ [U, V ] and y ∈ / B. Now we show that B has no complement in [U, V ]: Take any C ∈ [U, V ] with C ∪ B = V . Then y ∈ C, but since C is a down-set, we have x ∈ C, thus x ∈ (B ∩ C)\U and B ∩ C = U . So whatever C we pick, C is no complement for B, i.e. B is not complemented, and consequently [U, V ] is not Boolean. It follows that no proper interval of E(X) is Boolean. We can formulate the above result in a more concise way: Corollary 3.2. A Priestley space X is aﬃne complete if and only if each nonempty open set contains two distinct comparable points. Proof. It follows directly from theorem 3.1 that if each nonempty open set contains two distinct points that are comparable, then X is aﬃne complete. Conversely, suppose that U is a nonempty open set which is an antichain. Recall that the clopen decreasing sets and their complements (clopen increasing sets) form a base of any Priestley space. So there exist open down-sets C1 , C2 such that ∅ = C1 ∩(X\C2 ) ⊆ U . Then [C1 ∩C2 , C1 ] is a proper interval such that C1 \(C1 ∩C2 ) = C1 ∩(X\C2 ) is an antichain (as a subset of the antichain U ). Thus theorem 3.1 implies that X is not aﬃne complete. Note that the proof works exactly the same way if each occurrence of “open” is replaced by “clopen” (basically because each Priestley space is zero-dimensional). So we can state as well: A Priestley space X is aﬃne complete if and only if each nonempty clopen set contains two distinct comparable points.

4

Products of aﬃne complete lattices

We prove in this section that arbitrary products of aﬃne complete lattices are aﬃne complete. We don’t need Priestley duality to do this. Priestley duals of aﬃne com-

528

D. van der Zypen / Central European Journal of Mathematics 4(3) 2006 525–530

plete lattices, i.e. aﬃne complete Priestley spaces, will come into play when we consider coproducts of aﬃne complete lattices. Theorem 4.1. If (Li )i∈I is a family of (bounded distributive) aﬃne complete lattices, then Πi∈I Li is aﬃne complete. Proof. We prove the contrapositive of the theorem. Suppose that Πi∈I Li is not aﬃne complete. Then it contains a proper interval [ξ, η] that is Boolean. There exists some k ∈ I such that ξ(k) < η(k). We claim that [ξ(k), η(k)] ⊆ Lk is a Boolean interval. Set x = ξ(k), y = η(k). Suppose l ∈ [x, y] and deﬁne λ ∈ Πi∈I Li by λ(k) = l and λ(i) = ξ(i) if i = k Because [ξ, η] is Boolean, there exists λ ∈ Πi∈I Li such that λ ∧ λ = ξ and λ ∨ λ = η. Thus it is easy to see that l := λ (k) is the complement of l ∈ [x, y]. Therefore, [x, y] is a proper Boolean interval of Lk and whence Lk is not aﬃne complete. Example 4.2. Theorem 4.1 implies that [0, 1]N is aﬃne complete.

5

Free products of aﬃne complete lattices

Now we turn our attention to free products of aﬃne complete bounded distributive lattices; we prove they are complete. A convenient way to obtain this result is to dualise the problem into the category of Priestley spaces. Recall that free products in D are categorically speaking coproducts in D. Since Priestley duality is a pair of contravariant functors, coproducts correspond to products in P and vice versa; this is stated in the following proposition in a more general way. Proposition 5.1. [3] Let A and B be categories, and assume that F : A → B and G : B → A are contravariant functors that form a dual equivalence. Then: (1) If A is a product of a family of objects (Ai )i∈I of A, then F(A) is a coproduct of (F(Ai ))i∈I . (2) If A is a coproduct of a family of objects (Ai )i∈I of A, then F(A) is a product of (F(Ai ))i∈I . We recall that a Priestley space was shown to be aﬃne complete if and only if each non-empty open subset contains two distinct comparable points. Theorem 5.2. If (Xi )i∈I is a family of aﬃne complete Priestley spaces, then Πi∈I Xi is aﬃne complete.

D. van der Zypen / Central European Journal of Mathematics 4(3) 2006 525–530

529

Proof. Suppose that Xi is aﬃne complete for every i ∈ I. It suﬃces to show that every nonempty subset V of Πi∈I Xi of the form (U1 ) ∩ ... ∩ πi−1 (Ur ) V = πi−1 r 1 contains two distinct comparable elements (where Uk ⊆ Xik open, nonempty). Take U1 . It contains elements a < b, because Xi1 is aﬃne complete. Now pick ξ ∈ V . Deﬁne ξ1 , ξ2 ∈ V by ξ1 (i1 ) = a and ξ1 (i) = ξ(i) if i = i1 and ξ2 (i1 ) = b and ξ2 (i) = ξ(i) if i = i1 . Clearly, ξ1 , ξ2 are distinct comparable elements of V .

Applying the Priestley duality now yields: Corollary 5.3. The class of (bounded distributive) aﬃne complete lattices is closed under free products.

6

Embedding lattices in aﬃne complete lattices

First we will stay away from aﬃne completeness in the worst possible way: we will embed each L into a powerset of some set, which, being Boolean, is as aﬃne incomplete as it gets. The following fact is well-known: Lemma 6.1. Let L be a distributive lattice (L need not be bounded). There is a set X and a lattice embedding j : L → P(X) where P(X) is the powerset of the set X. Next, we will embed that powerset in an aﬃne complete lattice. Lemma 6.2. Let X be a set and let Q = {q ∈ Q; 0 q 1}. Then there is a lattice embedding j : P(X) → QX . Moreover, Q is aﬃne complete. Proof. Set j : S → χS ∈ QX for every S ⊆ X, where χS is deﬁned by / S. χS (x) = 1 if x ∈ S and χS (x) = 0 if x ∈ It is easy to see that j is a lattice embedding. Next, we claim that Q is aﬃne complete. ∈ [x, y] has no complement a in [x, y]: Take any x < y in Q. Then the element a = x+y 2 Otherwise we would have a ∧ a = x which would imply a = x, but then a ∨ a = a = y.

530

D. van der Zypen / Central European Journal of Mathematics 4(3) 2006 525–530

So [x, y] is not Boolean, whence Q has no proper Boolean interval. Therefore, Q is aﬃne complete. Moreover, by 4.1, QX is aﬃne complete which concludes the proof. Lemmas 6.1 and 6.2 now imply: Corollary 6.3. Every distributive lattice (not necessarily bounded) can be embedded in a bounded aﬃne complete lattice. Admittedly, the construction provided by 6.1 and 6.2 is highly non-unique and has no minimality properties.

References [1] B.A. Davey and H.A. Priestley: Lattices and Order, Cambridge University Press, 1990. [2] G. Gr¨atzer: “Boolean functions on distributive lattices”, Acta Math. Acad. Sci. Hung., Vol. 15, (1964), pp. 195–201. [3] S. MacLane: Categories for the working mathematician, 2nd ed., Springer Verlag, (1998). [4] M. Ploˇsˇcica: “Aﬃne Complete Distributive Lattices”, Order, Vol. 11, (1994), pp. 385–390. [5] H.A. Priestley: “Representation of distributive lattices by means of ordered Stone spaces”, Bull. London Math. Soc., Vol. 2, (1970), pp. 186–190. [6] H.A. Priestley: “Ordered topological spaces and the representation of distributive lattices”, Proc. London Math. Soc., Vol. 3(24), (1972), pp. 507–530. [7] D. van der Zypen: Aspects of Priestley Duality, Thesis (PhD), University of Bern, 2004.

DOI: 10.2478/s11533-006-0022-9 Research article CEJM 4(3) 2006 531–546

Cauchy, Ferrers-Jackson and Chebyshev polynomials and identities for the powers of elements of some conjugate recurrence sequences Roman Witula, Damian Slota∗ Institute of Mathematics, Silesian University of Technology, 44-100 Gliwice, Poland

Received 8 March 2006; accepted 2 June 2006 Abstract: In this paper some decompositions of Cauchy polynomials, Ferrers-Jackson polynomials and polynomials of the form x2n + y 2n , n ∈ N, are studied. These decompositions are used to generate the identities for powers of Fibonacci and Lucas numbers as well as for powers of the so called conjugate recurrence sequences. Also, some new identities for Chebyshev polynomials of the ﬁrst kind are presented here. c Versita Warsaw and Springer-Verlag Berlin Heidelberg. All rights reserved. Keywords: Cauchy polynomials, Ferrers-Jackson polynomials, Chebyshev polynomials, Fibonacci and Lucas numbers, recurrence sequences MSC (2000): 11B83, 26C99, 11B39

1

A brief exposition of the content of the paper

In Section 2 the following decompositions of Cauchy polynomials: pn (x, y) := (x + y)2n+1 − x2n+1 − y 2n+1 ,

n ∈ N,

as well as Ferrers-Jackson polynomials qn (x, y) := (x + y)2n + x2n + y 2n , ∗

[email protected]

n ∈ N,

532

Roman Witula, Damian Slota / Central European Journal of Mathematics 4(3) 2006 531–546

are discussed (see Lemma 2.2): 2k+1 2 2n + 1 n − k (x + x y + y 2 )n−1−3k x y (x + y) n − k 2k + 1

(n−1)/3

pn (x, y) =

k=0

and

n/3

qn (x, y) =

k=0

2k 2n n − k x y (x + y) (x2 + x y + y 2 )n−3k . n−k 2k

(1)

(2)

These decompositions are diﬀer from those already published – see especially [13]. Also, presented here proof of decompositions (1) and (2) by induction is based on a simple recurrence dependence between polynomials pn (x, y) and qn (x, y) seems to be new. In this paper, decompositions (1) and (2) are applied to generating the identities for powers of Fibonacci and Lucas numbers (some of them – the simplest ones – are identical with the identities discussed in [1, 6, 7]; also in [4] some of these simplest identities are presented). In Sections 4 and 5 the notion of pairs and triples of conjugate recurrence sequences is introduced. The identities for the powers of elements of such sequences are also derived. In the last section of the paper, the following decomposition of polynomials: x2n +y 2n , n ∈ N is discussed: n x2n + y 2n = ωr,n · (x y)r · (x2 + x y + y 2 )n−r . (3) r=0

The combinatorial and analytical descriptions of coeﬃcients ωr,n are provided, in particular the analytical description where Chebyshev polynomials of the ﬁrst kind are used. Formula (3) is applied to generate some identities for the powers of Fibonacci and Lucas numbers. Also, some new combinatorial identities for the Chebyshev polynomials of the ﬁrst kind are presented. Publications concerning the identities of the sums of the powers of elements of recurrence sequences, (especially second order), are fairly recent [1–3, 5–11, 14]. The results presented in the publications generally indicate two directions: generating formulas with a bounded number of elements (our paper falls into this category) and generating formulas with an increasing number of elements [6–9]. Some papers are focused on formulas for generating functions of sequences of the powers of elements of recurrence sequences [3, 5, 14], as well as generating the identities by means of such functions. Other papers [10, 11] discuss generalized forms of certain known identities (especially for Fibonacci and Lucas numbers).

2

Decompositions of Cauchy and Ferrers-Jackson polynomials

Lemma 2.1. The following recurrence relations are satisﬁed: pn+1 (x, y) = u pn (x, y) + t qn (x, y)

(4)

qn+1 (x, y) = u qn (x, y) + t pn−1 (x, y),

(5)

and

Roman Witula, Damian Slota / Central European Journal of Mathematics 4(3) 2006 531–546

533

for every n = 1, 2, . . ., where p0 (x, y) ≡ 0, t := x y (x + y) and u := x2 + x y + y 2 . For example, we have 2 2 q2 (x, y) = q1 (x, y) , 2 q3 (x, y) = q1 (x, y) q2 (x, y) + 6 t2 , p1 (x, y)q1 (x, y) = 6 t u, etc. Proof. Only the proof of relation (4) is given here: q1 (x, y) pn (x, y) = pn+1 (x, y) + (x2 + y 2 )(x + y)2n+1 − − (x + y)2 + y 2 x2n+1 − (x + y)2 + x2 y 2n+1 = 2 pn+1 (x, y) − 2 x y (x + y)2n+1 − − (2 x y + 2 y 2 ) x2n+1 − (2 x y + 2 x2 ) y 2n+1 = 2 pn+1 (x, y) − 2 t qn (x, y). Lemma 2.2. The following identities hold 2k+1 2 2n + 1 n − k pn (x, y) = (x + x y + y 2 )n−1−3k x y (x + y) n − k 2k + 1 k=0 (n−1)/3 2n + 1 n − k t2k+1 un−1−3k = n − k 2k + 1 k=0 (n−1)/3

:= Vn (t, u),

n = 1, 2, . . .

(6)

2k 2n n − k qn (x, y) = x y (x + y) (x2 + x y + y 2 )n−3k n−k 2k k=0 n/3 2n n − k t2k un−3k = n−k 2k k=0 n/3

:= Wn (t, u),

n = 1, 2, . . .

where t := x y (x + y) and u := x2 + x y + y 2 .

(7)

534

Roman Witula, Damian Slota / Central European Journal of Mathematics 4(3) 2006 531–546

Proof. The proof of Lemma (2.2) is followed by induction on n and by applied Lemma (2.1). For example, we have

2n + 1 n − k 2n n − k + t2k+1 un−3k + n − k 2k + 1 n − k 2k k=0 2n n − k 2k+1 n−3k + u = t n−k 2k k=n/3

(n−1)/3

Vn+1 = u Vn + t Wn =

(only if 3|n)

(n−1)/3

=

k=0

(n−1)/3

=

k=0

n/3

=

k=0

(n − k)!(2n + 3) 2k+1 n−3k t u + (2k + 1)!(n − 3k)!

3t2k+1 un−3k =

k=n/3 (only if 3|n)

2n + 3 n − k + 1 2k+1 n−3k u + t n − k + 1 2k + 1 2n + 3 n − k + 1 2k+1 n−3k + u = t n − k + 1 2k + 1 k=n/3

(only if 3|n)

2n + 3 n − k + 1 2k+1 n−3k u . t n − k + 1 2k + 1

The ﬁrst six polynomials Vn (t, u) and Wn (t, u) are presented below: n Vn (t, u)

Wn (t, u)

1

3t

2u

2

5tu

2 u2

3

7 t u2

2 u3 + 3 t2

4

3 t (3 u3 + t2 )

2 u4 + 8 t2 u

5

11 t u (u3 + t2 )

2 u5 + 15 t2 u2

6

13 t u2 (u3 + 2 t2 ) 2 u6 + 24 t2 u3 + 3 t4

Remark 2.3. If n ∈ N and t, u ∈ Z then the following relations hold true: i) if 2n + 1 is a prime number, then (2n + 1) | Vn (t, u); ii) n ≡ 1 (mod 3) =⇒ t | Vn (t, u) and u | Wn (t, u); iii) n ≡ 2 (mod 3) =⇒ t u | Vn (t, u) and u2 | Wn (t, u); iv) n ≡ 0 (mod 3) =⇒ t u2 | Vn (t, u).

Roman Witula, Damian Slota / Central European Journal of Mathematics 4(3) 2006 531–546

3

535

Identities for the powers of Fibonacci and Lucas numbers

In this section, the polynomials Vn (t, u) and Wn (t, u) determined in Section 2 are used to generate two separate sets of identities for the powers of Fibonacci and Lucas numbers. Below we presented special cases of the following identities: pk (x, y) = Vn (t, u),

k∈N

(8)

qk (x, y) = Wn (t, u),

k ∈ N,

(9)

and for the values x, y which are Fibonacci and Lucas numbers with appropriately selected indices. For example, we obtain the following formulas: 11 11 n 3 Ln+2 − L11 × n − Ln+4 = 33 Ln Ln+2 Ln+4 8 L2n+4 + 11 (−1) 2

n 3 × 8 L2n+4 + 11 (−1) + 3 Ln Ln+2 Ln+4 , and 13 13 13 Fn+2m − F2m Fn+1 − F2m−1 Fn =

2 2 = 13 F2m−1 F2m Fn Fn+1 Fn+2m Fn+2m + F2m−1 F2m Fn Fn+1 × 3 2

2 + F2m−1 F2m Fn Fn+1 + 2 F2m−1 F2m Fn Fn+1 Fn+2m . × Fn+2m

We will also consider three variations of (8) for k = 7: p7 (x, y) − 3 t5 = 5 t u3 3 u3 + 10 t2 , p7 (x, y) − 36 u3 t3 = t 3 u3 + t2 5 u3 + 3 t2 , p7 (x, y) − t3 5 u3 + 3 t2 = 15 t u3 u3 + 3 t2 . The above mentioned identities, are discussed for the following values of arguments (three nontrivial examples are given below): a) for x = F2m Ln+4m+1 and y = F2m+1 Ln (n, m ∈ N0 ): t = F4m+1 F2m F2m+1 Ln Ln+2m Ln+4m+1 , x + y = F4m+1 Ln+2m ,

2 u = F4m+1 Ln+2m − F2m F2m+1 Ln Ln+4m+1 2 2 = F4m+1 L2n+4m − F2m F2m+1 L2n+4m+1 + 2 F4m+1 − F2m F2m+1 L4m (−1)n ; b) for x = Fn and y = Fn+4m+2 (n, m ∈ N0 ): x + y = F2m+1 Ln+2m+1 , t = F2m+1 Fn Ln+2m+1 Fn+4m+2 , 2 u = F2m+1 Ln+2m+1 −Fn Fn+4m+2 = 15 ((L4m+2 −3)L2n+4m+2 + (3L4m+2 −4)(−1)n ); and for x = Ln and y = Ln+4m+2 (n, m ∈ N0 ): x + y = 5 F2m+1 Fn+2m+1 , t = 5 F2m+1 Fn+2m+1 Ln Ln+4m+2 , 2 u = 5F2m+1 Fn+2m+1 − Ln Ln+4m+2 = (L4m+2 + 1)L2n+4m+2 − (3L4m+2 + 4)(−1)n ;

536

Roman Witula, Damian Slota / Central European Journal of Mathematics 4(3) 2006 531–546

c) for x = −F2m−1 Fn and y = Fn+2m (or x = −F2m−1 Fn , y = −F−2m Fn+1 , n, m ∈ N0 ): t = −F2m−1 F2m Fn Fn+1 Fn+2m , x + y = F2m Fn+1 , 2 2 u = F2m Fn+1 + F2m−1 Fn Fn+2m = Fn+2m − F2m−1 F2m Fn Fn+1 ; and for x = −F2m−1 Ln and y = Ln+2m (or x = −F2m−1 Ln and y = −F2m Ln+1 , n, m ∈ N0 ): t = −F2m−1 F2m Ln Ln+1 Ln+2m , x + y = F2m Ln+1 , 2 u = F2m Ln+1 + F2m−1 Ln Ln+2m = L2n+2m − F2m−1 F2m Ln Ln+1 .

4

Identities for the powers of elements of conjugate recurrence sequences

The identities of the Fibonacci and Lucas numbers discussed in the previous Section may be generalized to certain pairs of recurrence sequences {xn } and {yn } satisfying the same recurrence equation, yet with diﬀerent initial conditions, i.e. the sequences described in the following lemma: Lemma 4.1. Let a, b, c, d, x1 , x2 , y1 , y2 ∈ C, a b c = 0. We assume that the following identities hold xn+2 = a xn+1 + b xn , yn+2 = a yn+1 + b yn (10) xn+2 + c xn = yn+1

(11)

yn+2 + c yn = d xn+1 .

(12)

and Moreover, we suppose that the following condition is satisﬁed: if A, B ∈ C and A xn+1 + B xn = 0 for suﬃciently large n ∈ N then A = B = 0. Then we have either c = b and d = a2 + 4 b or yn+1 ≡ a xn+1 . Proof. In turn, from (12), (11) and (10), we have: xn+3 + 2 c xn+1 + c2 xn−1 = d xn+1 , a xn+2 + (b + 2 c − d) xn+1 + c2 xn−1 = 0, (a2 + b + 2 c − d) xn+1 + a b xn + c2 xn−1 = 0, Hence, we obtain:

⎧ ⎪ ⎨ a b = −a (a2 + b + 2 c − d) ⎪ ⎩ c2 = −b (a2 + b + 2 c − d),

Roman Witula, Damian Slota / Central European Journal of Mathematics 4(3) 2006 531–546

or equivalently:

537

⎧ ⎪ ⎪ a2 + b + 2 c − d = 0 ⎪ ⎪ ⎨ b2 = c2 ⇔ c = ±b ⎪ ⎪ ⎪ ⎪ ⎩ d = a2 + 2 b + 2 c.

If c = −b then yn+1 = a xn+1 , n ∈ N. If c = b then d = a2 + 4 b.

Now, let us suppose that the elements of recurrence sequences {xn } and {yn } satisfy the conditions (10)–(12) for c = b and d = a2 + 4 b. Then the following eight identities could be derived (more precisely 3 × 8 = 24 new identities if we count the additional identities for t and u): a) for x = xn+2 and y = b xn : 2k+1 2k+1 yn+1 − xn+2 − (b xn )2k+1 = Vk (t, u), 2 where t = b xn+2 yn+1 xn and u = yn+1 − b xn xn+2 ; b) for x = yn+2 and y = b yn : 2 2k+1 2k+1 (a + 4b) xn+1 − yn+2 − (b yn )2k+1 = Vk (t, u),

where t = b (a2 + 4 b) yn xn+1 yn+2 and u = (a2 + 4 b)2 x2n+1 − b yn yn+2 ; c) for x = (a2 + 4 b) xn and y = a yn : (2 yn+1 )2k+1 − (a yn )2k+1 − ((a2 + 4 b) xn )2k+1 = Vk (t, u), where t = 2 a (a2 + 4 b) xn yn yn+1 and u = (2 yn+1 )2 − a (a2 + 4 b) xn yn ; d) for x = a xn and y = yn : (2 xn+1 )2k+1 − (a xn )2k+1 − yn2k+1 = Vk (t, u), where t = 2 a xn yn xn+1 and u = (2 xn+1 )2 − a xn yn ; e) for x = zn+3 , y = −a b zn and z ∈ {x, y}: 2k+1 ((a2 + b) zn+1 )2k+1 − zn+3 + (a b zn )2k+1 = Vk (t, u),

where t = −a b (a2 + b) zn zn+1 zn+3 and u = ((a2 + b) zn+1 )2 + a b zn zn+3 ; f) for x = a zn+3 , y = b2 zn and z ∈ {x, y}: ((a2 + b) zn+2 )2k+1 − (a zn+3 )2k+1 − (b2 zn )2k+1 = Vk (t, u), where t = a b2 (a2 + b) zn zn+2 zn+3 and u = ((a2 + b) zn+2 )2 − a b2 zn zn+3 ; g) for x = yn+1 and y = −a xn+1 : 2k+1 (2 b xn )2k+1 − yn+1 + (a xn+1 )2k+1 = Vk (t, u),

where t = −2 a b xn xn+1 yn+1 and u = (2 b xn )2 + a xn+1 yn+1 ; h) for x = (a2 + 4 b) xn+1 and y = −a yn+1 : (2 b yn )2k+1 − ((a2 + 4 b) xn+1 )2k+1 + (a yn+1 )2k+1 = Vk (t, u), where t = −2 a b (a2 + 4 b) yn xn+1 yn+1 and u = (2 b yn )2 + a (a2 + 4 b) xn+1 yn+1 .

538

5

Roman Witula, Damian Slota / Central European Journal of Mathematics 4(3) 2006 531–546

Some relationship for conjugate recurrence sequences of the third order

The following lemma is a generalization of Lemma 4.1 for recurrence sequences of the third order: Lemma 5.1. Let a, b, A, B, w0 , w1 , w2 ∈ C for every w ∈ {x, y, z}. Let us set wn+2 = awn+1 + bwn + wn−1

(13)

for every w ∈ {x, y, z} and n ∈ N and suppose that the following conjugate conditions are satisﬁed xn+2 + A xn = yn+1 (14) yn+2 + A yn = zn+1

(15)

zn+2 + A zn = B xn+1

(16)

for every n ∈ N. Then the natural condition: αxn+1 + βxn + γxn−1 = 0 for suﬃciently large n ∈ N ⇒ α = β = γ = 0 implies the following relations: a) B = (1 + a b)(1 + A3 ), b) if A3 = 1 then b2 = −3 A2 and a2 = −3 A, c) if A3 = 1 then b = (a2 + 3 A)/(A3 − 1) and (b2 − a) A3 + 3 A2 + a = 0 or, equivalently: −a A9 + 3 A8 + 3 a A6 + 3 A5 + 6 a2 A4 + (a4 − 3 a) A3 + 3 A2 + a = 0. Proof. Apply to equality (16) identities (15), (14) and (13) for w = x we obtain the identity (−b A3 + 3 A + a2 + b) xn+2 + ((1 + a b) (1 + A3 ) − B) xn+1 + ((b2 − a) A3 + 3 A2 + a) xn = 0 which implies

⎧ ⎪ ⎪ −b A3 + 3 A + a2 + b = 0 ⎪ ⎪ ⎨ B = (1 + a b) (1 + A3 ) ⎪ ⎪ ⎪ ⎪ ⎩ (b2 − a) A3 + 3 A2 + a = 0.

Roman Witula, Damian Slota / Central European Journal of Mathematics 4(3) 2006 531–546

539

Corollary 5.2. If elements of sequences {xn }, {yn } and {zn } satisfy conditions (13)– (16) then the following identities hold true (only three identities – generalizations of the identities from Section 3 are presented below): a) for x = xn+2 and y = A xn : 2k+1 2k+1 − xn+2 − (A xn )2k+1 = Vk (t, u), yn+1 2 − A xn xn+2 ; where t = A xn+2 yn+1 xn and u = yn+1 b) for x = yn+2 and y = A yn : 2k+1 2k+1 − yn+2 − (A yn )2k+1 = Vk (t, u), zn+1 2 where t = A yn xn+1 yn+2 and u = zn+1 − A yn yn+2 ; c) for x = zn+2 and y = A zn : 2k+1 − (A zn )2k+1 = Vk (t, u), (B xn+1 )2k+1 − zn+2

where t = A B zn xn+1 zn+2 and u = (B xn+1 )2 − A zn zn+2 .

6

Third basic identity

The following identity relates to decompositions (6) and (7) and shall be used to generate a set of the identities for the powers of Fibonacci and Lucas numbers. Lemma 6.1. The following identity hold x2n + y 2n =

n r=0

r 2n − l n − l 2n (xy)r (x2 + xy + y 2 )n−r . (−1)l 2n − l l r − l l=0

(17)

ωr,n

Remark 6.2. For small values r (= 1, 2, . . .) coeﬃcients ωr,n of decomposition of polynomial x2n + y 2n from Lemma 6.1 have the form: ω0,n = 1,

ω4,n =

ω1,n = −n,

1 ω2,n = n(n − 3), 2

1 n(n − 3)(n2 − 15n + 38), 24

1 ω3,n = − n(n − 2)(n − 7), 6

1 ω5,n = − n(n − 3)(n − 4)(n − 6)(n − 17). 5!

Coeﬃcients ωr,n for small values of |n − r| it will be generated in Remark 6.5.

540

Roman Witula, Damian Slota / Central European Journal of Mathematics 4(3) 2006 531–546

Proof (of Lemma 6.1). We have

x

2n

+y

2n

n 2n x+y 2n − k n k = Ω2n √ (−1) (xy) = (xy)k (x + y)2n−2k xy 2n − k k k=0 n n−k 2n − k 2n k = (−1) (xy)k (x2 + xy + y 2 ) + xy 2n − k k k=0 n n−k 2n − k n−k 2n k = (−1) (xy)k+i (x2 + xy + y 2 )n−k 2n − k k i k=0 i=0 n r 2n − l n − l 2n (xy)r (x2 + xy + y 2 )n−r . = (−1)l 2n − l l r − l r=0 l=0

where Ωn (x) := 2Tn (x/2) is the so called: modiﬁed Chebyshev polynomial of the ﬁrst kind (see [16] where many interesting properties of these polynomials are discussed). Corollary 6.3. We present an explicit form of identities (17) for n = 4, 5, 7. The identities were subsequently transformed to derive at the equivalent yet more attractive forms for further applications: x8 + x4 y 4 + y 8 = (x2 + xy + y 2 )4 − 4xy(x2 + xy + y 2 )3 + + 2x2 y 2 (x2 + xy + y 2 )2 + 4x3 y 3 (x2 + xy + y 2 ) 3 = 3(x2 + xy + y 2 )4 − 2 (x2 + kxy + y 2 ),

(18)

k=0 10

5 5

x +x y +y

10

2

2 5

= (x + xy + y ) − 5xy(x2 + xy + y 2 )4 + + 5x2 y 2 (x2 + xy + y 2 )3 + 5x3 y 3 (x2 + xy + y 2 )2 − − 5x4 y 4 (x2 + xy + y 2 ) = (x2 + xy + y 2 )5 − 5xy(x2 + xy + y 2 )(x + y)2 (x2 + y 2 )2 ,

(19)

x14 + (xy)7 + y 14 = (x2 + xy + y 2 )7 − 7xy(x2 + xy + y 2 )6 + + 14x2 y 2 (x2 + xy + y 2 )5 − 21x4 y 4 (x2 + xy + y 2 )3 + + 7x5 y 5 (x2 + xy + y 2 )2 + 7x6 y 6 (x2 + xy + y 2 ) = (x2 + xy + y 2 )7 − 7xy(x2 + xy + y 2 )(x + y)2 (x2 + y 2 )× × (x2 + y 2 )2 (x2 + xy + y 2 ) + x3 y 3 ,

(20)

Remark 6.4. The identities generated in Corollary 6.3 bring to mind easily veriﬁable divisibility relations. Accordingly, for each n ∈ N, 3 | n, there is a polynomial pn ∈ Z[x, y], such as: pn (x, y)

2 k=0

4n x2 + k x y + y 2 = 3 x 2 + x y + y 2 − x8n − (x y)4n − y 8n .

(21)

Roman Witula, Damian Slota / Central European Journal of Mathematics 4(3) 2006 531–546

541

For example, we derive: p1 (x, y) = 2 (x2 + 3 x y + y 2 ), p2 (x, y) = 2 (x4 + 3 x2 y 2 + y 4 ) (x6 + 9 x5 y + 19 x4 y 2 + 24 x3 y 3 + 19 x2 y 4 + 9 x y 5 + y 6 ) = 2 (x4 + 3 x2 y 2 + y 4 ) (x2 + x y + y 2 )2 (x2 + 7 x y + y 2 ) + x2 y 2 (x2 − x y + y 2 ) . Conversely, for each n ∈ N there are polynomials qn± ∈ Z[x, y], such that: xy

qn± (x, y)

2

6n±1 x2 + k x y + y 2 = x 2 + x y + y 2 − x12n±1 − (x y)6n±1 − y 12n±1 . (22)

k=0

Additionally, if 6n + 1 or 6n − 1 is a prime number, the left side of equation (22) may be supplemented by multiplier 6n + 1 or 6n − 1, respectively. For example: q1+ (x, y) = 7 (x6 + x5 y + 3 x4 y 2 + 3 x3 y 3 + 3 x2 y 4 + x y 5 + y 6 ),

q1− (x, y) = 5 (x2 + y 2 ).

Proof. We have x2 + y 2 = 0

⇐⇒

y = ±i x

and then we get 4n 3 x2 + x y + y 2 − x8n − (x y)4n − y 8n = 4n = 3 ± i x2 − x8n − (±i x2 )4n − (±i x)8n = 0. Now, we have x2 + x y + y 2 = 0

⇐⇒

⎧ ⎪ ⎨ x3 − y 3 = 0, ⎪ ⎩ x − y = 0,

⇐⇒

y = x exp ± i 23 π .

Hence, we obtain: 4n 3 x2 + x y + y 2 − x8n − (x y)4n − y 8n =

8n 8 16 = −x 1 + exp ± i 3 n π + exp ± i 3 n π =

n π

8n n n 8n n nπ nπ = −x = 1+(−1) exp ∓i 3 +(−1) exp ±i 3 1+2 (−1) cos 3 = −x nπ (but cos 3 = (−1)n−1 /2 whenever 3 | n) = −x8n 1 + (−1)2n−1 = 0. Moreover, we get

and

2 2 4n 8n 4n 8n 3 x + xy + y − x − (x y) − y

=0 y=−x

∂ 2 2 4n 8n 4n 8n 3 x + xy + y − x − (x y) − y = 0. ∂x y=−x

2

542

Roman Witula, Damian Slota / Central European Journal of Mathematics 4(3) 2006 531–546

6.1 Some Applications Identities (18), (19) and (20) (respectively) can be used to generate some identities for Lucas and Fibonacci numbers. First, for x = Ln , y = Ln+1 : 4 4 3 2 L2n+2 + (−1)n − L8n − Ln Ln+1 − L8n+1 = = 10 F2n+1 L2n+2 2 L2n+2 + (−1)n 2 L2n+3 + 3(−1)n ,

2 L2n+2 + (−1)n

5

5 − L10 − L10 n − Ln Ln+1 n+1 =

2 2 L2n+2 + (−1)n ; = 125 Ln Ln+1 L2n+2 F2n+1

next, for x = Fn , y = Fn+1 : 4 4 8 = 3 2 L2n+2 − (−1)n − 54 Fn8 + Fn Fn+1 + Fn+1 2 2 L2n+2 − (−1)n 2 L2n+3 − 3(−1)n , = 50 F2n+1 Fn+2

2 L2n+2 − (−1)n

5

5 10 = − 55 Fn10 + Fn Fn+1 + Fn+1

2 2 = 55 Fn Fn+1 Fn+2 2 L2n+2 − (−1)n ; F2n+1

now, for x = Ln , y = Ln+3 :

4 L2n+2 − Ln Ln+3

5

5 − L10 − L10 n − Ln Ln+3 n+3 =

2 2 4 Ln+2 − Ln Ln+3 , = 2000 Ln Ln+3 L2n+2 F2n+3

7 2 − L14 − L14 n − Ln Ln+3 n+3 = 280 Ln Ln+3 Ln+2 × 2 2 3 2 × F2n+3 4 Ln+2 − Ln Ln+3 100 F2n+3 4 Ln+2 − Ln Ln+3 + Ln Ln+3 ;

4 L2n+2 − Ln Ln+3

7

and ﬁnally, for x = Fn , y = Fn+3 :

2 4 Fn+2 − Fn Fn+3

5

5 10 − Fn10 − Fn Fn+3 − Fn+3 =

2 2 2 = 80 Fn Fn+3 Fn+2 4 Fn+2 − Fn Fn+3 , F2n+3

7 14 − Fn14 − Fn Fn+3 − Fn+3 = 2 3 2 2 2 = 56 Fn Fn+3 Fn+2 F2n+3 4 Fn+2 − Fn Fn+3 4 F2n+3 4 Fn+2 − Fn Fn+3 + Fn Fn+3 . 2 4 Fn+2 − Fn Fn+3

7

Remark 6.5. The coeﬃcients ωr,n from Lemma 6.1 for the values of r near to n are related to Chebyshev polynomials of the ﬁrst kind Tn (x) = cos(n arccos x), x ∈ [−1, 1] (see [12, 15]), because the following lemma holds:

Roman Witula, Damian Slota / Central European Journal of Mathematics 4(3) 2006 531–546

543

Lemma 6.6. We have n/2 n (−1)k n − k (2x)n−2k , Tn (x) = 2 k=0 n − k k

x ∈ R.

(23)

Corollary 6.7. By diﬀerentiating the identity (23) for x ∈ (−1, 1) we obtain the following formulas: n sin(n arccos x) √ Tn (x) = =n 1 − x2

Tn (x) =

Tn (x) =

(n−1)/2

k=0

− 2k n − k (−1) (2x)n−2k−1 , n−k k kn

(24)

−n2 cos(n arccos x) n x sin(n arccos x) + = 1 − x2 (1 − x2 )3 n/2−1 n−k n − 2k 1 k (−1) (2x)n−2k−2 ; (25) = 4n n − k k 2 k=0 (n − n3 ) sin(n arccos x) 3 n2 x cos(n arccos x) 3 n x2 sin(n arccos x) − + = (1 − x2 )2 (1 − x2 )3 (1 − x2 )5 (n−3)/2 n−k n − 2k 1 k (−1) (2x)n−2k−3 ; (26) = 24 n n − k k 3 k=0

etc. Using identities (23)–(26), for x = 12 , the following ones can be deduced: (−1)k n− k 2 Tn 2 = 2 cos =n ; n− k k k=0 m 2 2m− l l 2m ωm,m = 2 cos 3 πm = ; (−1) 2m− l l l=0 1

n π3

n/2

(27) (28)

m−1 2 1 1 2√ 2m− k k m− k ωm−1,m = T2m 2 = (−1) 3m sin 3 πm = 2m ; (29) 2 3 2m− k k k=0 √ n/2−1 π π n2 3 n− k n− 2k 1 1 1 k = − cos n 3 + (−1) T n sin n 3 = n ; 4 n 2 3 9 n− k k 2 k=0 (30)

ωm−2,m

√ 2 m2 3 1 1 1 1 1

= T2m 2 − T2m 2 = − cos 3 πm − m sin 23 πm = 4 4 2 3 9 m−2 2m − l m − l l 2m = (−1) ; (31) 2m − l l 2 l=0

544

Roman Witula, Damian Slota / Central European Journal of Mathematics 4(3) 2006 531–546

4 4√ 1 1 3m(1 − 2m2 ) sin 23 πm − m2 cos 23 πm = T2m 2 = 24 27 9 m−2 2m − k 2m − 2k 2m k (−1) ; (32) = 2m − k k 3 k=0 ωm−3,m =

1 1 1 1

1 1 1 1 − T2m 12 + T2m 12 = T2m 2 − 4 ωm−2,m = T 8 24 8 24 2m 2 4 2 1√ 1 = 3m(2 − m2 ) sin 23 πm + m2 cos 23 πm = 27 9 m−3 2m − l m − l l 2m = (−1) . (33) 2m − l l 3 l=0

Moreover, from identities (24)-(26) it is possible to generate the following identities between the derivatives of Chebyshev polynomials of the ﬁrst kind. Lemma 6.8. We have

n/2 Tn (x) k n−k Tn (x) k − 2n = −2 n (−1) (2x)−2k , (2x)n−1 (2x)n n − k k k=1

n/2 (−1)k k 2 n − k Tn (x) Tn (x) 2 Tn (x) (2x)−2k , − 2 (2 n − 1) + 4n = 8n (2x)n−2 (2x)n−1 (2x)n n − k k k=1

(34)

(35)

Tn (x) T (x) T (x) Tn (x) − 2 (3 n − 3) n n−2 + 4 (3 n2 − 3 n + 1) n n−1 − 8 n3 = n−3 (2x) (2x) (2x) (2x)n n/2 3 n−k k k = −32 n (−1) (2x)−2k , (36) n − k k k=1 and the following general formula hold: n/2 l (l−k) l Tn (x) n−k k 2l−1 k+l k (−2) pk,l (n) =2 n (−1) (2x)−2k , n−l+k (2x) n − k k k=0 k=1

(37)

where p0,l (n) = 1,

pl,l (n) = nl ,

pk+1,l+1 (n) = (n − l + k) pk,l (n) + pk+1,l (n),

k = 0, 1, . . . , l − 1.

Hence, for x = 12 we get the following identities: √ n/2 π k 3 π n−k k cos n (−1) − sin n = , 3 3 3 n−k k k=1 √ n/2 2 π π n−k 3 n k k (−1) cos n − (3n − 2) sin n = , 3 3 9 3 n−k k k=1 n/2 √ 3 π π n−k 3 n 2 k k (−1) cos n − (2n − 3n + 2) sin n = . 3 3 9 3 n − k k k=1

(38)

(39)

(40)

Roman Witula, Damian Slota / Central European Journal of Mathematics 4(3) 2006 531–546

545

Remark 6.9. Because the derivatives of polynomials Tn (x) are recurrently related [12]: (1 − x2 )Tn(m+2) (x) = (2m + 1) x Tn(m+1) (x) + (m2 − n2 ) Tn(m) (x),

m = 0, 1, . . .

thus, for x = 12 , the following identity is derived: 3 4

Tn(m+2)

1 2

= 12 (2m + 1) Tn(m+1)

1 2

+ (m2 − n2 ) Tn(m)

1 2

hence, the following formulas can be obtained π 2√ 1 1 π Tn = cos n , Tn = , 3 n sin n 2 3 2 3 3 1 π π 4√ 4 2 3 n sin n Tn = − n cos n 2 9 3 3 3 and the general formula: π π (m) 1 = α(m) pm (n) sin n + β(m) qm (n) cos n , Tn 2 3 3 √ where α(m), β(m) ∈ Q, α(m) = 2a(m) 3b(m) 3, β(m) = 2c(m) 3d(m) , and pm (n), qm (n) ∈ Z[n] are polynomials, even and odd degree, respectively.

References [1] L. Carlitz and J.A.H. Hunter: “Sums of Powers of Fibonacci and Lucas Numbers”, Fibonacci Quart., Vol. 7, (1969), pp. 467–473. [2] A.F. Horadam: “Basic properties of a certain generalized sequence of numbers”, Fibonacci Quart., Vol. 3, (1965), pp. 161–176. [3] A.F. Horadam: “Generating functions for powers of a certain generalised sequence of numbers”, Duke Math. J., Vol. 32, (1965), pp. 437–446. [4] T. Koshy: Fibonacci and Lucas Numbers with Applications, Wiley, New York, 2001. [5] T. Mansour: “A formula for the generating functions of powers of Horadam’s sequence”, Australas. J. Combin., Vol. 30, (2004), pp. 207–212. [6] R.S. Melham: “Sums of Certain Products of Fibonacci and Lucas Numbers – Part I”, Fibonacci Quart., Vol. 37, (1999), pp. 248–251. [7] R.S. Melham: “Families of Identities Involving Sums of Powers of the Fibonacci and Lucas Numbers”, Fibonacci Quart., Vol. 37, (1999), pp. 315–319. [8] R.S. Melham: “Sums of Certain Products of Fibonacci and Lucas Numbers – Part II”, Fibonacci Quart., Vol. 38, (2000), pp. 3–7. [9] R.S. Melham: “Alternating Sums of Fourth Powers of Fibonacci and Lucas Numbers”, Fibonacci Quart., Vol. 38, (2000), pp. 254–259. [10] J. Morgado: “Note on some results of A.F. Horadam and A.G. Shannon concerning a Catalan’s identity on Fibonacci Numbers”, Portugal. Math., Vol. 44, (1987), pp. 243–252.

546

Roman Witula, Damian Slota / Central European Journal of Mathematics 4(3) 2006 531–546

[11] J. Morgado: “Note on the Chebyshev polynomials and applications to the Fibonacci numbers”, Portugal. Math., Vol. 52, (1995), pp. 363–378. [12] S. Paszkowski: Numerical Applications of Chebyshev Polynomials and Series, PWN, Warsaw, 1975 (in Polish). [13] P. Ribenboim: Fermat’s Last Theorem For Amateurs, Springer, New York 1999. [14] J. Riordan: “Generating functions for powers of Fibonacci numbers”, Duke Math. J., Vol. 29, (1962), pp. 5–12. [15] T. Rivlin: Chebyshev Polynomials from Approximation Theory to Algebra and Number Theory, 2nd ed., Wiley, New York, 1990. [16] R. Witula and D. Slota: “On Modiﬁed Chebyshev Polynomials”, J. Math. Anal. Appl., (2006), in print.