05chap1 PDF

CHAPTER
An Introduction to Groups
While we have no intention of presenting a comprehensive treatment of group theory in this text, there are a number of definitions that will facilitate a rigorous description of vector spaces. Furthermore, the concepts from abstract algebra that we shall introduce will be of great use to us throughout the text.
1.1 DEFINITIONS A group (G, ) is a nonempty set G together with a binary operation called multiplication (or a product) and denoted by that obeys the following axioms: (G1) (G2) (G3) (G4) a, b G implies ab G (closure); a, b, c G implies (ab)c = a(bc) (associativity); There exists e G such that ae = ea = a for all a G (identity); For each a G, there exists a G such that aa = aa = e (inverse).
Furthermore, a group is said to be abelian if it also has the property that (G5) ab = ba for all a, b G (commutativity).
30
DEFINITIONS
31
In the case of abelian groups, the group multiplication operation is frequently denoted by + and called addition. We will generally simplify our notation by leaving out the group multiplication symbol and assuming that it is understood for the particular group under discussion. The number of elements in a group G is called its order of and will be denoted by o(G). (The order of G is frequently denoted by \G\ although we shall not use this notation.) If this number is finite, then we say that G is a finite group. Otherwise, G is said to be infinite. While we have defined a group in the usual manner, it should be realized that there is a certain amount of redundancy in our definition. In particular, it is not necessary to require that a right inverse also be the left inverse. To see this, suppose that for any a G, we have the right inverse defined by aa = e. Then multiplying from the left by a yields aaa = a. But a G so there exists an (a) G such that (a)(a) = e. Multiplying our previous expression from the right by (a) results in aa = e, and hence we see that a is also a left inverse. Of course, we could have started with a left inverse and shown that it is also a right inverse. Similarly, we could have defined a right identity by ae = a for all a G. We then observe that a = ae = a(aa) = (aa)a = ea, and hence e is also a left identity. It is easy to show that the identity element is unique. To see this, suppose that there exist e, e G such that for every a G we have ea = ae = ea = ae = a. Since ea = a for every a G, we have in particular that ee = e. On the other hand, since we also have ae = a, it follows that ee = e. Therefore e = ee = e so that e = e. Before showing the uniqueness of the inverse, we first prove an important basic result. Suppose that ax = ay for a, x, y G. Let a be a (not necessarily unique) inverse to a. Then x = ex = (aa)x = a(ax) = a(ay) = (aa)y = ey = y. In other words, the equation ax = ay means that x = y. This is sometimes called the (left) cancellation law. As a special case, we see that aa = e = aa implies a = a so that the inverse is indeed unique as claimed. This also shows that (a) = a since (a)(a) = e and aa = e. Finally, another important result follows by noting that (ab)(ba) = a((bb)a) = a(ea) = aa = e. Since the inverse is unique, we then see that (ab) = ba . This clearly extends by induction to any finite product of group elements.
32
AN INTRODUCTION TO GROUPS
Example 1.1 The set of integers = 0, 1, 2, . . . forms an infinite abelian group where the group multiplication operation is just ordinary addition. It should be obvious that the (additive) identity element is 0, and the inverse of any number n is given by -n. However, it is easy to see that is not a group under the operation of ordinary multiplication. Indeed, while is both closed and associative under multiplication, and it also contains the (multiplicative) identity element 1, no element of (other than 1) has a multiplicative inverse in (for example, 2 = 1/2 ! ). On the other hand, if we consider the set of all rational numbers, then forms a group under ordinary addition (with identity element 0 and inverse -p/q to any p/q ). Moreover, the nonzero elements of also form a group under ordinary multiplication (with identity element 1 and inverse q/p to any p/q ). Example 1.2 A more complicated (but quite useful) example is given by the set of all rotations in the xy-plane. (This example uses some notation that we have not yet defined in this book, although most readers should have no difficulty following the discussion.) Consider the following figure that shows a vector r = (x, y) making an angle with the x-axis, and a vector r = (x, y) making an angle + with the x-axis: y y x x r = (x, y) r = (x, y)
We assume r = \ r \ = \ r \ so that the vector r results from a counterclockwise rotation by an angle with respect to the vector r . From the figure, we see that r has components x and y given by
x ' = r cos(! + " ) = r cos! cos " # r sin ! sin " = x cos! # y sin ! y ' = r sin(! + " ) = r sin ! cos " + r cos! sin " = x sin ! + y cos! .
Let R() denote a counterclockwise rotation by an angle . It should be clear that R(0) is just the identity rotation (i.e., no rotation at all), and that the inverse is given by R() = R(-). With these definitions, it is easy to see
DEFINITIONS
33
that the set of all rotations in the plane forms an infinite (actually, continuous) abelian group. A convenient way of describing these rotations is with the matrix # cos ! " sin ! & R(! ) = % (. $ sin ! cos ! ' (Such a matrix is said form a representation of the rotation group.) We then see that r = R() r , which in matrix notation is just
! x '$ ! cos' # &=# " y ' % " sin ' ( sin ' $ ! x $ &# &. cos' % " y %
Using this notation, it is easy to see that R(0) is the identity since
! x $ ! 1 0 $! x $ # &=# &# & " y % " 0 1 %" y %
and also that R() = R(-) because

# cos! R(! ) R("! ) = % $ sin ! " sin ! & # cos! (% cos! ' $" sin ! sin ! & # 1 0 & (=% ( = R("! ) R(! ). cos! ' $ 0 1 '
We remark that while the rotation group in two dimensions is abelian, the rotation group in three dimensions is not. For example, let Rz() denote a rotation about the z-axis (in the right-handed sense). Then, applied to any vector x lying along the x-axis, we see that Ry(90)Rz(45)x Rz(45)Ry(90)x since in the second case, the result lies along the z-axis, while in the first case it does not. While we will return shortly to discuss subgroups in more detail, it will be of use to define them now. If G is a group, then a subset H G is said to be a subgroup of G if the elements of H form a group under the same group multiplication rule as G. For example, the set of integers is a subgroup of the group of all rational numbers under ordinary addition. Furthermore, it is easy to show that a nonempty subset H of a group G is a subgroup of G if and only if a, b H implies that ab H, and a H implies that a H (see Exercise 1.1.9).
34 Exercises 1.
Decide which of the following sets G forms a group under the indicated operation. If G does not form a group, give the reason. (a) G = {all integers} under ordinary subtraction. (b) G = {all nonzero rational numbers} under ordinary division. (c) G = {a, a, . . . , a6} where
#ai + j ai a j = $ %ai + j ! 7 if i + j < 7 . if i + j " 7
(d) G = {2m 3n: m, n } under ordinary multiplication. 2. Let F denote the set of all mappings from into . For any f, g F we define (f + g)(x) = f(x) + g(x) for each x so that f + g F. Show that this defines a group. Show that the collection of all subsets of a set S, with the operation of taking symmetric differences (see Exercise 0.1.2) as the group multiplication operation, forms a group. [Hint: Show that the identity element is , and the inverse of any A S is A itself.] Prove that any group of order n 4 must be abelian. Given two groups A and B, we can form the Cartesian product A B = {(a, b): a A and b B} of these groups considered as sets. Prove that A B can be made into a group with respect to the operation defined by (a, b)(a, b) = (aa, bb) for all a, a A and b, b B. This group is called the direct product of A and B. Prove that {(x, x): x G} is a subgroup of G G (see the previous problem). This is called the diagonal subgroup of G G. Let G = {g1, . . . , gn} be a group, and let h G be arbitrary but fixed. Define the set hG = {hg1, . . . , hgn} = {gh, . . . , gh}. Show that hG = G, and conclude that the ordered set (h1, . . . , hn) is a permutation of the ordered set (1, . . . , n). (This simple but very useful result is frequently referred to as the rearrangement lemma.) Let H be a subgroup of a group G.
3.
4. 5.
6.
7.
8.
DEFINITIONS
35
(a) If e is the identity element in G and f is the identity element in H, show that f = e. (b) If a H, show that the inverse element a is the same in H as the a is in G. 9. Let H be a nonempty subset of a group G. Prove that H is a subgroup of G if and only if a, b H implies ab H and a H implies a H.
10. Let H be a collection of subgroups of a group G. Show that the intersection of all H H is a subgroup of G. 11. Let G be a group. An element a G is said to be conjugate to an element b G if there exists g G such that b = gag. Show that this defines an equivalence relation on G. (Mutually conjugate elements of G are said to form a (conjugate) class.) 12. Let X be a (nonempty) subset of a group G, and let {H: i I} be the collection of all subgroups of G that contain X. Then H is called the subgroup of G generated by the set X and denoted X. Prove that X consists of all finite products a1n1 a2 n2 ! ar nr where ai ! X and ni ! ! . [Hint : Show that the set H of all such products is a subgroup of G that contains X and is contained in every subgroup containing X. Thus H < X < H.]
1.2 PERMUTATION GROUPS Let G be any group and suppose a G. As a matter of notational convenience, we define a0 = e, a1 = a, a2 = aa, . . . , ak = aak-1, as well as a-2 = (a)2, a-3 = (a)3, . . . (where a is the usual inverse element to a). It is then easy to see that for any m, n we have am an = am+n and (am)n = amn. From now on we will assume the reader understands that this is what is meant when we write an element of any group to a power. Now consider three objects (, , O) where the parentheses mean that the given order is relevant. We define this to be the canonical (or standard) order on the set {, , O}. Given any other ordered triple, for example (O, , ), we define a permutation f of the set S = {, , O} by
" ! ! O% f =$ ' #O ! ! &
36
where the first line is the set of objects in their canonical order and the second line is the given order. In other words, a permutation on a set S is a bijection from S onto itself. Note that simply giving an arbitrary order to a collection of objects does not in itself define a permutation. It is necessary that some canonical order also be specified as a point of reference. This notation, where the top row defines the canonical order, is referred to as two-line notation. However, it is very important to realize that as long as the same pairing of objects is maintained between the top and bottom rows, we may rearrange these pairs any way we please. For example, the above permutation f can equally well be written as
"O ! ! % f =$ '. #! O !&
It is also common to use a simplified one-line notation. In this case, the canonical order must be understood. For example, in the first case above we would write simply f = (O, , ) where the canonical order is understood to be (, , O). While we have now given a precise definition of the term permutation, there are other ways of describing permutations that are very useful in practice. Two of these are given in the next (rather long) example, which will then be generalized to form one of the most useful groups in linear algebra. Example 1.3 Suppose we have three boxes that each contain a single object. Now, given three distinct objects and three boxes, any one of the three objects could go into the first box, then either of the two remaining objects could go into the second box, and finally only the remaining object can go into the third and last box. In other words, there are 3! = 6 possible placements of the three distinct objects in the three boxes such that each box receives a single object. Let us see how permutations can be used to describe the distribution of distinct objects among boxes. We give two common, intuitive interpretations. Imagine three boxes labelled 1, 2, 3 that contain objects x1, x2, x3 respectively, as shown below:
x1
1
x2
2
x3
3
We now redistribute these objects among the boxes as follows:
x3
1
x1
2
x2
3
1.2 PERMUTATION GROUPS
37
One way to describe the transition from the first distribution to the second is by the permutation ! $ ! = # 1 2 3& f " 2 3 1% which is to be interpreted as a rule for redistributing objects by saying take the object in box i (a number in the upper row) and place it in box f (i) (the number in the lower row directly below it). In this example, this means that we take the object in box i = 1 and place it in box f (1) = 2, the object in box i = 2 goes into box f (2) = 3, and the object in box i = 3 goes into box f (3) = 1. This rule yields the second distribution from the first. (Note also that in terms of our original definition of a permutation, we can interpret f as a reordering of boxes in space. In other words, we can equally well describe the above redistribution in effect by leaving the objects fixed in space and rearranging the boxes underneath them. It is easy to see that if we leave the objects in the order (x1, x2, x3) and label the boxes underneath them in the order (2, 3, 1), then we obtain the same pairing of objects and boxes.) Another approach to describing this transition is by using permutations on the set of objects. For example, if we let
!x f =# 1 " x3 x2 x1 x3 $ & x2 %
then the second distribution (the lower row) is obtained from the first distribution (the upper row) by interpreting f as replace object x1 (wherever it is) by object x3 , replace object x2 (wherever it is) by object x1 , and replace object x3 (wherever it is) by object x2. An equivalent way to describe this permutation is by the mapping f defined by
f ( x1 ) = x3 f ( x2 ) = x1 f ( x 3 ) = x2
which we can write in the simple one-line notation
f = ( x3 , x1, x2 ).
Let us denote the set of objects {x1, x2, x3} by S. Since there are only six possible distinct arrangements of S within the three boxes, there can be only six such permutations of S. We wish to make this set of permutations into a group. In particular, we will then have a group (denoted by S3) of permuta-
38
tions defined on the set S. This group is called the symmetric group (or the permutation group) of degree 3. Since S3 contains 3! = 6 elements, its order is 6. We define the group multiplication as the composition of our permutations. For example, consider the permutation in S3 defined by either
! 1 2 3$ ! =# g & " 2 1 3%
or
!x g=# 1 " x2 x2 x1 x3 $ & x3 %
which in our one-line notation is simply g = (x2, x1, x3). Composing this with the above permutation f = (x3, x1, x2) we have, for example, (fg)(x1) = f(g(x1)) = f(x2) = x1 and it is easy to see that the complete expression is given by fg = (x1, x3, x2). Note however, that gf = (x3, x2, x1) fg so that S3 is a nonabelian group. This composition of mappings also shows us how to multiply our permutations. Indeed, if we write out the equation fg = (x1, x3, x2) in terms of our two-line notation, we obtain
!x fg = # 1 " x3 x2 x1 x3 $ ! x1 &# x2 % " x2 x2 x1 x3 $ ! x1 &=# x3 % " x1 x2 x3 x3 $ &. x2 %
Reading the product from right to left, we first see that x1 is replaced by x2, and then this x2 is replaced by x1, and the net result is that x1 is replaced by x1. Next we see that x2 is first replaced by x1, and then this x1 is replaced by x3 with the net result of replacing x2 by x3 . Finally, x3 is replaced by x3 , and then this x3 is replaced by x2, resulting in the replacement of x3 by x2. Therefore we see that combining the product from right to left results in exactly the same permutation as shown on the right hand side. Now let us see how to combine the alternative descriptions in terms of f and g . We know that f takes the initial distribution
x1
1
x2
2
x3
3
39
to the redistributed form (contents of box 1 to box 2, contents of box 2 to box 3, and contents of box 3 to box 1)
x3
1
x1
2
x2
3
and g takes the initial distribution to the redistributed form
x2
1
x1
2
x3
3
Applying f to this last distribution we obtain (just take the contents of box 1 to box 2 etc.)
x3
1
x2
2
x1
3
With respect to the initial distribution, this composition of permutations is just the permutation !1 2 3$ # & " 3 2 1% In other words, simply following each permutation in sequence results in
! 1 2 3$ ! 1 2 3$ !1 2 3$ !g ! =# f &# &=# &. " 2 3 1 % " 2 1 3% " 3 2 1 %
Again reading the product from right to left, we see that the object in box 1 goes into box 2, and then the object in box 2 goes into box 3, with the net result that the object in box 1 goes into box 3. Next, the object in box 2 goes into box 1, and then the object in box 1 goes into box 2, resulting in the object in box 2 going into box 2. Finally, the object in box 3 goes into box 3, and then the object in box 3 goes into box 1, resulting in the object in box 3 going into box 1. Therefore, reading this type of product from right to left also results in the correct combination of permutations. We now observe that f 2(x1) = f(x3) = x2, and in general f 2 = (x2, x3, x1) and f 3 = (x1, x2, x3)
40
which shows that f 3 = ff 2 = e, and hence f = f 2. Similarly, we leave it to the reader to show that g2 = e, and hence g = g. Since S3 contains six elements and we have already constructed the six
distinct mappings {e, f, g, f 2, fg, gf}, it must be true that any combination of mappings may be reduced to one of these six. To see this, all we really need to calculate is (fg)(x1) = f(x2) = x3, and in general, fg = (x3, x2, x1) = gf so that fg = gf. For example, we have f(gf) = f(fg) = (ff)g = g. Other combinations are proved in a similar manner. We now generalize this example to the case of an arbitrary (but finite) number of elements. Let S be a set containing a finite number n of elements. Then the set S of all one-to-one mappings of S onto itself is called the permutation group of degree n. It should be clear that S is of order n!. If f S, then f has the effect of taking x f(x) which we may write as
! x1 f =# " xi1
x2 xi2
! xn $ & ! xin %
where (i1, . . . , in) is some permutation of (1, . . . , n). To simplify our notation, let us write this mapping as
!1 2 ! n $ # & "i1 i2 ! in %
where the top row stands for (x1, x2, . . . , xn) and the bottom row represents (xi, xi, . . . , xi) which is just (x1, . . . , xn) in some permuted order. This should not be confused with the interpretation (which we will no longer use) of permutations as the object in box 1 goes into box i1 etc. The identity element in S is
!1 2 ! n $ # & "1 2 ! n %
and the inverse to any given permutation is just the permutation that restores the original order. For instance, the inverse to the permutation f defined in Example 1.3 is the permutation f = f 2 given in this notation by
41
! 1 2 3$ # &. " 2 3 1%
In general, we will denote elements of S by Greek letters such as , and , so that we have expressions such as x, 2x = x, and so forth. In other words, if S3 is just the mapping f in the previous example, then we would have 1 = 3, 2 = 1 and 3 = 2. Now let S be any set of n elements, and consider any element S. Given any x, y S, we say that x is equivalent to y if y = i x for some i , and we write this as x y. Since x = 0 x = ex = x, we see that x x. Next, note that if x y, then x = i y so that y = -ix, and hence y x. In addition, if x y and y z, then x = i y = i jz = i+j z, and hence x z. We have therefore defined an equivalence relation on S as described in Section 0.3. Furthermore, Theorem 0.2 shows that this equivalence relation induces a decomposition of S into disjoint subsets called the equivalence classes of S. For each x S, the equivalence class of x is the set [x] = {i x: i } which is called the orbit of x under . Since S is finite, sooner or later repeated applications of to x must give back x. In other words, for each x S there exists some smallest positive integer m such that m x = x (where the value of m need not be the same for every x S). Thus the orbit of x under will be the set {x, x, . . . , m-1x}. If we consider these elements as being in a particular order, we then obtain what is called a cycle of , and we write this as (x, x, . . . , m-1x). In words, this means x is replaced by x, x is replaced by 2x, . . . , and m-1x is replaced by x. It should be clear that a knowledge of all the cycles of is the same as knowing , because we would then know the result of applying to any x S. (While the cycle notation is the same as the one-line notation for a permutation, the context should always make it clear which is meant.) Example 1.4 Let S = {x1, . . . , x6} which we denote by {1, . . . , 6} for simplicity. We consider the element S6 given by
"1 2 3 4 5 6 % ! =$ '. #2 1 3 5 6 4 &
Now observe that 1 = 2 and 21 = 2 = 1, so the orbit of 1 is the set {1, 2} and the corresponding cycle is (1, 2). Since this cycle is the equivalence class of 1 and equivalence classes are disjoint, we see that it must also be the equivalence class of 2. Continuing, the orbit of 3 is just {3}, and for 4 we have 4 = 5, 24 = 6, and 34 = 4 so that the orbit of 4 is {4, 5, 6}. Thus the cycles
42
of are (1, 2), (3) and (4, 5, 6). Notice that these cycles are disjoint ordered equivalence classes of S under the mapping S6. We can carry this idea one step further as follows. Consider a cycle of the form (i1, . . . , im) which we now interpret as that permutation which replaces i1 by i2 , i2 by i 3 , . . . , im-1 by im, and im by i1. For example, using the set S = {1, . . . , 6}, the cycle (2, 6, 3) is to be interpreted as the permutation
!1 2 3 4 5 6 $ # &. "1 6 2 4 5 3%
Since we already know how to multiply permutations, we now have a way to multiply cycles. Thus, using this same S and, for example, the cycles (1, 5) and (2, 6, 3), we have ! 1 2 3 4 5 6 $ !1 2 3 4 5 6 $ (1, 5)(2, 6, 3) = # &# & " 5 2 3 4 1 6 % "1 6 2 4 5 3%
!1 2 3 4 5 6$ =# &. " 5 6 2 4 1 3%
Note that while we have defined our multiplication as proceeding from right to left, in this case we would have obtained the same result by multiplying the cycles in either order. In fact, it should not be hard to convince yourself that this will always be the case when disjoint cycles are multiplied together. In other words, disjoint cycles commute. This is because each cycle only acts on a specific subset of elements that are not acted on by any other (disjoint) cycle. As another example, let us now find the cycles of the permutation
"1 2 3 4 5 6% ! =$ '. # 5 6 2 4 1 3&
We have 1 = 5 and 21 = 1 so that the orbit of 1 is {1, 5}. Also, 2 = 6, 22 = 3, and 32 = 2 so the orbit of 2 is {2, 6, 3}. Therefore has the cycles (1, 5) and (2, 6, 3) (and of course, also (4)). But now notice that is just the product of these cycles (which contain no elements in common) taken in any order. A little thought as we just mentioned shows that this is not unexpected, as we prove in our first theorem. Theorem 1.1 Every permutation can be expressed as the product of disjoint cycles.
43
Proof Consider any permutation S on a set S, and assume that has k cycles where each cycle is of the form (x, x, 2 x, . . . , m-1x) for some i with 1 i k. (Note that since each x S must be in some cycle, and since the cycles are disjoint, we must have !ik=1mi = n where n is the number of elements in S.) When these cycles are multiplied together, we see that each of the corresponding permutations affects only those elements contained in the orbit (i.e., cycle) it represents. Hence, by multiplying together all of the cycles, each element of S will be accounted for with the same result as . Another way to see this is to consider the effect of on any x S. The resulting element x is exactly the same as the image of x under the product of all the (disjoint) cycles of since only the cycle containing x will have any effect on it. Since both and the product of its cycles have the same effect on any x S, it must be true that equals the product of its cycles. At this point, there is no substitute for simply working out an example for yourself. Thus, the reader should pick some permutation, find its cycles, and then multiply them together. In so doing, the proof of Theorem 1.1 should become quite obvious (or see the exercises at the end of this section). Suppose that S = {1, 2, . . . , m} and consider the product of the 2-cycles (1, m), (1, m - 1), . . . , (1, 3), (1, 2). Expressing these in terms of their corresponding permutations, we have (note the order of factors since we are multiplying from right to left, and these cycles are not disjoint)
! 1 2 3 4 ! m$ !1 2 3 4 ! m $! 1 2 3 4 ! m $ # &! # &# & "m 2 3 4 ! 1 % " 3 2 1 4 ! m %" 2 1 3 4 ! m % !1 2 3 4 ! m $ =# &. "2 3 4 5 ! 1 %
But this last permutation is just the m-cycle (1, 2, . . . , m). A similar calculation shows that in fact any m-cycle of the form (a, a, . . . , am) may be written as the product (a, am) ~ ~ ~ (a, a3)(a, a). (We remark that the multiplication of 2-cycles is one place where multiplying from left to right would be more natural.) Example 1.5 Consider the permutation
!1 2 3 4 5 6 $ # & "3 5 2 6 1 4%
44
and its cycle (1, 3, 2, 5). We claim that this cycle may be written as the product (1, 5)(1, 2)(1, 3). There are actually two equivalent ways of seeing this. First, we could write out all of the complete permutations as
!1 2 3 4 5 6 $ (1, 3, 2, 5) = # & "3 5 2 4 1 6% !1 2 3 4 5 6$ (1, 2) = # & "2 1 3 4 5 6% !1 2 3 4 5 6 $ (1, 3) = # & "3 2 1 4 5 6% !1 2 3 4 5 6$ (1, 5) = # & "5 2 3 4 1 6%
It is then easy to see that (1, 3, 2, 5) = (1, 5)(1, 2)(1, 3). On the other hand, we could also leave out those elements in each permutation that are not affected by any of the cycles, and simply write
!1 3 2 5 $ (1, 3, 2, 5) = # & "3 2 5 1% !1 3 2 5$ (1, 2) = # & "2 3 1 5% !1 3 2 5 $ (1, 3) = # & "3 1 2 5% !1 3 2 5$ (1, 5) = # & "5 3 2 1%
Again, we obtain (1, 3, 2, 5) = (1, 5)(1, 2)(1, 3). At this point you should be sufficiently familiar with the cycle notation to be able to multiply cycles without reverting to two-line notation. All of this discussion has shown that any m-cycle may be written as a product of 2-cycles, which are usually called transpositions. However we could also write, for example, (1, 2, . . . , m) = (m, m - 1) ~ ~ ~ (m, 2)(m, 1) so that this decomposition is by no means unique. With all of this background, it is now easy to prove an important result in the description of permutations. Theorem 1.2 Every permutation can be written as the product of transpositions. Proof Theorem 1.1 showed that every permutation can be written as the product of disjoint cycles, while we just showed that any cycle can be written (in a non-unique manner) as the product of transpositions. In view of this theorem, we say that a permutation is even (odd) if it can be written as the product of an even (odd) number of transpositions. Of
45
course, since the decomposition of cycles into transpositions is not unique, we must be sure that such a designation is unambiguous. This is the intent of our next theorem. Theorem 1.3 If a permutation can be represented by an even (odd) number of transpositions in one decomposition, then any other decomposition must also be an even (odd) number of transpositions. Proof Define the polynomial p in n real variables by
p( x1,, xn ) = "
i< j
( xi ! x j )
= ( x1 ! x2 )( x1 ! x3 )!( x2 ! x3 )( x2 ! x4 )!( xn !1 ! xn )
and let S be any transposition. By p we mean p(x1, . . . , xn) = p(x1 , . . . , xn) . We claim that p = -p. To see this in detail, let be the transposition (xa, xb). We assume without loss of generality that xa < xb , and write out all of those terms in p(x1, . . . , xn) that contain either xa or xb (or both). Thus, those terms containing xa are
( x1 ! xa )( x2 ! xa )!( xa !1 ! xa )} { "$$$$$ #$$$$$%

a!1 terms
! {( xa " xa+1 )( xa " xa+ 2 )!( xa " xb "1 )} "$$$$$$ #$$$$$$ % b"a"1 terms ! {( xa " xb )} ! {( xa " xb+1 )%( xa " xn )} !#"#$ !### # "#### $ 1 term n"b terms
while those containing xb are
( x1 ! xb )( x2 ! xb )!( xa !1 ! xb )} " {( xa ! xb )} { "$$$$$ #$$$$$% "$ #$ %

a!1 terms already counted
! {( xa+1 " xb )( xa+ 2 " xb )!( xb "1 " xb )} "$$$$$$ #$$$$$$ % b"a"1 terms
46
! {( xb " xb+1 )!( xb " xn )} "$$$ $ #$$$$ % n"b terms

Since all of these terms are multiplied together, we see that if xa and xb are interchanged (but not xa+1 and xb+1 etc.), there will be no net effect on the polynomial p(x, . . . , x) except for the unpaired term (xa - xb) which results in a single change of sign. This shows that p = -p as claimed. Now, for any other S, Theorem 1.2 shows that = where each is a transposition. Thus, if
k
! = #" i
i =1
we see that
$ k ' k !p =& " # i) & ) p = (*1) p. % i =1 (

Similarly, if
m
! = #" i
i =1
we have p = (-1)m p. Therefore, if is represented by k transpositions and by m transpositions, we must have (-1)k p = (-1)m p, and hence k and m must both be even or both be odd. This result allows us to make the unambiguous definition of the sign of a permutation as follows. We define the sign of a permutation , sgn , by
#+1 if ! is even sgn ! = $ . %"1 if ! is odd
Our next theorem will be of great benefit to us when we come to discuss the theory of determinants in Chapter 4. Theorem 1.4 For any two permutations , S we have sgn() = (sgn )(sgn ) . Proof By Theorem 1.2, we may write as a product of k transpositions and as a product of m transpositions. Therefore it follows from Theorem 1.3 that sgn = (-1)k and sgn = (-1)m. But then
47
sgn() = (-1)k +m = (-1)k (-1)m = (sgn )(sgn ) . As the final topic in our treatment of permutations, we take a look at the inverse of a given transposition. For any given transposition (a1, a2), it should be obvious that (a1, a2)2 is just the identity transposition. This may be formally shown by noting that
!a (a1, a2 )2 = (a1, a2 )(a1, a2 ) = # 1 " a2 a2 $ ! a1 &# a1 % " a2 a2 $ ! a1 a2 $ &=# & = e. a1 % " a1 a2 %
Since the identity element in any group is unique, this means that for any transposition we have = . In view of this result, one might rightfully expect that the sign of an inverse permutation is the same as the sign of the permutation itself. Theorem 1.5 For any S we have sgn = sgn . Proof By Theorem 1.2, we write = 12 ~ ~ ~ m where each is a transposition. Then, using the fact that is just a product of elements in the group S2 , we see that = ( 2 ~ ~ ~ m) = m ~ ~ ~ 2 1 = m ~ ~ ~ 2 1 and hence sgn = (-1)m = sgn .
Exercises 1. Consider the following permutations

"1 2 3 4 % ! =$ ' #1 4 3 2 & "1 2 3 4 % ( =$ ' #3 1 4 2&
and compute each of the following: (a) (b) (c) (e) (f) (g) ()
(d) (h) ()
2. Referring to Example 1.3, evaluate gfgf 3gf. How is f related to f? 3. Find all of the orbits and cycles of the following permutations:
48
!1 2 3 4 5 6 7 8 9$ (a ) # &. "2 3 4 5 1 6 7 9 8% !1 2 3 4 5 6$ (b ) # &. "6 5 4 3 1 2%
4. Express each of the following as the product of disjoint cycles: (a) (1, 2, 3)(4, 5)(1, 6, 7, 8, 9)(1, 5) (b) (1, 2)(1, 2, 3)(1, 2) 5. Determine which of the following products of cycles is an even permutation: (a) (1, 2, 3)(1, 2) (b) (1, 2, 3, 4, 5)(1, 2, 3)(4, 5) (c) (1, 2)(1, 3)(1, 4)(2, 5) 6. Show that the set An Sn consisting of even permutations forms a group. Show that Sn consists of n!/2 even permutations and n!/2 odd permutations. 7. Compute for each of the following: (a) = (1, 3, 5)(1, 2) = (1, 5, 7, 9). (b) = (5, 7, 9) = (1, 2, 3). 8. Show that permutations with the same cycle structure belong to the same class (see Exercise 1.1.8). In other words, if , Sn, show that has the same cycle structure as . [Hint: Using two-line notation, show that may be evaluated by simply applying to the top and bottom rows of separately.] 9. Show that Sn is non-abelian if n 3.
49
1.3 HOMOMORPHISMS OF GROUPS We now turn our attention to a discussion of mappings from one group to another. These results will be absolutely fundamental to everything else that follows, and it is essential that the reader thoroughly understand the concepts to be presented in this section. Let : G G be a mapping from a group G to a group G. If for every x, y G we have (xy) = (x) (y) then is said to be a homomorphism, and the groups G and G are said to be homomorphic. In other words, a homomorphism preserves group multiplication, but is not in general either surjective or injective. It should also be noted that the product xy is an element of G while the product (x) (y) is an element of G. Example 1.6 Let G be the (abelian) group of all real numbers under addition, and let G be the group of nonzero real numbers under multiplication. If we define : G G by (x) = 2x, then (x + y) = 2x +y = 2x 2y = (x) (y) so that is indeed a homomorphism. Example 1.7 Let G be the group of all real (or complex) numbers under ordinary addition. For any real (or complex) number a, we define the mapping of G onto itself by (x) = ax. This is clearly a homomorphism since (x + y) = a(x + y) = ax + ay = (x) + (y) . However, if b is any other nonzero real (or complex) number, then we leave it to the reader to easily show that the (nonhomogeneous) mapping (x) = ax + b is not a homomorphism. Let e be the identity element of G, and let e be the identity element of G. If : G G is a homomorphism, then (x)e = (x) = (xe) = (x)(e), and we have the important result (e) = e. Using this result, we then see that e = (e) = (xx) = (x) (x), and hence the uniqueness of the inverse tells us that
50
(x) = (x) . It is very important to note that in general (x) (x) since if x G we have (x) G while if x G, then (x) G. Using these results, it should now be easy for the reader to show that (G) forms a subgroup of G (see Exercise 1.3.1). In general, there may be many elements x G that map into the same element x G under . It is of particular interest to see what happens if more than one element of G (besides e) maps into e. If k G is such that (k) = e, then for any x G we have (xk) = (x) (k) = (x)e = (x). Therefore if xk x we see that could not possibly be a one-to-one mapping. To help us get a hold on when a homomorphism is one-to-one, we define the kernel of to be the set Ker = {x G: (x) = e} . It is also easy to see that Ker is a subgroup of G (see Exercise 1.3.1). If a homomorphism : G G is one-to-one (i.e., injective), we say that is an isomorphism. If, in addition, is also onto (i.e., surjective), then we say that G and G are isomorphic. In other words, G and G are isomorphic if is a bijective homomorphism. (We point out that many authors use the word isomorphism to implicitly mean that is a bijection.) In particular, an isomorphism of a group onto itself is called an automorphism. From the definition, it appears that there is a relationship between the kernel of a homomorphism and whether or not it is an isomorphism. We now proceed to show that this is indeed the case. By way of notation, if H is a subset of a group G, then by Hg we mean the set Hg = {hg G: h H}. Recall also that if : G G and x G then, by an inverse image of x, we mean any element x G such that (x) = x. Theorem 1.6 Let be a homomorphism of a group G onto a group G, and let K be the kernel of . Then given any x G, the set of all inverse images of x is given by K x where x G is any particular inverse image of x. Proof Consider any k K . Then by definition of homomorphism, we must have (kx) = (k) (x) = ex = x . In other words, if x is any inverse image of x, then so is any kx K x. We must be sure that there is no other element y G, y ! K x with the property that (y) = x.
1.3 HOMOMORPHISMS OF GROUPS
51
that
To see that this is true, suppose (y) = x = (x). Then (y) = (x) implies e = (y)(x) = (y)(x) = (yx) .
But this means that yx K , and hence yx = k for some k K . Therefore y = kx K x and must have already been taken into account. Corollary A homomorphism mapping a group G to a group G is an isomorphism if and only if Ker = {e}. Proof Note that if (G) G, then we may apply Theorem 1.6 to G and (G). In other words, it is trivial that always maps G onto (G). Now, if is an isomorphism, then it is one-to-one by definition, so that there can be no element of G other than e that maps into e. Conversely, if Ker = {e} then Theorem 1.6 shows that any x (G) G has exactly one inverse image. Of course, if is surjective, then (G) is just equal to G. In other words, we may think of isomorphic groups as being essentially identical to each other. Example 1.8 Let G be any group, and let g G be fixed. We define the mapping : G G by (x) = gxg, and we claim that is an automorphism. To see this, first note that is indeed a homomorphism since for any x, y G we have
! ( xy) = g( xy)g "1 = g( xey)g "1 = g( xg "1gy)g "1 = ( gxg "1 )( gyg "1 ) = ! ( x )! ( y).
To see that is surjective, simply note that for any y G we may define x = gyg so that (x) = y. Next, we observe that if (x) = gxg = e, then rightmultiplying by g and left multiplying by g yields x = (gg)x(gg) = geg = e and hence Ker = {e}. From the corollary to Theorem 1.6, we now see that must be an isomorphism.
52 Exercises
1. Let : G G be a homomorphism. (a) Show that (G) is a subgroup of G. (b) Show that Ker is a subgroup of G. 2. Show that the composition : A C is a homomorphism if both : B C and : A B are. 3. Determine which of the following mappings : G G are homomorphisms, and for those that are, determine their kernel: (a) G = G = the group of nonzero real numbers under multiplication, and (x) = x2 for all x G. (b) Repeat part (a) but with (x) = 2x. (c) G = G = the group of all real numbers under addition, and (x) = 1 + x for all x G. (d) Repeat part (c), but with (x) = kx for any (fixed) number k. 4. Show that an isomorphism defines an equivalence relation on the set of all groups. 5. If G is abelian and G is isomorphic to G, prove that G is also abelian. 6. Let + denote the set of all real numbers > 0, and define the mapping : + by (x) = log10 x for each x +. Let + be a group with respect to multiplication, and let be a group with respect to addition. Show that is an isomorphism. 7. Let A and B be groups (with their own group operations). Show that A B is isomorphic to B A (see Exercise 1.1.5). 8. (a) (Cayleys theorem) Prove that every group G of order n is isomorphic to a subgroup of Sn for some S. [Hint: By the rearrangement lemma (Exercise 1.1.7), we know that hG = G for any h G. If G = {g1, . . . , gn}, define the mapping : G Sn by
" g ! gn % ! (a ) = $ 1 ' # ag1 ! agn &
for every a G. Using the techniques of Exercise 1.2.8, show that is a homomorphism, i.e., (ab) = (a)(b).]
1.3 HOMOMORPHISMS OF GROUPS
53
(b) Explain why this result shows that there can only be a finite number of non-isomorphic groups of order n.
1.4 RINGS AND FIELDS Before starting our discussion of vector spaces, let us first define precisely what is meant by a field. We shall see that this is simply a generalization of those essential properties of the real and complex numbers that we have been using all along. For the sake of completeness and future reference, we will do this in a somewhat roundabout manner. A nonempty set R together with two operations denoted by + and is said to be an associative ring if it obeys the following axioms for all a, b, c R: (R1) (R2) (R3) (R4) (R5) (R6) (R7) (R8) a + b R; a + b = b + a; (a + b) + c = a + (b + c); There exists an element 0 R such that a + 0 = a; There exists and element -a R such that a + (-a) = 0; ab R; (ab)c = a(bc); a(b + c) = ab + ac and (a + b)c = ac + bc.
Since every ring that we will ever discuss obeys (R7), we henceforth drop the adjective associative when discussing rings. It should also be noticed that (R1) - (R5) simply require that R be an abelian group under the operation + which we call addition. In addition to these axioms, if there exists an element 1 R such that a1 = 1a = a for every a R, then R is said to be a ring with unit element. Furthermore, if for every a, b R we have ab = ba, then R is called a commutative ring. As usual, we shall generally leave out the multiplication sign when dealing with rings. Example 1.9 The set of all real integers under the usual operations of addition and multiplication is a commutative ring with unit element. However, the set of even integers under addition and multiplication is a commutative ring with no unit element. Note also that the set + of positive integers is not a ring since there are no additive inverse (i.e., negative) elements in this set. Note that while the elements of a ring form an additive abelian group, we have not required that each element have a multiplicative inverse. However, if the nonzero elements of a ring R happen to form a group under multiplication, we say that R is a division ring. In this case we denote the unit element of R by 1, and we let a denote the inverse of any element a R. The reason that
54
only the nonzero elements are considered is that 0 has no inverse 0 such that 00 = 1. Finally, a field is defined to be a commutative division ring. We will generally denote an arbitrary field by the symbol F. Example 1.10 It should be clear that the real numbers form a field with the usual operations of addition and multiplication. However, the set of integers does not form a field because for any n with n 0, we have n = 1/n (except for n = 1). It is also true that the complex numbers form a field, but this is slightly more difficult to prove. To do so, let us denote a complex number a + ib by the ordered pair (a, b) . Referring to Section 0.6 for motivation, we define addition and multiplication on these pairs by (a, b) + (c, d) = (a + c, b + d) (a, b)(c, d) = (ac - bd, ad + bc) for all a, b, c, d . We claim that the set consisting of all such ordered pairs is a field. Some of the details will be left to the reader to fill in, but we will show the important points here. The additive identity element is clearly (0, 0), the negative of any (a, b) is (-a, -b), and the multiplicative identity is (1, 0). Multiplication is commutative since (a, b)(c, d) = (ac - bd, ad + bc) = (ca - db, cb + da) = (c, d)(a, b) . To prove associativity, we have
(a, b)[(c, d)(e, f)] = (a, b)(ce ! df, cf + de) = (ace ! adf ! bcf ! bde, acf + ade + bce ! bdf) = (ac ! bd, ad + bc)(e, f) = [(a, b)(c, d)](e, f).
Finally, we show that every (a, b) (0, 0) has an inverse in . Since a and b can not both be 0, we have a2 + b2 > 0. We leave it to the reader to show that
" a !b % (a, b) $ 2 , 2 ' = (1, 0). 2 # a + b a + b2 &

(In the notation of Chapter 0, we see that this is just the statement that zz* = \z\2 implies z = z*/\z\2.) We will be using the fields and almost exclusively in this text.
1.4 RINGS AND FIELDS
55
Since we will be using fields (and therefore rings) as our usual number system, let us use the defining relations to prove formally that a ring behaves in the manner we are accustomed to and expect. Theorem 1.7 Let R be a ring with unit element. Then for all a, b R we have (a) a0 = 0a = 0. (b) a(-b) = (-a)b = -(ab). (c) (-a)(-b) = ab. (d) (-1)a = -a. (e) (-1)(-1) = 1. Proof (a) a0 = a(0 + 0) (by (R4)) = a0 + a0 (by (R8)). But R is an additive abelian group, so that canceling a0 from both sides of this equation says that a0 = 0. Similarly, we see that 0a = (0 + 0)a = 0a + 0a implies 0a = 0. (b) ab + a(-b) = a(b + (-b)) (by (R8)) = a0 (by (R5)) = 0 (by (a)). Therefore the group property of R shows that a(-b) = -(ab). It is clear that we also have (-a)b = -(ab). (c) (-a)(-b) = -(a(-b)) (by (b)) = -(-(ab)) (by (b) again). But -(-(ab)) is the unique inverse to -(ab), and since we also have ab + (-(ab)) = 0, it follows that -(-(ab)) = ab. (d) a + (-1)a = 1a + (-1)a = (1 + (-1))a = 0a = 0 so that (-1)a = -a. (e) This follows from (d) using a = -1 since (-1)(-1) = -(-1) = 1. Note the fact that R contains a unit element was actually only required for parts (d) and (e) of this theorem.
Exercises 1. Let R be a ring. Prove that a 2 - b2 = (a + b)(a - b) and that (a + b)2 = a2 + 2ab + b2 for all a, b R if and only if R is commutative (where by terms of the form a2 we mean aa). 2. Let F denote the set of all mappings from into . For any f, g F, we define (f + g)(x) = f(x) + g(x) and (fg)(x) = f(x)g(x) for each x . In other words, f + g and fg are in F. Show that this defines a ring of functions.
56
3. In the previous problem, show that if we replace the product fg by the composition f g then this does not define a ring. 4. Show that the set of all rational numbers forms a field. 5. Consider the set [2] = {a + b2: a, b }. We define addition and multiplication in [2] by (a + b2) + (a + b2) = (a + a) + (b + b)2 and (a + b2)(a + b2) = (aa + 2bb) + (ab + ba)2 . Show that the set [2] with these operations forms a ring. Does it form a field? 6. Repeat the previous problem with instead of .
1.5 MORE ON GROUPS AND RINGS In this section we lay the foundation for future work in our chapter on polynomials. If the reader has not had much experience with abstract algebra, this section may prove somewhat long and difficult on a first reading. Because of this, the student should feel free to skim this section now, and return to it only when it becomes necessary in later chapters. We have seen that fields offer a distinct advantage over rings in that elements of the field can be divided (since the field contains the multiplicative inverse of each nonzero element). It will be of interest to know how certain rings can be enlarged to form a field. Rather than treat this problem directly, we choose to introduce some additional terminology that will be of use in discussing further properties of polynomials. In view of the fact that a ring has both addition and multiplication defined on it, we make the following definition. Let R and R be rings. A mapping : R R is said to be a ring homomorphism if (a + b) = (a) + (b) and for all a, b R. We see that (ab) = (a) (b)
1.5 MORE ON GROUPS AND RINGS
57
(a) = (0 + a) = (0) + (a) and therefore (0) = 0. Then we also have 0 = (a - a) = (a) + (-a) so that adding -(a) to both sides yields (-a) = -(a) . While these last two results are the exact analogues of what we found for groups, not all of our results can be carried over directly. In particular, it must be remembered that every element of a group had an inverse in the group, while no such requirement is made on the multiplication in an arbitrary ring (recall that a ring in which the nonzero elements form a multiplicative group is called a division ring). If R is a commutative ring, a nonzero element a R is said to be a zero divisor if there exists a nonzero element b R such that ab = 0. We then say that a commutative ring is an integral domain if it contains no zero divisors. For example, the ring of integers is an integral domain. Example 1.11 Consider the set of all integers, and let n + be fixed. A notation that is frequently used in algebra is to write a|b to mean a divides b, (i.e., in this case, that b is an integral multiple of a) and c| d to mean c does not divide d. We define a relationship between the integers a and b by writing a ^ b(mod n) if n|(a - b). This relation is called congruence modulo n, and we read it as a is congruent to b modulo n. We leave it as an exercise for the reader to show that this defines an equivalence relation on the set of integers (see Exercise 1.5.2). For example, it should be clear that 5 = 2(mod 3), 23 = 5(mod 6) and 21 = -9(mod 10). Now suppose we define a ring R to be the set of integers mod 6 (this ring is usually denoted by 6). Then the elements of R are the equivalence classes of the integers, and we denote the equivalence class of an integer n by [n]. Then the elements of R are [0], [1], [2], [3], [4] and [5]. For example, from the previous paragraph we see that [5] = [23] because 6|(23 - 5). We define addition in R by [a] + [b] = [a + b], and thus [0] is the zero element of R. Defining multiplication in R by [a][b] = [ab], we see, for example,
58
that [2][5] = [4]. However, note that [2][3] = [0] even though [2] [0] and [3] [0], and thus R is not an integral domain. We will have much more to say about this ring in Section 6.6. It should now be clear that arbitrary rings can have a number of properties that we generally find rather unpleasant. Another type of pathology that is worth pointing out is the following. Let D be an integral domain. We say that D is of finite characteristic if there exists some integer m > 0 and some nonzero a D such that ma = 0. Then the smallest positive integer p such that pa = 0 for some nonzero a D is called the characteristic of D. If D is an integral domain of characteristic p, then there exists a nonzero element a D such that pa = 0. Then for any x D we also have 0 = (pa)x = (a + ~ ~ ~ + a)x = ax + ~ ~ ~ + ax = a(x + ~ ~ ~ + x) = a(px) . But D has no zero divisors, and hence we must have px = 0 for every x D. If D has a unit element , then an equivalent requirement is to say that if D is of characteristic p, then 1 + ~ ~ ~ + 1 = 0, where there are p terms in the sum. Furthermore, any such sum consisting of less than p terms is nonzero. Obviously, the most important types of integral domain for our purposes are those of characteristic 0. In other words, to say that D is of characteristic 0 means that if m is an integer and a D is nonzero, then ma = 0 if and only if m = 0. The reason that we even bother to mention this is because most of the theory of matrices and determinants that we shall develop is valid over an arbitrary field F. For example, we shall obtain results such as det A = -det A which implies that 2 det A = 0. However, if F happens to be of characteristic 2, then we can not conclude from this that det A = 0. In this book, we will always assume that our fields are of characteristic 0 (except in Section 6.6). Returning to our general discussion, let 1 and 1 be the multiplicative identities of the rings R and R respectively, and consider any ring homomorphism : R R. Then (a) = (1a) = (1) (a) but this does not in general imply that (1) = 1. However, if R is an integral domain and (a) 0, then we have 0 = (a) - (1) (a) = (a)[1 - (1)] and hence (1) = 1 (note that we do not distinguish in our notation between 0 and 0).
59
As was the case with groups, we define the kernel of to be the set If a, b Ker then Ker = {a R: (a) = 0} . (a + b) = (a) + (b) = 0 + 0 = 0 so that a + b Ker also. Furthermore, if a Ker then (-a) = -(a) = 0 so that the (additive) inverse of a is also in Ker . Thus Ker forms a subgroup of R under addition. As we also did with groups, we say that a ring homomorphism of R into R is a (ring) isomorphism if it is an injective (i.e., one-to-one) mapping. If there exists a bijective ring homomorphism of R onto R, then we say that R and R are isomorphic. Theorem 1.6 also carries over directly to the present case, and we then have that a ring homomorphism is an isomorphism if and only if Ker = {0}. Now note that another very important property of Ker comes from the observation that if a Ker and r R, then (ar) = (a) (r) = 0(r) = 0 . Similarly (ra) = 0, and therefore both ar and ra are in Ker . We take this property as the prototype of a new object defined as follows. A nonempty subset I of a ring R is said to be a (two-sided) ideal of R if I is a subgroup of R under addition, and if ar I and ra I for all a I and all r R. It is important to realize that the element r can be any element of R, not just an element of I. Now let R be a commutative ring with unit element, and let a R be arbitrary. We denote by (a) the set of all multiples of a by elements of R. (While this is a somewhat confusing notation, it nevertheless conforms to standard usage.) In other words, (a) = {ra: r R} . We claim that (a) is actually an ideal of R. Indeed, if r, s R, then ra, sa (a) and therefore ra + sa = (r + s)a (a). Next, we have 0 = 0a (a), and finally, the negative (i.e., additive inverse) of ra (a) is (-r)a which is also in (a). This shows that (a) is a subgroup of R under addition. Lastly, for any ra (a) and any s R, we see that (ra)s = s(ra) = (sr)a (a). We have thus shown that (a) is an ideal. In general, any ideal of the form (a) is called a principal ideal,
60
and the element a R is called a generator of (a). A principal ideal (a) is thus the smallest ideal of R that contains a. Example 1.12 We show that any field F has no ideals other than (0) and F. Since the ideal (0) is quite trivial, let I be an ideal and assume that I (0). If a I, a 0, then a F implies that a F so that 1 = aa I by the definition of ideal. But now, for any r F we have r = 1r I (again by the definition of ideal), and hence I = F. The converse of this example is given in the next theorem. Recall that a field is a commutative division ring, and hence a commutative ring R with unit element 1 is a field if every nonzero a R has an inverse b R with ab = 1. Theorem 1.8 If R is a commutative ring with unit element whose only ideals are (0) and R, then R is a field. Proof Part of this was proved in the above discussion, but for the sake of completeness we repeat it here. Let a R be nonzero, and consider the set Ra = {ra: r R} . We shall first show that this set is an ideal of R. To see this, suppose x, y Ra. Then there exist r, r R such that x = ra and y = ra. But then (using the definition of a ring) we see that x + y = ra + ra = (r + r)a Ra . Next we note that -x = -ra = (-r)a Ra and therefore Ra is a subgroup of R under addition. Now, given any r R we have rx = r(ra) = (rr)a Ra and since R is commutative, it also follows that xr Ra. This shows that Ra is an ideal of R. By hypothesis, we see that Ra must equal either (0) or R. Since R is a ring with unit element, we have 0 a = 1a Ra and hence Ra (0). This means that we must have Ra = R so that every element of R is a multiple of a. In particular, since 1 R, there must exist an element b R with the property that ba = 1. In other words, b = a and thus R is a field.
61
Now let H be a subgroup of a group G, and let a G be arbitrary. Then the set Ha = {ha: h H} is called a right coset of H in G. Let a, b G be arbitrary, and suppose that the cosets Ha and Hb have an element in common. This means that ha = hb for some h, h H. But then using the fact that H is a subgroup, we see that a = hha = hhb Hb . Since this means that a = hb for some h = hh H, we see (using the rearrangement lemma of Exercise 1.1.7) that this implies Ha = Hhb = Hb and therefore if any two right cosets have an element in common, then they must in fact be identical. It is easy to see that the set of all right cosets of H in G defines a partition of G and hence an equivalence relation that decomposes G into disjoint subsets (see Exercise 1.5.15). Recall that o(G) denotes the order of G (i.e., the number of elements in the group G). We claim that if H is a subgroup of G, then o(H) = o(Ha) for any a G. Indeed, to prove this we show that there is a bijection of H to Ha. Define the mapping : H Ha by (h) = ha. This is clearly a surjective mapping since Ha consists precisely of elements of the form ha for h H. To see that it is also injective, suppose that for some h, h H we have (h) = (h) or, equivalently, ha = ha. Multiplying from the right by a then implies that h = h, thus showing that is one-to-one. In the particular case of finite groups, the previous paragraph shows that any two right cosets of H in G must have the same number o(H) of elements. We also showed above that any two distinct right cosets have no elements in common. It then follows that any a G is in the unique right coset Ha, and therefore the set of all right cosets of H in G must contain every element of G. This means that if there are k distinct right cosets of H in G, then we must have ko(H) = o(G) (i.e., o(H)|o(G)), and hence we have proved Lagranges theorem: Theorem 1.9 If G is a finite group and H is a subgroup of G, then o(H) is a divisor of o(G).
62
The number o(G)/o(H) will be denoted by iG(H), and is usually called the index of H in G. (This is frequently denoted by [G : H].) The index of H in G is thus the number of distinct right cosets of H in G. While we have restricted our discussion to right cosets, it is clear that everything could be repeated using left cosets defined in the obvious way. It should also be clear that for a general subgroup H of a group G, we need not have Ha = aH for any a G. However, if N is a subgroup of G such that for every n N and g G we have gng N, then we say that N is a normal subgroup of G. An equivalent way of phrasing this is to say that N is a normal subgroup of G if and only if gNg N for all g G (where by gNg we mean the set of all gng with n N). Theorem 1.10 every g G. A subgroup N of G is normal if and only if gNg = N for
Proof If gNg = N for every g G, then clearly gNg N so that N is normal. Conversely, suppose that N is normal in G. Then, for each g G we have gNg N, and hence gNg = gN(g) N . Using this result, we see that N = (gg)N(gg) = g(gNg)g gNg and therefore N = gNg (This also follows from Example 1.8). The reader should be careful to note that this theorem does not say that gng = n for every n N and g G. This will in general not be true. The usefulness of this theorem is that it allows us to prove the following result. Theorem 1.11 A subgroup N of G is normal if and only if every left coset of N in G is also a right coset of N in G. Proof If N is normal, then gNg = N for every g G, and hence gN = Ng. Conversely, suppose that every left coset gN is also a right coset. We show that in fact this right coset must be Ng. Since N is a subgroup it must contain the identity element e, and therefore g = ge gN so that g must also be in whatever right coset it is that is identical to gN. But we also have eg = g so that g is in the right coset Ng. Then, since any two right cosets with an element in common must be identical, it follows that gN = Ng. Thus, we see that gNg = Ngg = N so that N is normal.
63
If G is a group and A, B are subsets of G, we define the set AB = {ab G: a A, b B} . In particular, if H is a subgroup of G, then HH H since H is closed under the group multiplication operation. But we also have H = He HH (since e H), and hence HH = H. Now let N be a normal subgroup of G. By Theorem 1.11 we then see that (Na)(Nb) = N(aN)b = N(Na)b = NNab = Nab . In other words, the product of right cosets of a normal subgroup is again a right coset. This closure property suggests that there may be a way to construct a group out of the cosets Na where a is any element of G. We now show that there is indeed a way to construct such a group. Our method is used frequently throughout mathematics, and entails forming what is called a quotient structure. Let G/N denote the collection of all right cosets of N in G. In other words, an element of G/N is a right coset of N in G. We use the product of subsets as defined above to define a product on G/N. Theorem 1.12 group. Let N be a normal subgroup of a group G. Then G/N is a
Proof We show that the product in G/N obeys properties (G1) - (G4) in the definition of a group. (1) If A, B G/N, then A = Na and B = Nb for some a, b G, and hence (since ab G) AB = NaNb = Nab G/N . (2) If A, B, C G/N, then A = Na, B = Nb and C = Nc for some a, b, c G and hence
(AB)C = (NaNb)Nc = (Nab)Nc = N(abN)c = N(Nab)c = N(ab)c = Na(bc) = Na(Nbc) = Na(NbNc) = A(BC).
(3) If A = Na G/N, then and similarly AN = NaNe = Nae = Na = A NA = NeNa = Nea = Na = A . Thus N = Ne G/N serves as the identity element in G/N.
64
(4) If Na G/N, then Na is also in G/N, and we have as well as NaNa = Naa = Ne NaNa = Naa = Ne . Therefore Na G/N is the inverse to any element Na G/N. Corollary o(G)/o(N). If N is a normal subgroup of a finite group G, then o(G/N) =
Proof By construction, G/N consists of all the right cosets of N in G, and since this number is just the definition of iG(N), we see that o(G/N) = o(G)/o(N). The group defined in Theorem 1.12 is called the quotient group (or factor group) of G by N. Let us now apply this quotient structure formalism to rings. Since any subgroup of an abelian group is automatically normal, and since any ring R is an abelian group under addition, any ideal I of R is therefore a normal subgroup of R (under addition). It is clear that we can now form the quotient group R/I where the elements of R/I are the cosets of I in R (since R is abelian, there is no need to distinguish between right and left cosets). We write these cosets as I + r (or r + I) for each r R. In the next theorem we show that R/I can in fact be made into a ring which is called the quotient ring of R by I. Theorem 1.13 define and Let I be an ideal of a ring R. For any I + a, I + b R/I we (I + a) + (I + b) = I + (a + b) (I + a)(I + b) = I + ab . Then, with these operations, R/I forms a ring. Proof From the proof of Theorem 1.12, it is obvious that R/I forms a group under addition if we use the composition rule (I + a) + (I + b) = I + (a + b) for all a, b R. We now turn our attention to the multiplication rule on R/I, and we begin by showing that this rule is well-defined. In other words, we must show that if I + a = I + a and I + b = I + b, then I + ab = I + ab. From I + a = I + a, we have a = x + a for some x I, and similarly b = y + b for some y I. Then
65
ab = (x + a)(y + b) = xy + xb + ay + ab . But I is an ideal so that xy, xb, and ay are all elements of I, and hence z = xy + xb + ay I. Therefore, ab = z + ab so that ab + I = ab + z + I = ab + I as desired. To show that R/I is a ring, we must verify that the properties (R1) - (R8) given in Section 1.4 hold in R/I. This is straightforward to do, and we give one example, leaving the rest to the reader (Exercise 1.5.5). To prove the first part of (R8), suppose a, b, c R. Then I + a, I + b, I + c R/I and hence
(I + a)[(I + b) + (I + c)] = (I + a)[I + (b + c)] = I + a(b + c) = I + (ab + ac) = (I + ab) + (I + ac) = (I + a)(I + b) + (I + a)(I + c).
Example 1.13 Recall that the set of all integers forms a commutative ring with unit element (see Example 1.9). If we choose any n , then n generates a principal ideal (n) that consists of all numbers of the form na for each a . For example, the number 2 generates the principal ideal (2) that is nothing more than the ring of all even integers. The quotient ring /(2) is then the set of all cosets of (2). Each of these cosets is either the set of even integers, or the even integers plus some odd integer. We have now finished essentially all of the mathematical formalism necessary to undertake a rigorous study of linear algebra. In the next chapter we begin our treatment of the subject matter proper of this text.
Exercises 1. Let be a homomorphism of a group G into a group G, and let K be the kernel of . Prove that K is a normal subgroup of G. This exercise refers to the relation congruence modulo n defined in Example 1.11. Throughout this exercise, let n + be arbitrary but fixed. (a) Show that this relation defines an equivalence relation. (b) Using Theorem 0.8 to divide a by n, show that the congruence relation has exactly n distinct equivalence sets.
2.
66
(c) If a ^ b(mod n) and c ^ d(mod n), show that a + c ^ (b + d)(mod n) and ac ^ bd(mod n). (d) Show /(n) is isomorphic to the integers mod n. 3. 4. 5. 6. 7. 8. Let : R R be a ring isomorphism. Show that R is commutative if R is. Let : R R be a ring isomorphism. Show that R is an integral domain if R is. Finish the proof that R/I forms a ring in Theorem 1.13. Prove that an integral domain is a field if and only if every nonzero element has a multiplicative inverse. Show that the kernel of a ring homomorphism is an ideal. Determine all the subgroups of the permutation group S3. Which of these is normal? Let N be a collection of normal subgroups of a group G. Show that the intersection of all N N is a normal subgroup of G.
9.
10. Prove or disprove the following statement: If : R R is a ring homomorphism, then the image of is an ideal of R. 11. Let be a homomorphism of a group G onto a group G, and let K be the kernel of . By Exercise 5.1, we know that K is a normal subgroup of G, and hence we may form the quotient group G/K. Prove that G/K is isomorphic to G. [Hint: Since any element in X G/K is of the form Kg where g G, define the mapping : G/K G by (X) = (Kg) = (g). To show that is an isomorphism, first show that is well-defined, that is, X = Kg = Kg implies (g) = (g). Next, show that is a homomorphism, i.e., that (XY) = (X) (Y). Now show that is surjective (use the fact that is surjective). Finally, show that Ker = {0} (you will need the additional fact that the identity in G/K is K = Ke).] 12. Show that a field F can have no zero divisors. 13. Let H be a subgroup of a group G. Show that the set of all right cosets of H in G decomposes G into disjoint subsets.
67
14. The center of a group G is the set Z = {z G: zg = gz for all g G}. Show that Z is a normal subgroup of G. 15. Show that the set {0, 1} with the usual addition and multiplication operations, but subject to 1 + 1 = 0, forms a field of characteristic 2. (This is an example of a finite field.) 16. Let G be a group and let G, G be subgroups of G with G G = e. Show that G and G commute if and only if they are normal subgroups.

05chap1 PDF

Enviado por

Dados do documento

Título original

Direitos autorais

Formatos disponíveis

Compartilhar este documento

Compartilhar ou incorporar documento

Opções de compartilhamento

Você considera este documento útil?

Este conteúdo é inapropriado?

Direitos autorais:

Formatos disponíveis

05chap1 PDF

Enviado por

Direitos autorais:

Formatos disponíveis

CHAPTER

and also that R() = R(-) because

We now redistribute these objects among the boxes as follows:

1.2 PERMUTATION GROUPS

1.2 PERMUTATION GROUPS

and g takes the initial distribution to the redistributed form

1.2 PERMUTATION GROUPS

1.2 PERMUTATION GROUPS

1.2 PERMUTATION GROUPS

( x1 ! xa )( x2 ! xa )!( xa !1 ! xa )} { "$$$$$ #$$$$$%

( x1 ! xb )( x2 ! xb )!( xa !1 ! xb )} " {( xa ! xb )} { "$$$$$ #$$$$$% "$ #$ %

! {( xb " xb+1 )!( xb " xn )} "$$$ $ #$$$$ % n"b terms

$ k ' k !p =& " # i) & ) p = (*1) p. % i =1 (

1.2 PERMUTATION GROUPS

Exercises 1. Consider the following permutations

!1 2 3 4 5 6 7 8 9$ (a ) # &. "2 3 4 5 1 6 7 9 8% !1 2 3 4 5 6$ (b ) # &. "6 5 4 3 1 2%

1.2 PERMUTATION GROUPS

1.3 HOMOMORPHISMS OF GROUPS

1.3 HOMOMORPHISMS OF GROUPS

" a !b % (a, b) $ 2 , 2 ' = (1, 0). 2 # a + b a + b2 &

1.4 RINGS AND FIELDS

1.5 MORE ON GROUPS AND RINGS

1.5 MORE ON GROUPS AND RINGS

1.5 MORE ON GROUPS AND RINGS

1.5 MORE ON GROUPS AND RINGS

1.5 MORE ON GROUPS AND RINGS

1.5 MORE ON GROUPS AND RINGS

Você também pode gostar