Let us take a closer look at the field Q[x]/(x2 - 2). Since this is a field of remainders after division by a quadratic polynomial, the elements of the field are all the linear polynomials Q[x]/(x2 - 2) = {a + bx : a,b Î Q}. We already know how the as and bs operate, they are just rational numbers. What is new and different about this field is the x. If we multiply x ´ x = x2 and then ask for the remainder after division by x2 - 2 we get x2 = 1(x2 - 2) + 2, so the remainder is 2 and we have x2 = 2 in Q[x]/(x2 - 2). Our extension field Q[x]/(x2 - 2) differs from the base field Q only in the addition of a new element x, which satisfies x2 = 2, or in other words x = Ö2. So we have proved Q[x]/(x2 - 2) @ Q(Ö2), where Q(Ö2) is the field generated by the rational numbers Q with the real number Ö2 adjoined.
This is actually the standard pattern. We can look at field extensions generated either by adjoining a new element to the field or by moving from field to the polynomial domain and then "modding out" by an irreducible polynomial. These will turn out to be the same thing if the polynomial used to "mod out" is the minimal polynomial of the element you adjoin. In a graduate algebra course we would spend some time on this point, but for our purposes we will focus mainly on some applications of the study of extension fields. In particular, we will prove the impossibility of trisecting an angle with straightedge and compass, or of duplicating a cube with the same tools. These were classical questions of Greek geometry, but it turns out their answers depend on the theory of extension fields.
The usual approach in secondary and college algebra is just to toss in all the real numbers, or even all the complex numbers, and not to worry about Ö and 3 Ö and the like when looking for answers to polynomial equations. This is not the historical approach. The Greeks were very concerned about the philosophical implications of the existence of irrational numbers and kept careful track of whether they were manipulating "numbers" (which had to be rational) or "lengths" (which could be irrational). It was not until the 15th and 16th centuries that mathematicians started to manipulate real numbers without worry. It is no accident that the 15th and 16th centuries also saw tremendous developments in algebraic reasoning. When you are first working with algebra and polynomials, you usually dont want to worry about what field you are working in. But in the 19th century, mathematicians discovered that when they wanted to understand the theory of polynomial equations, it did help to be very careful about such things. And now that our historical interlude is over, back to the properties of extension fields.
Definition: A basis for an extension field E over a base field F is a set of elements {e1, e2, ... ,en} that satisfies the following two properties
The first rule says that for any e Î E, there is at least one way it can be constructed as a linear combination of the eis. A consequence of the second rule is that there is at most one way to construct any e. So together the rules say a basis is a set of elements such that any element of the extension field can be constructed in exactly one way as a linear combination of the basis elements where the coefficients are drawn from the base field.
Example: The pair {1, x} is a basis for Q[x]/(x2 - 2). Every element can be written as a + bx with a,b Î Q, the base field, in exactly one way.
A basis for an extension field is not unique. For example, if we consider the extension field Q(Ö2,Ö3) over Q, one possible basis is {1, Ö2, Ö3, Ö6}. Another basis is {1, Ö2+Ö3, (Ö2+Ö3)2, (Ö2+Ö3)3}. We wont bother to prove this, but it isnt that hard to check. One thing to note is that while the bases look quite different, they both have 4 elements in them. While there are many possible choices for the elements to make up a basis, it turns out that how many elements you need is forced on you by the extension.
Theorem: Suppose E is an extension field over F. If e1, ... , en and g1, ... , gm are both bases for E over F, then n = m.
Proof: Suppose m ³ n. We will show it is possible to replace the ei by gi while keeping a basis. This process is straightforward, but involves some messy algebra. Since g1 Î E, we can write g1 = f11e1 + + f1nen for some choice of fij Î F. Not all the fij can be 0, else g1 would be 0. Say f11 ¹ 0. Then we can solve our for e1 in our expression for g1 to get e1 = (1/f11)g1 - (f12/f11)e2 - - (f1n/f11)en. Now I claim g1, e2, , en is another basis for E. To check this we must check our two conditions. To verify the first condition, suppose e Î E. Then since e1, , en is a basis, we can find x1, , xn Î F so that e = x1e1 + + xnen = (x1/f11)g1 + (x2 - x1f12/f11)e2 + + (xn - x1f1n/f11)en and condition 1 is satisfied. To check the second condition, we assume x1g1 + x2e2 + + xngn = 0. Then x1f11e1 + (x2 + x1f12)e2 + + (xn + x1f1n)en = 0 and since the ej form a basis, we must have x1f11 = 0, x2 + x1f12 = 0, , xn + x1f1n = 0. Since f11 ¹ 0, the first equality implies x1 = 0. Once we have x1 = 0, the other equations then become x2 = 0, , xn = 0, and the second condition for a basis is also satisfied.
Now we repeat our process and to replace another of the ej with g2. Since g2 Î E, we can write g2 = f21g1 + f22e2 + + f2nen. Now one of f22, , f2n must not be 0, since if all these terms were 0 then we would have g2 = f21g1, from which we could get f21g1 + (-1)g2 = 0, in contradiction of condition 2 of the gj forming a basis. So let's say f22 ¹ 0. Then we can write e2 = -(f21/f22)g1 + (1/f22)g2 - (f23/f22)e3 - - (f2n/f22)en and run through the same process as before to show g1, g2, e3, , en is also a basis. We proceed in this manner until we finally replace all the ej and conclude g1, , gn is a basis.
Now if m > n, we have gn+1 Î E and we can write gn+1 = x1g1 + + xngn for some x1, , xn Î F, since g1, , gn is a basis. But we can rewrite this as x1g1 + + xngn + (-1)gn+1 = 0. But this contradicts condition 2 of the gj being a basis since not all the coefficients are 0 (in particular, the coefficient of gn+1 is -1 ¹ 0). So it can't happen that m > n, and the two bases must have the same number of elements. This is the end of the proof.
Now that we know that all bases must have the same number of elements, we are justified in making the following definition.
Definition: If E is an extension field of F, the degree of the extension, deg[E : F], is the number of elements in a basis for E over F.
It should be noted that not all extensions have bases. For example, Q(p) doesn't have a basis over Q. The problem here is that there is no finite set of elements that satisfies condition 1 for a basis. We will stick to nice finite degree extensions as much as possible. On the other hand, the extension field Q(p) does come up in the question of squaring a circle, which is why I wanted to point out the limits of our definition of degree of extension.
In the next lecture we will extend our study of degree of extension to towers of extension fields. Our big theorem will be that if K is an extension field of E which is an extension field of F, then deg[K : F] = deg[K : E]΄deg[E : F]. With this result we will be ready to tackle trisecting the angle and the other construction problems of antiquity.