Автор: Sheldon Axler  

Теги: linear algebra  

ISBN: 0387982590

Год: 1997

Текст
                    Linear Algebra Done Right
Solutions Manual
Sheldon Axler
7 July 1997
© 1997 Sheldon Axler


This solutions manual provides answers to all the exercises in my book Linear Algebra Done Bight. It is distributed (without charge) only to instructors who are using Linear Algebra Done Right as a textbook. If you are such an instructor, please contact me (preferably via e-mail) to obtain the appropriate version of the solutions manual. This version of the solutions manual is intended to accompany the second edition (1997) of Linear Algebra Done Right. This solutions manual is copyrighted by me, but instructors using Linear Algebra Done Right as a textbook have permission to modify and/or abridge this solutions manual and to distribute copies only to students enrolled in their courses (if students are charged for copies, the price may not exceed the cost of materials and reproduction). If you want to modify and/or abridge this solutions manual, you can request (via e-mail) the ЖЩК file from me (or from Springer if I am no longer around). Some instructors may not want to distribute any of these solutions to students; other instructors may want to save class time by distributing to students solutions to some or even many of the exercises. Obviously students should not sec the solution to an exercise until they have attempted the exercise themselves. Please do not distribute solutions needlessly, especially if there is a danger that many solutions will get passed down to next year's class. I will not distribute this solutions manual to students, so if neither you nor colleagues in your department make copies for students, you can be reasonably confident that your students are working the exercises without benefit of these solutions. This solutions manual has not been subjected to the same amount of scrutiny as the book, so errors are more likely. I would be grateful for information about any errors that you notice. If you know nicer solutions to any of the exercises than the solutions given here, please let me know so that I can improve future versions of this solutions manual. Please check my web site for errata arid other information about Linear Algebra Done Right. I welcome comments about either the book or the solutions manual. Have fun! Sheldon Axler Mathematics Department San Francisco State University San Francisco, CA 94132, USA e-mail: axlerQmath.sfsu.edu www home page: http://math.sfsu.edu/axler
Chapter 1 Vector Spaces 1. Suppose α and b are real numbers, not both 0. Find real numbers с and d such that l/(a + Ы) = с + di. SOLUTION: Multiplying both the numerator and the denominator of the left side of the equation above by α - Ы gives a — bi ,. —z—^ = c + dt. Thus we must have α — b C = 2 ■ 13. and d = 2 ■ 1.2' a2 + br az + bz because α and b are not both 0, we are not dividing by 0. Comment: Note that these formulas for с and d are derived under the assumption that a + bi has a multiplicative inverse. However, we can forget about the derivation and verify (using the definition of complex multiplication) that {а + ы)Ьть^--Лт^ = ^ which shows that every nonzero complex number does indeed have a multiplicative inverse. 1
Chapter 1. Vector Spaces Show that -1 + y/3i 2 is a cube root of 1 (meaning that its cube equals 1). SOLUTION: Using the definition of complex multiplication, we have l + \/3i\2 -l-y/3i (™r = Thus /-1 + у/Зг\3 = /-1-у/Зг\ /-1 + у/Зг\ = 1. Prove that - (-υ) = υ for every υ € V. Solution: Let ν G V. By the definition of additive inverse, we have ν + (-v) = 0. The additive inverse of —v, which by definition is —(—v), is the unique vector that when added to -v gives 0. The equation above shows that ν has this property. Thus -(-v) = v. Comment: Using 1.6 twice leads to another proof that -(-υ) = υ. However, the proof given above uses only the additive structure of V, whereas a proof using 1.6 also uses the multiplicative structure. Prove that if α € F, υ G V, and av = 0, then α = 0 or υ = 0. Solution: Suppose that α G F, v G V, and av = 0. We want to prove that α = 0 or ν = 0. If α = 0, then we are done. So suppose that α φ 0. Multiplying both sides of the equation above by 1/a gives
3 Chapter 1. Vector Spaces The associative property shows that the left side of the equation above equals Iv, which equals v. The right side of the equation above equals 0 (by 1.5). Thus υ = 0, completing the proof. 5. For each of the following subsets of F3, determine whether it is a subspace ofF3: (a) {(xi, x2,x3) € F3 : xi + 2x2 + 3x3 = 0}; (b) {(Х1,х2,хз) € F3 : xi + 2x2 + 3x3 = 4}; (c) {(хьхг.хз) G F3 : xix2X3 = 0}; (d) {(xi,x2,X3)GF3:xi = 5x3}. Solution: (a) Let U = {(χι,x2, хз) G F3 : χι + 2x2 + 3x3 = 0}. To show that U is a subspace of F3, first note that (0,0,0) € U, so U is nonempty. Next, suppose that (x\,X2txz) G U and (уъУ2,Уз) G U. Then xi + 2x2 + Зхз = 0 yi + 2y2 + 3y3 = 0. Adding these equations, we have (*i +Vi) + 2(x2 + y2) + 3(x3 + уз) = 0, which means that (xi + yi,X2 + 2/2,^3 + Уз) G U. Thus U is closed under addition. Next, suppose that (х1,Х2,хз) G U and α € F. Then xi + 2x2 + Зхз = 0. Multiplying this equation by a, we have (αχι) + 2(ахг) + З(ахз) = 0, which means that (ах1,ахг,ахз) € U. Thus U is closed under scalar multiplication. Because U is a nonempty subset of F3 that is closed under addition and scalar multiplication, U is a subspace of F3.
4 Chapter 1. Vector Spaces (b) Let C/ = {(xi,x2,:c3)€F3:xi+2x2 + 3x3 = 4}. Then (4,0,0) € U but 0(4,0,0), which equals (0,0,0), is not in U. Thus U is not closed under scalar multiplication. Thus U is not a subspace of F3. (c) Let U = {(хьхг.хз) € F3 : X1X2X3 = 0}. Then (1,1,0) € U and (0,0,1) € U, but the sum of these two vectors, which equals (1,1,1), is not in U. Thus U is not closed under addition. Thus U is not a subspace of F3. (d) Let U = {(хьхг.хз) € F3 :χχ = 5x3}. To show that U is a subspace of F3, first note that (0,0,0) € U, so U is nonempty. Next, suppose that (хьХ2,хз) € U and (2/1,2/21 Уз) € U. Then X\ = 5Хз У1 = 52/3. Adding these equations, we have xi +yi = 5(х3+2/з), which means that (xi + 2/i,X2 + 2/2,хз + Уз) € U. Thus U is closed under addition. Next, suppose that (хьХ2)Хз) € U and α € F. Then xi = 6x3. Multiplying this equation by a, we have αχι = 5(ахз), which means that (ах1,ахг,ахз) € U. Thus U is closed under scalar multiplication. Because U is a nonempty subset of F3 that is closed under addition and scalar multiplication, U is a subspace of F3.
Chapter 1. Vector Spaces Give an example of a nonempty subset U of R2 such that U is closed under addition and under taking additive inverses (meaning —u € U whenever и € J7), but U is not a subspace of R2. SOLUTION: Let U = {(τη,ή) : m and η are integers}. Then clearly U is closed under addition and under taking additive inverses. However, (1,1) € U but j(l, 1), which equals (^, 5), is not in U, so U is not closed under scalar multiplication. Thus U is not a subspace of R2. Of course there are also many other examples. Give an example of a nonempty subset U of R2 such that U is closed under scalar multiplication, but U is not a subspace of R2. Solution: Let U be the union of the two coordinate axes in R2. More precisely, let U = {(χ, 0) : χ € R} U {(0, у) : у € R}. Then clearly U is closed under scalar multiplication. However, (1,0) and (0,1) are in U but their sum, which equals (1,1) is not in U, so U is not closed under addition. Thus U is not a subspace of R2. Of course there are also many other examples. Prove that the intersection of any collection of subspaces of V is a subspace of V. Solution: Suppose {UQ}aer is a collection of subspaces of V; here Г is an arbitrary index set. We need to prove that Паег ^<*> which equals the set of vectors that are in Ua for every a € Г, is a subspace of V. The additive identity 0 is in UQ for every а € Г (because each Ua is a subspace of V). Thus 0 € ПаеГ^а· ^n particular, По€Г^а 'IS a nonempty subset of V. Suppose u, υ € Паег Ua· Then u, v. € UQ for every a € Г. Thus u + v € UQ for every а € Г (because each UQ is a subspace of V). Thus и + υ € Паег U<*· Thus Паег ^ is closed under addition. Suppose и € Паег Ua and а € F. Then и € Ua for every а € Г. Thus au € Ua for every а € Г (because each Ua is a subspace of V). Thus au € Паег U<*· Thus Паег U<* ls closed under scalar multiplication. Because Паег UQisa. nonempty subset of V that is closed under addition and scalar multiplication, Паег U<* 1S a subspace of V. COMMENT: For many students, the hardest part of this exercise is understanding the meaning of an arbitrary intersection of sets. Instructors who
б Chapter 1, Vector Spaces do not want to deal with this issue should change the exercise to "Prove that the intersection of any finite collection of subspaces of V is a subspace of V." Many students will then prove that the intersection of two subspaces of V is a subspace of V and use induction to get the result for finite collections of subspaces. 9. Prove that the union of two subspaces of V is a subspace of V if and only if one of the subspaces is contained in the other. Solution: Suppose U and W are subspaces of V such that UuW is a subspace of V. We will use proof by contradiction to show that U dW ox W С U. Suppose that our desired result is false. Then U <£ W and W gLU. This means that there exists и € U such that и £ W and there exists w € W such that w £ U. Because и and w are both in C/U W, which is a subspace of V, we can conclude that и + w € U U W. Thus u + tu€C/oru + tu€ W. First consider the possibility that u+w € C/. In this case w, which equals (u + w) + (—it), would be in the sum of two elements of U and hence we would have w € U, contradicting our assumption that w £U. Now consider the possibility that u+w € W. In this case u, which equals (u + w) + (—го), would be in the sum of two elements of W and hence we would have и € W, contradicting our assumption that и ф W. The two paragraphs above show that u + w £ U and u + w £ W, contradicting the final sentence of the first paragraph of this solution. This contradiction completes our proof that U С W or W С U. The other direction of this exercise is trivial: if we have two subspaces of V", one of which is contained in the other, then the union of these two subspaces equals the larger of them, which is a subspace of V. 10. Suppose that U is a subspace of V. What is U + U1 Solution: By definition, U+U = {u+υ :u,veU}. Clearly U С U+U because if u € 17, then и equals и + 0, w*hich expresses и as a sum of two elements of U. Conversely, U + U CU because the sum of two elements of U is an element of U (because U is a subspace of V). Conclusion: U + U = U. 11. Is the operation of addition on the subspaces of V commutative? Associative? (In other words, if J7i, С/г, Us are subspaces of V, is J7i +U2 = U2 + U1I Is (Ux + U2) +U3 = Ui+ (U2 + J73)?) Solution: Suppose U\, С/г, С/3 are subspaces of V. A typical element of C/i +C/2 is a vector of the form щ +u2, where щ € U\ and u2 € U2. Because addition of vectors is commutative, u\ + u2 equals
Chapter 1. Vector Spaces u2 + щ, which is a typical element of U2 + U1. Thus C/i + C/2 = C/2 + C/i. In other words, the operation of addition on the subspaces of V is commutative. A typical element of (C/i + U2) + Щ is a vector of the form (щ + u2) + из, where щ G C/i, иг G C/2, and из € C/3. Because addition of vectors is associative, (ui + иг) + из equals щ + (иг + из), which is a typical element of C/i + (C/2 + C/3). Thus (C/i + C/2) + C/3 = C/i + (C/2 + C/3). In other words, the operation of addition on the subspaces of V is associative. Does the operation of addition on the subspaces of V have an additive identity? Which subspaces have additive inverses? Solution: The subspace {0} is an additive identity for the operation of addition on the subspaces of V. More precisely, if C/ is a subspace of V, then U + {0} = {0} + U = U. For a subspace U of V to have an additive inverse, there would have to be another subspace W of V such that U + W = {0}. Because both U and W are contained in C/ + W, this is possible only if U = W = {0}. Thus {0} is the only subspace of V that has an additive inverse. Prove or give a counterexample: if C/i, С/г, W are subspaces of V such that Ui + W = C/2 + W, then C/i = C/2. Solution: To construct a counterexample for the assertion above, choose V to be any nonzero vector space. Let C/i = {0}, C/2 = V, and W = V. Then C/j + W and C/2 + W are both equal to V, but C/i 7ε C/2. Of course there are also many other examples. Suppose U is the subspace of ^(F) consisting of all polynomials ρ of the form p(z) =az? +bz5, where a, b G F. Find a subspace W of V(F) such that "P(F) = U φ W. Solution: Let W be the set of all polynomials (with coefficients in F) whose z2-coefficient and ^-coefficient both equal 0. Then every polynomial in V(F) can be written uniquely in the form ρ+ 9, where ρ G U and q &W. Thus "P(F) = U φ W.
8 Chapter I. Vector Spaces Comment: There are other possible choices for W that give a correct solution to this exercise, but the choice for W made above is certainly the most natural one. 15. Prove or give a counterexample: if U\, U2, W are subspaces of V such that V = UX@W and V = U2@W, then Ui = U2. SOLUTION: To construct a counterexample for the assertion above, let V = F2, let J7i = {(χ,Ο) : χ € F}, let U2 = {(0,y) : у € F}, and let W = {(z,z):zeF}. Then F2 = Ui®W and F2 = J72©H^ as is easy to verify, but J7i φ U2. Of course there are also many other examples.
Chapter 2 Finite-Dimensional Vector Spaces 1. Prove that if (υι,..., vn) spans V, then so does the list (v\ -v2,V2 -υ3,...,υη_ι - vn,vn) obtained by subtracting from each vector (except the last one) the following vector. Solution: Suppose (v\,. ..,vn) spans V. Let υ € V. To show that υ € span(vi - щ, щ — vz, ■ ■ ■, υη_ι - vn, vn), we need to find αϊ,..., αη € F such that ν = αι(υι - v2) + а2(г>2 - v3) + ...αη_ι(υη-ι - vn) +anvn. Rearranging terms of the equation above, we see that we need to find αϊ,..., an € F such that (a) υ = aw + (a2 - ai)v2 + (a3 - a2-)vz + ■ ■ ■ + (an - α„_ι)υη. Because (v\,..., vn) spans V, there exist b\,..., bn € F such that (b) υ = b\Vi + b2V2 + 63^3 Η 1- bnvn- Comparing equations (a) and (b), we see that (a) will be satisfied if we choose αϊ to equal b\ and then choose a2 to equal 62 + o-ι and then choose аз to equal 63 + a2, and so on. 9
10 Chapter 2. Finite-Dimensional Vector Spaces 2. Prove that if (i>i,..., vn) is linearly independent in V, then so is the list (i>i -V2,v2 -ι>3,...,ι>η-ι -νη,υη) obtained by subtracting from each vector (except the last one) the following vector. Solution: Suppose (i>i,..., vn) is linearly independent in V. To prove that the list displayed above is linearly independent, suppose oi,...,a„€F are such that a\(vi -v2) + a2(v2 - i>3) + ■ ■ ·+ αη_ι(ι>η-ι - νη) + α„ι>η = 0. Rearranging terms, the equation above can be rewritten as a-ivi + (a2 -αι)υ2 + (α3 - a2)v3 + ■·· + (an- θη-{)υη = 0. Because (υι,..., vn) is linearly independent, the equation above implies that αϊ =0 a2 - αϊ = 0 α3 - α2 = 0 о-п -о,п-\ =0. The first equation above tells us that αϊ = 0. That information, combined with the second equation, tells us that a2 = 0. That information, combined with the third equation, tells us that a3 = 0. Continue in this fashion, getting αϊ = -■· = an = 0. Thus (vi - v2,V2 - v3,. ..,υη_ι - νη,νη) is linearly independent. 3. Suppose (υι,..., vn) is linearly independent in V and w € V. Prove that if (v\ +w,...,vn+w) is linearly dependent, then w G span(vi,...,υη)- Solution: Suppose {v\ + w,... ,vn + w) is linearly dependent. Then there exist scalars ab ... ,an, not all 0, such that αι(υι +w) Η Ι-αη(υη + го) = 0. Rearranging this equation, we have αιΐ>ι Η 1- OnVn = -(ai + 1- dn)w.
11 Chapter 2. Finite-Dimensional Vector Spaces If αϊ Η \-an were 0, then the equation above would contradict the linear independence of (υι,..., vn)· Thus αϊ + · · · + an φ 0. Hence we can divide both sides of the equation above by -(αϊ + ··· + αη), showing that w G span(vi,...,t;n). 4. Suppose m is a positive integer. Is the set consisting of 0 and all polynomials with coefficients in F and with degree equal to m a subspace of V(F)? Solution: The set consisting of 0 and all polynomials with coefficients in F and with degree equal to m is not a subspace of ^(F) because it is not closed under addition. Specifically, the sum of two polynomials of degree m may be a polynomial with degree less than m. For example, suppose m = 2. Then 7 + 4z + 5z2 and 1 + 2z - 5z2 are both polynomials of degree 2 but their sum, which equals 8 + 6z, is a polynomial of degree 1. 5. Prove that F°° is infinite dimensional. Solution: For each positive integer m, let е™, be the element of F°° whose mth coordinate equals 1 and whose other coordinates equal 0: em = (0I... ,0,1,0,...). ΐ mth coordinate Then (ei,..., em) is a linearly independent list of vectors in F°°, as is easy to verify. This implies, by the marginal comment attached to 2.6, that F°° is infinite dimensional. 6. Prove that the real vector space consisting of all continuous real-valued functions on the interval [0,1] is infinite dimensional. Solution: Let V denote the real vector space of all continuous real- valued functions on the interval [0,1). For each positive integer m, the list (l,x, ...,xm) is linearly independent in V (because if αο,...,α„, G R are such that an + αχχ Λ 1- Omim = 0 for every χ G [0,1], then the polynomial above has infinitely many roots and hence all its coefficients must equal 0). This implies, by the marginal comment attached to 2.6, that V is infinite dimensional.
12 Chapter 2. Finite-Dimensional Vector Spaces 7. Prove that V is infinite dimensional if and only if there is a sequence vi,V2,... of vectors in V such that (v\,...,vn) is linearly independent for every positive integer n. Solution: First suppose that V is infinite dimensional. Choose v\ to be any nonzero vector in V. Choose V2,vs,... by the following inductive process: suppose that v\,..., υη-1 have been chosen; choose any vector vn € V such that vn £ span(ui,.. .,υη_ι)—because V is not finite dimensional, span(ui,... ,υη_ι) cannot equal V so choosing vn in this fashion is possible. The linear dependence lemma (2.4) implies that (υι,..., vn) is linearly independent for every positive integer n, as desired. Conversely, suppose there is a sequence υι, иг,... of vectors in V such that (vi,..., vn) is linearly independent for every positive integer n. This implies, by the marginal comment attached to 2.6, that V is infinite dimensional. 8. Let U be the subspace of R5 defined by U = {(11,12,13,14^5) € R5 : χι = 3x2 and хз = 7x4}. Find a basis of U. Solution: Obviously U = {(3X2,X2,7X4,X4,X5) : X2,X4,X5 G R}. From this representation of U, we see easily that ((3,1,0,0,0), (0,0,7,1,0), (0,0,0,0,1)) is a basis of U. Of course there are also other possible choices of bases of U. 9. Prove or disprove: there exists abasis (ро,Р\,Р2,Рз) of ^(F) such that none of the polynomials ро,Р\,Р2,Рз has degree 2. Solution: Define Ро,РьР2,Рз е ^(F) by PoOO = 1. Pi{z) = z, P2(z) = z2 + z3, Pz{z) = zz.
13 Chapter 2. Finite-Dimensional Vector Spaces None of the polynomials ро,РьР2)Рз has degree 2, but (рсьРьРг.Рз) is a basis of ^(F), as is easy to verify. Of course there are also other possible choices of bases of ^(F) without using polynomials of degree 2. 10. Suppose that V is finite dimensional, with dim V = n. Prove that there exist one-dimensional subspaces U\,..., Un of V" such that V = Ux φ · · · Φ Un. Solution: Let (vi,...,vn) be a basis of V. For each j, let Uj equal span(vj); in other words, Uj = {avj : a € F}. Because (v\,..., vn) is a basis of V, each vector in V can be written uniquely in the form axvi + ■ · ■ + anvn, where αϊ,..., αη € F (see 2.8). By definition of direct sum, this means that V = Ux®---®Un. 11. Suppose that V is finite dimensional and U is a subspace of V such that dimJ7 = dim V. Prove that U = V. Solution: Let (ui,...,ti„) be a basis of U. Thus η = dim[7, and by hypothesis we also have η = dim V. Thus (ui,..., un) is a linearly independent (because it is a basis of U) list of vectors in V with length dimV". From 2.17, we see that (щ,...,ип) is a basis of V. In particular every vector in V is a linear combination of (u\,... ,Un)· Because each Uj € U, this implies that U = V. 12. Suppose that ρο,Ρι, · · · ,pm are polynomials in Vm(F) such that Pj(2) = 0 for each j. Prove that (ρο,Ρι, · · · ,Pm) is not linearly independent in Vm(F). Solution: Because py(2) = 0 for each j, the constant polynomial 1 is not inspan(po,... ,pm). Thus (prj,-..,Pm) is not a basis of Vm(F). Because (poi · · ■ iPm) is a list of length m + 1 and Vm(F) has dimension m + 1, this implies (by 2.17) that (po,... ,pm) is not linearly independent. 13. Suppose U and W are subspaces of R8 such that dimC/ = 3, dimW = 5, and U + W = R8. Prove that U П W = {0}. Solution: We know (from 2.18) that dim(J7 + W) = dimU + dim W - dim(J7 П W).
14 Chapter 2. Finite-Dimensional Vector Spaces Because dim(J7 + W) = 8, dim С/ = 3, and dimH^ = 5, this implies that dim(C7 nW^) = 0. Thus U П W = {0}. 14. Suppose that U and W are both five-dimensional subspaces of R9. Prove that UnW φ {Ъ}. Solution: Using 2.18 we have 9 > dim(C7 + W) = dim U + dim W - dim{U П W) = lO-dim(tfnW). Thus dim(C/ П W) > 1. In particular, U П W φ {0}. 15. You might guess, by analogy with the formula for the number of elements in the union of three subsets of a finite set, that if Όγ,ϋι,£/3 are subspaces of a finite-dimensional vector space, then dimit/i + U2 + Uz) = dim U\ + dim U2 + dim £/3 - dim(Ui П U2) - dim(C/i П Uz) - dim(C72 П Uz) + dim(C7i П U2 П U3). Prove this or give a counterexample. Solution: To give a counterexample, let V = R2, and let U\ ={(x,0) :x€R}, ^2 = {(0,y):y€R}, UZ = {(x,x) :x€ R}. Then Ui + U2 + Uz = R2, so dim(C7i + U2' + Uz) = 2. However, dim Ui = dim U2 = dim C/3 = 1 and dim(C/i П U2) = dim(C7i П Щ) = dim(C72 П Uz) = dim(C/i П U2 П Uz) = 0. Thus in this case our guess would reduce to the formula 2 = 3, which obviously is false. Of course there are also many other examples.
15 Chapter 2. Finite-Dimensional Vector Spaces 16. Prove that if V is finite dimensional and U\,...,Um are subspaces of V, then dim(J7i + · · · + Um) < dim U\ 4 + dim Um. Solution: For each j = l,...m, choose a basis for Uj. Put these bases together to form a single list of vectors in V. Clearly this list spans U\ -\ \-Um. Hence the dimension of U\ Η fJ7m is less than or equal to the number of vectors in this list (by 2.10), which equals dim J7i -\ l· dim Um. In other words, dim(J7i Η 1- Um) < dim U\ Λ 1- dim Um. 17. Suppose V is finite dimensional. Prove that if U\,..., Um are subspaces of V such that V = U\ φ · · · φ Um, then dim V = dim J7i Η \- dim Um. Comment: This exercise deepens the analogy between direct sums of subspaces and disjoint unions of subsets. Specifically, compare this exercise to the following obvious statement: if a finite set is written as a disjoint union of subsets, then the number of elements in the set equals the sum of the number of elements in the disjoint subsets. Solution: Suppose that J7i,..., Um are subspaces of V such that V = U\ θ · · · θ Um. For each j = 1,... m, choose a basis for Uj. Put these bases together to form asingle list В of vectors in V. Clearly В spans J7i Η \-Um, which equals V. If we show that В is also linearly independent, then it will be a basis of V. Thus the dimension of V will equal the number of vectors B. In other words, we will have dim V = dim U\ Η + dim J7m, as desired. We still need to show that В is linearly independent. To do this, suppose that some linear combination of В equals 0. Write this linear combination as щ -\ 1- ιι,η, where we have grouped together the terms that come from the basis vectors of J7i and called their sum «i, and similarly up to 1%. Thus we have Щ + ■ · · + Щп = 0, where each Uj € Uj. Because V — U\ φ · - · φ J7m, this implies that each Uj equals 0. Because each Uj is a linear combination of our basis of Uj, all the
16 Chapter 2. Finite-Dimensional Vector Spaces coefficients in the linear combination defining щ must equal 0. Thus all the coefficients in our original linear combination of В must equal 0. In other words, В is linearly independent, completing our proof.
Chapter 3 Linear Maps 1. Show that every linear map from a one-dimensional vector space to itself is multiplication by some scalar. More precisely, prove that if dim V = 1 and Τ G £(V, V), then there exists α G F such that Tv = av for all υ G V. Solution: Suppose dim V = 1 and Τ € C(V, V). Let и be any nonzero vector in V. Then every vector in V is a scalar multiple of u. In particular, Tu = au for some α G F. Now consider a typical vector υ G V. There exists b G F such that ν = bu. Thus Tv = T(bu) = bT(u) = b(au) = a(bu) = av. 2. Give an example of a function /: R2 —> R such that f(av) = af{y) for all α G R and all ν G R2 but / is not linear. Solution: Define /: R2 -> R by /(s,y) = (s3 + y3)1/3. Then f(av) — af(v) for all α G R and all ν G R2. However, / is not linear because /(1,0) = 1 and /(0,1) = 1 but 17
18 Chapter 3. Linear Maps /((1,0)+ (0,1)) =/(1,1) 7^/(1,0) + /(0,1). Of course there are also many other examples. Comment: This exercise shows that homogeneity alone is not enough to imply that a function is a linear map. Additivity alone is also not enough to imply that a function is a linear map, although the proof of this involves advanced tools that are beyond the scope of this book. 3. Suppose that V is finite dimensional. Prove that any linear map on a sub- space of V can be extended to a linear map on V. In other words, show that if U is a subspace of V and S € C(U, W), then there exists Τ € C(V, W) such that Tu = Su for all и €U. Solution: Suppose U is a subspace of V and S € C(U, W). Let (щ,..., ιι,η) be a basis of U. Then (ui,..., ttm) is a linearly independent list of vectors in V, and so can be extended to a basis (щ,..., Um, v\,..., vn) of V (by 2.12). Define Τ € C(V, W) by Т(<цщ +...0^11^. + biVi + ...bnvn) = aiSui Η {-a^Sum. Then Tu = Su for all и € U. COMMENT: Defining T: V -> W by Tv={Sv if* €17; \0 \[у$и. does not work because this map is not linear. 4. Suppose that Γ is a linear map from V to F. Prove that if и € V is not in nullT, then V = nullTe {au: a € F}. Solution: Suppose и € V is not in nullT. If a € F and au € nullT, then 0 = T(au) = aTu, which implies that α = 0 (because Tu φ 0). Thus nullTn{au:a€F} = {0}. If ν € V, then
19 Chapter 3. Linear Maps V={V-¥aU) Tv Tu Note that Τ (υ — |^ιι) = Tv-lj^Tu = 0. Thus the equation above expresses an arbitrary vector υ € V as the sum of a vector in null Γ and a scalar multiple of u. Hence V = null Τ + {au : a € F}. Using 1.9, we conclude that V = null Τ φ {au : a € F}. Suppose that Τ € C(V, W) is injective and (v\,..., υη) is linearly independent in V. Prove that {Tv\,...,Tvn) is linearly independent in W. Solution: To show that {Tv\,..., Tvn) is linearly independent, suppose αϊ,..., On € F are such that αιΓυι +··· + αηΤνη = 0. Because Γ is a linear map, this equation can be rewritten as Γ(αιυι +···+αηυη) = 0. Because Τ is injective, this implies that a\V\ -\ Ι-αηυη = 0. Because {v\,..., vn) is linearly independent, the equation above implies that a\ = '" = α„ = 0. Thus {Tv\,...,Tvn) is linearly independent. Prove that if S\,...,Sn are injective linear maps such that S\...Sn makes sense, then S\ ...Sn is injective. Solution: Suppose that Si,...,Sn are injective linear maps such that Si...Sn makes sense (which means that the domains of S\,..., Sn are such that Si... Sn is well defined). Suppose υ is.a vector in the domain of S\... Sn (which equals the domain of Sn) such that {Sl...Sn)v = 0. To show that S\... Sn is injective, we need to show that ν = 0 (see 3.2). To do this, rewrite the equation above as Si((S2...Sn)v) = 0. Because Si is injective, this implies that
20 Chapter 3. Linear Maps (S2...Sn)v = 0. The same argument, now applied to the equation above, shows that {S3...Sn)v = 0. Repeat this process until reaching the equation Snv = 0, which implies (because Sn is injective) that υ = 0, as desired. 7. Prove that if (vi,...,vn) spans V and Τ € C(V,W) is surjective, then (Tvu...,Tvn) spans W. Solution: Suppose that {v\,..., vn) spans V and Τ € £(V, W) is surjective. Let w € W. Because Τ is surjective, there exists ν ζ. V such that ϋΓυ = tu. Because (υι,..., vn) spans V, there exist αϊ,..., αη € F such that ν = aivi Η Ι-αηυη. Applying Τ to both sides of this equation, we get Tv = axTvi + · · · + OnTvn. Because Tv = w, the equation above implies that w € span(!Tt;i,..., Tvn). Because w was an arbitrary vector in W, this implies that {Tv\,..., Tvn) spans W. 8. Suppose that V is finite dimensional and that Τ € C(V, W). Prove that there exists a subspace U of V such that U П null Γ = {0} and range Τ = {TuiueU}. Solution: There exists a subspace U of V such that V = mniT®U; this follows from 2.13 (with null Γ playing the role of U and U playing the role of W). From the definition of direct sum, we have U Π null Γ = {0}. Obviously range Γ D {Tu : и € U}. To prove the inclusion in the other direction, suppose ν € V. Then there exist w € null Γ and u' €U such that υ = w + u'. Applying Τ to both sides of this equation, we have Tv = Tw + Tu' = Tu'. Thus Tv € {Tu : и € 17}. Because υ was an arbitrary vector in V (and thus Tv is an arbitrary vector in range T), this implies that
21 Chapter 3. Linear Maps ranger С {74i :u€C/}. Thus rangeX1 = {Tu : и € 17}, as desired. 9. Prove that if Τ is a linear map from F4 to F2 such that null Τ = {(χι, X2, яз, £<t) € F4 : X! = 5x2 and X3 = 7x4}, then Τ is surjective. Solution: Suppose Τ € £( FA, F2) is such that null Γ is as above. Then ((5,1,0,0), (0,0,7,1)) is a basis of nullT, and hence dim null Γ = 2. From 3.4 we have dim range Τ = dim F4 - dim null Τ = 4-2 = 2. Because range Γ is a two-dimensional subspace of R2, we have range Τ = R2. In other words, Τ is surjective. 10. Prove that there does not exist a linear map from F5 to F2 whose null space equals {(χι,Χ2)^3)^4)^5) e F5 : χι = 3x2 and X3 = x4 = x5}. Solution: Suppose U is the subspace of F5 displayed above. Then ((3,1,0,0,0), (0,0,1,1,1)) is a basis of U, and hence dim U = 2. If Γ € £(F5, F2) then from 3.4 we have dim null Τ = dim F5 — dim range Τ = 5 - dim range Τ >3 >dimJ7, where the first inequality holds because range Г С F2. The inequality above shows that if Г € £(F5, F2), then null Γ φ U, as desired. 11. Prove that if there exists a linear map on V whose null space and range are both finite dimensional, then V is finite dimensional. Solution: Suppose there exists a linear map Τ from V into some vector space such that null Τ and range Τ are both finite dimensional. Thus
22 Chapter 3. Linear Maps there exist vectors щ,...,!!^ € V and w\,.. .,wn € range Τ such that {u\,...,Uj„) spans null Γ and (w\,... ,wn) spans range T. Because each Wj € range T, there exists Vj € V such that tuj = Tvj. Suppose υ € V". Then Γυ € range Γ, so there exist b\,...,bn € F such that Tv = b\Wi Η \r bnwn = biTvi + ■ ■ ■ + bnTvn = T(blVl + ...bnvn). The equation above implies that T(v — b\Vi — ... — bnvn) = 0. In other words, υ -b\V\ - ··· - bnvn € nullT. Thus there exist αϊ,...,am € F such that υ - bivi bnvn = сцщ Η 1- amum. The equation above can be rewritten as ν = aim -{ 1- amum + Mi -\ 1- bnvn. The equation above shows that an arbitrary vector ν € V is a linear combination of («i,..., um, v\,..., vn). In other words, (u\,..., Um, V\,..., vn) spans V. Thus V is finite dimensional. Comment: The hypothesis of 3.4 is that V is finite dimensional (which is what we are trying to prove in this exercise), so 3.4 cannot be used in this exercise. 12. Suppose that V and W are both finite dimensional. Prove that there exists a surjective linear map from V onto W if and only if dim W < dim V. Solution: First suppose that there exists a surjective linear map Τ from V onto W. Then dim W = dim range Τ = dim V - dim null Τ < dim V, where the second equality comes from 3.4. To prove the other direction, now suppose that dim W < dim V. Let (tui,..., wm) be a basis of W and let (v\,. ..,vn) be a basis of V. For oi,..., α„ € F define Γ(αιυι Η 1- anvn) by
23 Chapter 3. Linear Maps T{axvx + · · · + anvn) = a^wi +■·■ + OmWm. Because dim W < dim V, we have m < η and so a„, on the right side of the equation above makes sense. Clearly Γ is a surjective linear map from V onto W. 13. Suppose that V and W are finite dimensional and that U is a subspace of V. Prove that there exists Τ € C(V, W) such that null Τ - U if and only ifdimt/>dimVr-dimH'. Solution: First suppose that there exists Τ € C(V, W) such that nuUT = J7. Then dimJ7 = dimuullT = dim V — dim range Τ > dim V - dim W, where the second equality comes from 3.4. To prove the other direction, now suppose that dim U > dim V — dim W. Let (iti,... ,Um) be a basis of U. Extend to a basis (ui,... ,umii;i,... ,vn) of V. Let (w\,..., wp) be a basis of W. For a\,...,a„,,b\,...,bn € F define T(aiui + · · · + anUm + Mi + · · · + bnvn) by T{aiu\ Η 1- CLmUm + biv\ Η 1- bnvn) = biwi Η \- bnwn. Because dim W > dim V — dim U, we have p>n and so wn on the right side of the equation above makes sense. Clearly Τ € C(V, W) and null Γ = U. 14. Suppose that W is finite dimensional and Τ € C(V, W). Prove that Τ is injective if and only if there exists S € C(W, V) such that ST is the identity map on V. Solution: First suppose that Τ is injective. Define S': range Γ —> V by S'(Tv) = v; because Τ is injective, each element of range Γ can be represented in the form Tv in only one way, so Γ is well defined. As can be easily checked, S' is a linear map on range T. By Exercise 3 of this chapter, S' can be extended to a linear map S € C(W, V). If ν € V, then (ST)v = S{Tv) = S'{Tv) = v. Thus ST is the identity map on V", as desired.
24 Chapter 3. Linear Maps To prove the implication in the other direction, now suppose that there exists S € C(W, V) such that ST is the identity map on V. If u, ν e V are such that Tu = Tv, then и = {ST){u) = S{Tu) = S(Tv) = (ST)v = ν and hence и = v. Thus Г is injective, as desired. 15. Suppose that V is finite dimensional and Τ € C(V, W). Prove that Τ is surjective if and only if there exists S € C(W, V) such that TS is the identity map on W. Solution: First suppose that Τ is surjective. Thus W, which equals rangeT, is finite dimensional (by 3.4). Let (tui,... ,wm) be a basis of W. Because Τ is surjective, for each j there exists Vj Ε V such that Wj = Tvj. Define S € C(W, V) by S(aiioi Η \- OmWm) = a\V\ -\ (- amvm. Then {TS){a\Wi -\ \-amWm) = T(cliVi Η l-amt;m) = αιΓυι +-··+αηχΤυτη = aitoi Η |-amtum. Thus TS is the identity map on W. To prove the implication in the other direction, now suppose that there exists S € C(W, V) such that TS is the identity map on W. If w e W, then ш = T(Sw), and hence tu € range T. Thus range Γ = W. In other words, Τ is surjective, as desired. 16. Suppose that U and V are finite-dimensional vector spaces and that S € £(V, W), Τ Ε C(U, V). Prove that dim null ST < dim null S + dim null T. Solution: Define a linear map T": null ST -> V by T'u = Tu. If u € null ST, then 5(Ги) = 0, which means that Tu € null S. In other words, range Τ С null 5. Now dim null ST = dim null Τ + dim range Γ7 < dim null T' + dim null S < dim null Τ + dim null S,
25 Chapter 3. Linear Maps where the first line follows from 3.4 (applied to T1), the second line holds because range T1 с null S, and the third line holds because of the obvious inclusion null T1 с null T. 17. Prove that the distributive property holds for matrix addition and matrix multiplication. In other words, suppose A, B, and С are matrices whose sizes are such that A(B+C) makes sense. Prove that AB + AC makes sense and that A(B +C) = AB + AC. Solution: Because A(B+C) makes sense, В and С must have the same size. Furthermore, the number of columns of A (let's call this number n) must equal the number of rows of В and C. All this means that AB + AC makes sense. To prove that A(B + C) = AB + AC, just use the definition of matrix addition, the definition of matrix multiplication, and the usual distributive property for elements of F. Specifically, let a.jtk, bj,fc, and Cj·^ denote the entries in row j, column к of A, B, and C, respectively. The entry in row j, column к of В + С is bj^ + Cjfc. Thus the entry in row j, column к of A(B + C) is η ^2ajAbr,k +Cr,fc). r=l which equals η π У] 0-j,rbr,k + Σ аЗ,г^г,ку г=1 г=1 which equals the entry in row j, column к of AB + AC, as desired. 18. Prove that matrix multiplication is associative. In other words, suppose A, B, and С are matrices whose sizes are such that (AB)C makes sense. Prove that A(BC) makes sense and that (AB)C = A(BC). Solution: This exercise can be done by a brute force calculation, in the style of the solution to the previous exercise. Here is a solution that uses only the associativity of the product of linear maps (which is easy to verify because composition of functions is clearly associative) and the nice property that the matrix of the product of two linear maps equals the product of the matrices of the two linear maps (see 3.11). Suppose A is an m-by-n matrix, β is an n-by-p matrix, and С is a p-by-g matrix; the sizes much match up like this in order for (AB)C to make sense.
Let R e £(Fn, Fm), S e C(F", Fn),Te £(F«, F") be such that, with respect to the standard bases, M(R) = A,M(S) = B,M(T) = C; 3.19 insures that such linear maps exist. Now (AB)C = (M(R)M(S))M(T) = M(RS)M(T) = M((RS)T) = M(R(ST)) = M(R)M(ST) = M(R)(M(S)M(T)) = A(BC). Suppose Τ Ε £(Fn, Fm) and that M(T) = 0.1,1 αι,η im,n O-rn.,1 · ■■ O-r, where we are using the standard bases. Prove that T{x\, . . . , Xn) = (αι,ιΧι Η +a\,nXn,---,<hn,lXl Η Ютп.пЯп) for every (χι,..., xn) e Fn. Comment: This exercise shows Τ has the form promised on page 39. Solution: Let χ = (xi,...,xn) e Fn. Using the standard bases, we then have M{Tx) = M{T)M{x) 0.1,1 αι,η x\ am,\ ··· Οτη,η Xr ο-ι,ιΧι Η Ι-αι,ηΧη Οτη,Ι^Ι Η \-<hn,nXn where the first equality comes from 3.14. The last equation implies that
27 Chapter 3. Linear Maps Tx = (αι,ιΧι Η 1- αιιηχη,..., amiiXi Η 1- α„,ιηχη), as desired. 20. Suppose (i>i,...,vn) is a basis of V. Prove that the function T: V —> Mat(n, 1,F) defined by Tv = M(v) is an invertible linear map of V onto Mat(n, 1, F); here M{v) is the matrix of υ € V with respect to the basis (vi,..., vn)· Solution: Suppose u, w e V. We can write и = a\V\ + ■ · · + OnVn and w = b\V\ + · - · + bnvn for some αι,...,αη,ί>ι,... ,bn EF. Thus и + w = (αϊ + b\)v\ Η 1- (αη + bn)vn. Hence T(u + w) = M{u + w) = αϊ +bi _an+bn a\ ; . °" . + ' bi ' ; - b" . = M(u)+M(w) = 5 ^u + 2 X which shows that Τ satisfies that additivity property required for linearity. If с € F, then cu = cai^i + - ■ - + canvn. Hence
28 Chapter 3. Linear Maps T{cu) = M{cu) ca.\ can αϊ = с On = cM{u) = cTu, which shows that Τ satisfies the homogeneity property required for linearity. Thus Τ is linear. If Tu = 0, then αϊ = · · · = α„ = 0, which implies that и = 0. Thus Τ is iujective. If Ci,...,c € F, then T(civi + --- + cnvn) = Cl Cn which implies that Τ is surjective. Because the linear map Τ is iujective and surjective, it is invertible (see 3.17). 21. Prove that every linear map from Mat(n, 1,F) to Mat(m, 1,F) is given by a matrix multiplication. In other words, prove that if Τ € £(Mat(n, 1, F), Mat(m, 1, F)), then there exists an m-by-n matrix A such that ТВ = AB for every В € Mat(n,l,F). Solution: The vector spaces Mat(n, 1,F) and Mat(m, 1,F) have obvious bases (consisting of matrices that have 0 in all entries except for a 1 in one entry). Let A be the matrix of Τ with respect to these bases. Note that if В € Mat(n, 1, F), then M(B) = В and M(TB) = ТВ. Thus
29 Chapter 3. Linear Maps ТВ = M{TB) = M(T)M(B) = AB, where the second equality comes from 3.14. 22. Suppose that V is finite dimensional and 5,Te £{V)· Prove that ST is invertible if and only if both S and Τ are invertible. Solution: First suppose ST is invertible. Thus there exists R e C(V) such that R(ST) = (ST)R = I. If ν € V is such that Tv = 0, then υ — Iv = R(ST)v = 0. Because υ was an arbitrary vector in nullT1, this shows that null Γ = {0}. Thus Τ is injective (by 3.2), and hence Τ is invertible (by 3.21), as desired. If и € V, then u = Iu = (ST)Ru = S(TRu), which shows that и € range S. Because и was an arbitrary vector in V, this implies that range S = V. Thus V is surjective, and hence V is invertible (by 3.21), as desired. To prove the implication in the other direction, now suppose that both S and Τ are invertible. Then (st)(t-1 s-1) = sctt-^s-1 = SS~l = 1 and (T^S'^iST) = T'^S^SW = T-iT = 1.
30 Chapter 3. Linear Maps Thus Τ 1S 1 satisfies the properties required for an inverse of ST. Thus ST is invertible and (ST)~l = Γ_Ι5_Ι. 23. Suppose that V is finite dimensional and S, Τ € C(V). Prove that ST = I if and only if TS = I. Solution: First suppose that ST = I. Because / is invertible, the previous exercise implies that S and Τ are both invertible. Multiply both sides of the equation above by T~l on the right, getting Now multiply both sides of the equation above by Τ on the left, getting TS=I, as desired. To prove the implication in the other direction, simply reverse the roles of S and Τ in the direction we have already proved, showing that if TS = I, then ST = I. 24. Suppose that V is finite dimensional and Τ € C(V). Prove that Γ is a scalar multiple of the identity if and only if ST = TS for every S e C{V). Solution: First suppose that Τ = o.I for some a £ F. Let S € £(V). Then ST = S(al) = aS m = (al)S = TS. To prove the implication in the other direction, suppose now that ST = TS for all S € £(V). We begin by proving that (υ, Τν) is linearly dependent for every υ € V. To do this, fix υ € V, and suppose that (υ, Τν) is linearly independent. Then {ν,Τν) can be extended to a basis (v,Tv,Ui,...,Un) of V. Define S € C(V) by S(av + bTv + cibi Η 1- CnU„) = bv.
31 Chapter 3. Linear Maps Thus S{Tv) = ν and Sv = 0. Thus the equation S(Tv) = T{Sv) becomes the equation ν = 0, a contradiction because (ν,Τυ) was assumed to be linearly independent. This contradiction shows that (v, Tv) is linearly dependent for every υ € V. This implies that for each υ € V \ {0}, there exists av € F such that Tv = avv. To show that Γ is a scalar multiple of the identity, we must show that av is independent of v. To do this, suppose v, w € V \ {0}. We want to show that ay = aw. First consider the case where (υ, го) is linearly dependent. Then there exists 6 € F such that w = bv. We have aww = Tw = T(bv) = bTv = b(avv) = avw, which shows that a„ = aw, as desired. Finally, consider the case where (v, w) is linearly independent. We have а„+ш(1> + w) = T{v + w) = Tv + Tw = avv +aww, which implies that (av+w — av)v + {av+w — aw)w = 0. Because (υ, го) is linearly independent, this implies that av+w = av and o-v+w = ciyj, so again we have Oy = dy,, as desired. 25. Prove that if V is finite dimensional with dim V > 1, then the set of non- invertible operators on V is not a subspace of C(V). Solution: Suppose that V is finite dimensional with dim V > 1. Let η = dim V and let (ιί ,..., i>n) be a basis of V. Define S,T Ε C(V) by S(diVi Η \- OnVn) = CLiVi and
32 Chapter 3. Linear Maps T{axv\ Η 1- OnVn) = a2V2 Η 1- α„υη. Then 5 is not injective because Sv2 = 0 (this is where we use the hypothesis that dim V > 1), and Τ is not injective because Tv\ = 0. Thus both S and Τ are not invertible. However, S + Τ equals /, which is invertible. Thus the set of noninvertible operators on V is not closed under addition, and hence it is not a subspace of C(V). Comment: If dim V = 1, then the set of noninvertible operators on V equals {0}, which is a subspace of C(V). 26. Suppose η is a positive integer and aij € F for i, j = 1,..., n. Prove that the following are equivalent: (a) The trivial solution x\ = ■ ■ ■ = xn = 0 is the only solution to the homogeneous system of equations η ^2ahkxk = 0 Y^anikxk = 0. k=i (b) For every ci,...,c„ € F, there exists a solution to the system of equations η 22а1<кХк = Cl 22an>kX.k = c"· k=\ Note that here we have the same number of equations as variables. Solution: Define Τ € £(Fn) by η η Γ(χι,...,χ„) = (У^а^кХк, ■ · ·, У]ап,кХк)- k=i k=i Then (a) above is the assertion that Τ is injective, and (b) above is the assertion that Τ is surjective. By 3.21, these two assertions are equivalent.
Chapter 4 Polynomials 1. Suppose m and η are positive integers with m < n. Prove that there exists a polynomial ρ G Pn(F) with exactly m distinct roots. Solution: Define peVn(1?) by p{z) = {z- i)n~m+l(z-2)(z-Z)...(z-m). Then ρ is a polynomial of degree η with exactly m distinct roots (which are l,...,m). 2. Suppose that z\,..., z,„+i are distinct elements of F and w\,..., wm+\ € F. Prove that there exists a unique polynomial ρ € Vm(F) such that p(Zj) = Wj for j = l,...,m + 1. Solution: Define Γ: "Pm(F) -* Fm+1 by Tp= (ρ{ζι),...,ρ(ζτη+ι)). We need to prove that Τ is injective (which implies that at most one polynomial ρ satisfies the condition required by the exercise) and surjective (which implies that at least one polynomial ρ satisfies the condition required by the exercise). Clearly Γ is a linear map. If ρ e null T, then p(Zl) = ■■■=p(zm+i) = 0, which means that ρ is a polynomial of degree m with at least m + l distinct roots, which means that ρ = 0 (by 4.3). Thus ρ is injective, as desired. 33
34 Chapter 4. Polynomials Now dim range Τ = dim "Pm(F) - dim null Τ = (m + l)-0 = dimFm+1, where the first equality comes from 3.4 and the second equality holds because null Γ = {0}. The last equality above implies that range Τ = Fm+1. Thus Τ is surjective, as desired. Comment: Surjectivity of Τ can also be proved by using an explicit construction. But linear algebra, specifically 3.4, gives us surjectivity easily once we get injectivity. 3. Prove that if p, q € ^(F), with ρ φ 0, then there exist unique polynomials s,reV{F) such that q = sp + r and degr < degp. In other words, add a uniqueness statement to the division algorithm (4.5). Solution: Suppose p, q € V(F), with ρ φ 0. We know from the division algorithm (4.5) that there exist s,r € V(F), with degr < degp, such that q = sp + r. To prove that s and г are unique, suppose that s,f are in ^(F), with degr < degp and q = sp + f. Subtracting the last two equations are rearranging, we have (s — s)p = r — f. The right side of the equation above is a polynomial whose degree is less than degp. If s were not equal to s, then the left side of the equation above would be a polynomial whose degree is at least degp. Thus we must have s = s, which, from the equation above, implies that f = r. Thus the choices of s and r were indeed unique.
35 Chapter 4. Polynomials 4. Suppose ρ € V{C) has degree m. Prove that ρ has m distinct roots if and only if ρ and its derivative j/ have no roots in common. Solution: First suppose that ρ has m distinct roots. Because ρ has degree m, this implies that ρ can be written in the form p(z) = c(z -\i)...(z- \m), where X\,...,km are distinct. To prove that ρ and j/ have no roots in common, we must show that p'(Xj) φ 0 for each j. To do this, fix j. The expression above for ρ shows that we can write ρ in the form p(z) = (z-kj)q(z), where q is a polynomial such that q(Xj) φ 0. Differentiating both sides of this equation, we have p'{z) = {z-\j)q'{z)+q{z). Thus p'(Ai) = ϊ(λ,) as desired. To prove the other direction, we will proved the contrapositive, meaning that we will prove that if ρ has less than m distinct roots, then ρ and p1 have at least one root in common. To do this, suppose that ρ has less than m distinct roots. Then for some root λ of p, we can write ρ in the form p(z) = (z-\)nq(z), where η > 2 and q is a polynomial. Differentiating both sides of this equation, we have p'(z) = (z- X)nq'(z)+n(z - X)n-lq(z). Thus ρ'(λ) = 0, and so λ is a common root of ρ and p7, as desired. 5. Prove that every polynomial with odd degree and real coefficients has a real root. Solution: Suppose that ρ is a polynomial with odd degree and real coefficients. By 4.14, ρ is a constant times the product of factors of the form
36 Chapter 4. Polynomials χ — λ and/or χ2 + ax + β, where λ, α, β € R. Not all the factors can be of the form x2 + ax + β, because otherwise ρ would have even degree. Thus at least one factor must be of the form χ — λ. Any such λ is a real root of p. Comment: Here is another proof, using calculus but not using 4.14. Suppose ρ is a polynomial with odd degree m. We can write ρ in the form p(x) = ao+ a{x Η h amxm, where Go, ·.., Ощ G R and am φ 0. Replacing ρ with —ρ if necessary, we can assume that dm > 0. Now / \ m/ a0 , al , am-l , ч Ρ(*) = * (^Γ + ^ΓΤ+··· + —+ a-)· This implies that lim p(x) = -co and lim p(x) = oo. X—t—OO X-»00 The intermediate value theorem now implies that there is a real number λ such that ρ(λ) = 0. In other words, ρ has a real root.
Chapter 5 Eigenvalues and Eigenvectors 1. Suppose Τ G C(V). Prove that if U\,..., Um axe subspaces of V invariant under T, then U\-\ + Um is invariant under T. Solution: Suppose U\,..., Um are subspaces of V invariant under T. Consider a vector и Ε U\ + ■■■ + Um. There exist щ G U\,...,Um G Um such that и = щ Η 1- um. Applying Τ to both sides of this equation, we get Tu = Тщ + ■ ■ ■ + Trim. Because each Uj is invariant under T, we have Тщ G U\,..., Tum G Um. Thus the equation above shows that Tu G U\ -\ 1- Um, which implies that U\ -i + Um is invariant under T. 2. Suppose Τ € C( V). Prove that the intersection of any collection of subspaces of V invariant under Τ is invariant under Τ. Solution: Suppose {UQ}Q^r is a collection of subspaces of V invariant under T\ here Γ is an arbitrary index set. We need to prove that П<*ег UQ> which equals the set of vectors that are in UQ for every α G Г, is invariant under T. To do this, suppose и G Паег ^*· Then и G Ua for every α G Г. Thus Tu G Ua for every α G Γ (because every Ua is invariant under T). Thus Tu G Паег ^ai which implies that f\aer UQ is invariant under T. 3. Prove or give a counterexample: if U is a subspace of V that is invariant under every operator on V, then U = {0} or U = V. 37
38 Chapter 5. Eigenvalues and Eigenvectors Solution: We will prove that if U is a subspace of V that is invariant under every operator on V, then U = {0} or U = V. Actually we will prove the (logically equivalent) contrapositive, meaning that we will prove that if U is a subspace of V such that U Φ {0} and Ό φ V, then there exists Τ € C(V) such that U is not invariant under T. To do this, suppose U is a subspace of V such that U φ {0} and U φ V. Choose и € U \ {0} (this is possible because U φ {0}) and w € V \ U (this is possible because U φ V). Extend the list (u), which is linearly independent because и Ф 0, to a basis (u, «i,..., vn) of V. Define Τ € C(V) by T(au + b\ v\ + ... bnvn) = aw. Thus Tu = w. Because u€ U but w £ U, this shows that U is not invariant under T, as desired. 4. Suppose that S, Τ € C(V) are such that ST = TS. Prove that null(T - XI) is invariant under S for every λ € F. Solution: Fix λ € F. Suppose υ G пи11(Г - λ/). Then {Τ - XI){Sv) = TSv -XSv = STv — XSv = S(Tv - Xv) = 0. Thus Sv € nul\(T - XI). Hence null(r - XI) is invariant under S. 5. Define Τ € £(F2) by T(w,z) = (z,w). Find all eigenvalues and eigenvectors of T. Solution: Suppose λ is an eigenvalue of T. For this particular operator, the eigenvalue-eigenvector equation T(w,z) = X(w,z) becomes the system of equations ζ = Xw w = Xz. Substituting the value for ζ from the first equation into the second equation gives w = X2w. Thus 1 = λ2 (we can ignore the possibility that w = 0
39 Chapter 5. Eigenvalues and Eigenvectors because if w = 0, then the first equation above implies that ζ = 0). Thus A = lorA = -l. The set of eigenvectors corresponding to the eigenvalue 1 is {(го, го) : го € F}; The set of eigenvectors corresponding to the eigenvalue -1 is {(го, -го) : го € F}. 6. Define Τ € £(F3) by T{z\,zi,zz) = (2*2,0,5*3 )· Find all eigenvalues and eigenvectors of T. Solution: Suppose λ is an eigenvalue of T. For this particular operator, the eigenvalue-eigenvector equation T{z\,zi,zz) = λ{ζι,ζι,ζζ) becomes the system of equations 2*2 = Xz\ 5*3 = λ*3· If λ φ 0, then the second equation implies that zi = 0, and the first equation then implies that z\ = 0. Because an eigenvalue must have a nonzero eigenvector, there must be a solution to the system above with *з φ 0. The third equation then shows that λ = 5. In other words, 5 is the only nonzero eigenvalue of T. The set of eigenvectors corresponding to the eigenvalue 5 is {(0,0,*3):^eF}. If λ = 0, the first and third equations above show that zi = 0 and *з = 0. With these values for Z2,z$, the equations above are satisfied for all values of z\. Thus 0 is an eigenvalue of T. The set of eigenvectors corresponding to the eigenvalue 0 is {(*i,0,0):*i €F}. 7. Suppose η is a positive integer and Τ € £(Fn) is defined by T(xi,..., xn) = (χι Η \- xn,..., χι Η 1-1„); in other words, Τ is the operator whose matrix (with respect to the standard basis) consists of all 1 's. Find all eigenvalues and eigenvectors of T.
40 Chapter 5. Eigenvalues and Eigenvectors Solution: Suppose λ is an eigenvalue of T. For this particular operator, the eigenvalue-eigenvector equation Tx = Xx becomes the system of equations x\ + hin = λχι χι + · · · + xn = Xxn- Thus λχι = ■■ · = λχη. Hence either λ = 0 or x\ = · · · = xn. Consider first the possibility that λ = 0. In this case all the equations in the eigenvector-eigenvalue system of equations above become the equation x\ + · · · + xn = 0. Thus we see that 0 is an eigenvalue of Τ and that the corresponding set of eigenvectors equals {(xb...,xn)€Fn:xi +···+χη = 0}. Now consider the possibility that x\ = ■ · · = xn; let t denote the common value of xi,... ,xn. In this case all the equations in the eigenvector- eigenvalue system of equations above become the equation nt = Xt. Hence λ must equal η (an eigenvalue must have a nonzero eigenvector, so we can take t φ 0). Thus we see that η is an eigenvalue of Τ and that the corresponding set of eigenvectors equals {(xb...,Xn) € Fn :xi =··· =xn}· Because the eigenvector-eigenvalue system of equations above implies that λ = 0 or xi = ·■■ = xn, we see that Τ has no eigenvalues other than 0 and n. 8. Find all eigenvalues and eigenvectors of the backward shift operator Τ € £(F°°) defined by T{Z\ ,Z2,Z3,...) = (Z2, Zz,... ). Solution: Suppose λ is an eigenvalue of T. For this particular operator, the eigenvalue-eigenvector equation Tz = Xz becomes the system of equations
41 Chapter 5. Eigenvalues and Eigenvectors zi = \z\ zz = λζ-ι Zi = \zz From this we see that we can choose z\ arbitrarily and then solve for the other coordinates: Zl = \Z\ Zz = λΖ2 = λ2 2ι Z\ = kzz = A Zy Thus each λ € F is an eigenvalue of Τ and the set of corresponding eigenvectors is {(го, Аго, λ2 го, λ3 го...) : го G F}. 9. Suppose Τ € C(V) and dim range Τ = к. Prove that Τ has at most к + 1 distinct eigenvalues. Solution: Let Ai,...,Am be the distinct eigenvalues of T, and let ui,...,um be corresponding nonzero eigenvectors. If \j φ 0, then T(vj/Xj) = vj. Because at most one of Ai,..., Am equals 0, this implies that at least m— 1 of the vectors v\,..., vm are in range T. These vectors are linearly independent (by 5.6), which implies that 77i — 1 < dim range Τ = к. Thus τη < к + 1, as desired. 10. Suppose Τ G C{V) is invertible and A G F\{0}. Prove that A is an eigenvalue of Τ if and only if ^ is an eigenvalue of T~l. Solution: First suppose that A is an eigenvalue of T. Thus there exists a nonzero vector υ G V such that Tv = Xv.
42 Chapter 5. Eigenvalues and Eigenvectors Applying T~l to both sides of the equation above, we get ν = XT~lv, which is equivalent to the equation T~lv = ^v. Thus j is an eigenvalue of T~l. To prove the implication in the other direction, replace Τ by T~l and λ by j and then apply the result from the paragraph above. 11. Suppose S,T € C(V). Prove that ST and TS have the same eigenvalues. Solution: Suppose that λ € F is an eigenvalue of ST. We want to prove that λ is an eigenvalue of TS. Because λ is an eigenvalue of ST, there exists a nonzero vector ν € V such that {ST)v = Xv. Now (TS){Tv) = T(STv) = T(Xv) = XTv. If Τυ φ 0, then the equation above shows that λ is an eigenvalue of TS, as desired. If Tv = 0, then λ = 0 (because S(Tv) = Xv) and furthermore Τ is not invertible, which implies that TS is not invertible (by Exercise 22 in Chapter 3), which implies that λ (which equals 0) is an eigenvalue oiTS. Regardless of whether or not Tv = 0, we have shown that λ is an eigenvalue of TS. Because λ was an arbitrary eigenvalue of ST, we have shown that every eigenvalue of ST is an eigenvalue of TS. Reversing the roles of S and T, we conclude that every eigenvalue of TS is also an eigenvalue of ST. Thus ST and TS have the same eigenvalues. 12. Suppose Τ € C(V) is such that every vector in V is an eigenvector of T. Prove that Γ is a scalar multiple of the identity operator. Solution: For each ν € V, there exists α„ € F such that Tv = CLyV. Because TO = 0, we can choose Go to be any number in F, but for ν € V"\{0} the value of av is uniquely determined by the equation above. To show that Γ is a scalar multiple of the identity, we must show that av is independent of ν for ν € V \ {0}. To do this, suppose v, w € V \ {0}. We want to show that av = a^,. First consider the case where (v,w) is linearly dependent. Then there exists b € F such that w = bv. We have
43 Chapter 5. Eigenvalues and Eigenvectors OwW = Tw = T(bv) = bTv = b(avv) = OvW, which shows that a„ = a™, as desired. Finally, consider the case where (v, w) is linearly independent. We have Ov+w(v + w) = T(v + w) = Tv + Tw = dvv + aww, which implies that (а„+ш - av)v + (α„+ω - aw)w = 0. Because (v,w) is linearly independent, this implies that av+w = a^ and dv+xu = аш, so again we have tiy = aw, as desired. 13. Suppose Τ € C(V) is such that every subspace of V with dimension dimV- 1 is invariant under T. Prove that Γ is a scalar multiple of the identity operator. Solution: Suppose that Τ is not a scalar multiple of the identity operator. By the previous exercise, there exists и € V such that и is not an eigenvector of T. Thus (u, Tu) is linearly independent. Extend (u, Tu) to a basis (it, Tu, v\,..., vn) of V. Let U = span(u, vi,... ,vn). Then U is a subspace of V and dimJ7 = dimV" — 1. However, U is not invariant under Τ because и € U but Tu ^ U. This contradiction to our hypothesis about Τ shows that our assumption that Τ is not a scalar multiple of the identity must have been false. 14. Suppose S, Τ € C(V) and S is invertible. Prove that if ρ € V(F) is a polynomial, then p(STS~l) = Sp(T)S~l. Solution: First suppose m is a positive integer. Then
44 Chapter 5. Eigenvalues and Eigenvectors (STS~l)m = (STS-l)(STS-l)...(STS~l) = ST(S-lS)T(S-lS)...(S~lS)TS-1 =STmS-\ which is our desired equation in the special case when p(z) = zm. Multiplying both sides of the equation above by a scalar and then summing a finite number of equations of the resulting form shows that p(STS~l) = Sp(T)S~l for every polynomial ρ € V(F). 15. Suppose F = C, Τ € C(V), ρ € V(C), and α € С. Prove that α is an eigenvalue of p(T) if and only if α = ρ(λ) for some eigenvalue λ of T. Solution: First suppose that α is an eigenvalue of p(T). Thusp(T)-aI is not injective. Write the polynomial p(z) — a in factored form: p(z) - a = c(z - λι)... (ζ - km), where c,Ai,...,Am € C. We can assume that с φ 0 (otherwise ρ is a constant polynomial, in which case the desired result clearly holds). The equation above implies that p(T) -aI = c(T-klI)...(T- kmI). Because p(T) - al is not injective, this implies that Τ — Xjl is not injective for some j. In other words, some Xj is an eigenvalue of T. The formula above for p(z) — a shows that p(kj) — a = 0. Hence α = p(kj), as desired. For the other direction, now suppose that α = ρ(λ) for some eigenvalue λ of T. Thus there exists a nonzero vector υ € V such that Tv =■ kv. Repeatedly applying Τ to both sides of this equation shows that Tkv =■ kkv for every positive integer k. Thus p(7> = p{k)v = av. Thus α is an eigenvalue of p(T). 16. Show that the result in the previous exercise does not hold if С is replaced with R.
45 Chapter 5. Eigenvalues and Eigenvectors Solution: Define Τ € £(R2) by T(x, y) = (-y, x). Define ρ € V(R) by p(x) = x2. Then p(T) = T2 = —I, and hence -1 is an eigenvalue of p(T). However, Τ has no eigenvalues (as we saw on page 78 of the textbook; the point here is that eigenvalues are required to be real because we are working on a real vector space), so there does not exist an eigenvalue λ of Γ such that -1 =ρ(λ). Of course there are also many other examples. 17. Suppose V is a complex vector space and Τ € C(V). Prove that Τ has an invariant subspace of dimension j for each j = 1,..., dim V. Solution: There is a basis {v\,..., udim v) with respect to which Τ has an upper-triangular matrix (see 5.13). For each j = 1,..., dim V, the span of {v\,..., vj) is a j-dimensional subspace of V that is invariant under Τ (by 5.12). 18. Give an example of an operator whose matrix with respect to some basis contains only O's on the diagonal, but the operator is invertible. Solution: Let Τ € £(F2) be the operator whose matrix (with respect to the standard basis) is 0 1' 1 0 " Obviously this matrix has only O's on the diagonal, but Τ is invertible (because TT — I, as is clear from squaring the matrix above). Of course there are also many other examples. Comment: This exercise and the next one show that 5.16 fails without the hypothesis that an upper-triangular matrix is under consideration. 19. Give an example of an operator whose matrix with respect to some basis contains only nonzero numbers on the diagonal, but the operator is not invertible. Solution: Define Τ € £(F2) to be the operator whose matrix (with respect to the standard basis) is Then T(1,0) = T(0,1) = (1,1), so Γ is not injective, so Γ is not invertible, even though the diagonal of the matrix above contains only nonzero numbers.
46 Chapter 5. Eigenvalues and Eigenvectors Of course there are also many other examples. 20. Suppose that Τ € C{V) has dim V distinct eigenvalues and that S € C{V) has the same eigenvectors as Τ (not necessarily with the same eigenvalues). Prove that ST = TS. Solution: Let η = dimV. There is a basis (υι,...,υη) of V consisting of eigenvectors of Τ (see 5.20 and 5.21). Letting λχ,..., λη be the corresponding eigenvalues, we have Tvj = Xjvj for each j. Each vj is also an eigenvector of S, so Svj = a3-vj for some acj € F. For each j, we have (ST)vj = S{Tvj) =■ kjSvj = ctjkjVj and {TS)vj = T(Svj) = ajTvj = ajXjvj. Because the operators ST and TS agree on a basis, they are equal. 21. Suppose Ρ € £(V) and P2 = P. Prove that V = null Ρ θ range P. Solution: First suppose и € null Ρ Π range P. Then Pu = 0, and there exists w € V such that и = Pw. Applying Ρ to both sides of the last equation, we have Pu = Ρ w = Pw. But Pu = 0, so this implies that Pw = 0. Because и = Pw, this implies that и = 0. Because и was an arbitrary vector in null Ρ Π range Ρ, this implies that null Ρ Π range Ρ = {0}. Now suppose υ € V. Then obviously υ = (υ - Ρυ) + Ρυ. Note that Ρ{υ - Ρυ) = Ρυ - Ρ2υ = 0, so (υ - Ρυ) € null P. Clearly Ρυ € range P. Thus the equation above shows that υ € null Ρ + range P. Because υ was an arbitrary vector in V, this implies that V = null Ρ + range P. We have shown that null Ρ Π range Ρ = {0} and V = null Ρ + range P. Thus V = null Ρ θ range Ρ (by 1.9).
47 Chapter 5. Eigenvalues and Eigenvectors 22. Suppose V = U θ W, where U and W are nonzero subspaces of V. Find all eigenvalues and eigenvectors of F\j,w· Solution: Because V = U θ W, each vector υ G V can be written uniquely in the form υ = и + w, where и G U and w G W. Recall that if υ is represented as above, then Рцу/υ = u. Suppose λ G F is an eigenvalue of Рцлу/. Then there exists a nonzero vector ν € V such that Pu,wv = λυ. Writing this equation using the representation of υ given above, we have и = X(u + w). Thus (1 - X)u-Xw = 0. Because V = U θ W, if 0 is written as the sum of a vector in U and a vector in W, then both vectors must be 0. Thus the equation above implies that (1 - X)u = Xw = 0. Because и and w are not both 0 (because υ φ 0), this implies that λ = 1 or λ = 0. For ν G V with representation as above, the equation Puy/v = 0 is equivalent to the equation и = 0, which is equivalent to the equation υ = w, which is equivalent to the statement that υ G W. This means that 0 is an eigenvalue of P(j,w (because W is a nonzero subspace of V) and that W equals the set of eigenvectors corresponding to the eigenvalue 0. For ν G V with representation as above, the equation P(j,wv — ν is equivalent to the equation υ = и, which is equivalent to the statement that ν G U. This means that 1 is an eigenvalue of P(j,w (because U is a nonzero subspace of V) and that U equals the set of eigenvectors corresponding to the eigenvalue 1. 23. Give an example of an operator Τ € £(R4) such that Τ has no (real) eigenvalues. Solution-. Define Τ e £(R4) by Suppose λ G R. For this particular operator, the eigenvalue-eigenvector equation Т(х\,Х2,хз,хл) = λ(χι,Χ2>^3ι^4) becomes the system of equations
48 Chapter 5. Eigenvalues and Eigenvectors —£2 = λχι Xl = XX2 —χ\ = Ахз хз = λχ4. Multiplying together the first two equations and also multiplying together the last two equations gives -X1X2 = λ2χΐΧ2 and —X3X4 = А2хзХ4. If either xi or X2 does not equal 0, then the first two equations show that neither of xi, X2 equals 0. Similarly, if either X3 or X4 does not equal 0, then the last two equations show that neither of X3, X4 equals 0. Thus if λ is an eigenvalue of T, then there is a solution to the system of equations above with X1X2 φ 0 or X3X4 φ 0. Either way, we conclude that -1 = λ2, which is impossible for any real number λ. Thus Τ has no real eigenvalues. 24. Suppose V is a real vector space and Τ € C(V) has no eigenvalues. Prove that every subspace of V invariant under Τ has even dimension. Solution: Suppose U is a subspace of V that is invariant under T. Thus T\u € C{U). If dim C/ were odd, then T\u would have an eigenvalue λ € R (by 5.26), so there would exist a nonzero vector и € U such that Т\ии = ku. Obviously this would imply that Tu = Xu, which would imply that λ is an eigenvalue of T. But Τ has no eigenvalues, so dirnC/ must be even, as desired.
Chapter 6 Inner-Product Spaces 1. Prove that if x, у are nonzero vectors in R , then (x,y) = ||z||||]/||cos0, where θ is the angle between χ and у (thinking of χ and у as arrows with initial point at the origin). Hint: draw the triangle formed by x, y, and x-y; then use the law of cosines. Solution: Suppose that x,y are nonzero vectors in R2 and θ is the angle between χ and y. Consider the triangle formed by x, y, and χ — у: The law of cosines states that iix-y|i2 = iNi2 + iiyii2-2iixiniyiicos0. As usual, we compute the norm of a vector squared by taking the inner product of the vector with itself: l|s-y||2 = (x-y,x-y) = (x, x) - (x, y) - (y, x) + (y, y) = \\x\\2 + \\y\\2-2(x,y). 49
50 Chapter 6. Inner-Product Spaces Substitute the last expression for ||r — y\\2 into the left side of the law of cosines, obtaining ||x||2 + ||y||2 - 2(x,y) = \\xf + \\yf - 2\\x\\\\y\\ cos*. Now subtract ||x||2 + ||y||2 from both sides of the equation above, and then divide both sides by —2, obtaining <*,!/> = IMIIIl/ll a**· 2. Suppose u, ν € V. Prove that (u, v) = 0 if and only if Nl<||u + a«|| for all a € F. SOLUTION: First suppose that (u,v) = 0. Let α € F. Then u,av are orthogonal. The Pythagorean theorem thus implies that ||u + H|2 = |H2 + ||H|2 > INI2· Taking square roots gives ||u|| < ||ιι + αυ||, as desired. To prove the implication in the other direction, now suppose that ||u|| < ||ιι + αυ|| for all α € F. Squaring this inequality, we get H|2<||u + H|2 = (u + av, и + αν) = (и, и) + (и, αν) + (αν, и) + (αν, αν) = \\u\\2 + a(u,v)+afcv) + \a\2\\v\\2 = \\u\\2 +2Bea(u,v) + \a\2\\v\\2 for all α € F. Thus -2Rea(u,v) < \a\2\\v\\2 for all α € F. In particular, we can let α equal — t(u, v) for t > 0. Substituting this value for α into the inequality above gives 2i|(u,u>|2<i2|<u,t;>|2H2 for all t > 0. Divide both sides of the inequality above by t, getting
51 Chapter 6. Inner-Product Spaces 2|<u,u>|2<i|<u,t;>|2H|2 for all t > 0. If υ = 0, then (u,v) = 0, as desired. If ν φ 0, set t equal to 1/||υ||2 in the inequality above, getting 2\{u,v)\2 <\{u,v)\\ which implies that (u, v) = 0, as desired. 3. Prove that (£<*)'*(έν)(έ¥) .7=1 i=i i=i J for all real numbers αϊ,..., α„ and bi,..., bn. SOLUTION: Suppose αϊ,... ,On,bi,... ,bn € R. Using the usual inner product on Rn, we have (EaA-)2 = (E(^i)(Vv7))s 3=1 .7 = 1 = ((αϊ, y/2a2,..., Vnan), (bi, &2/V2,..., bn/y/n)) < ||(a,, V2a2,..., v^rOflKbb&a/v^,..., bn/Vn)\\2 = (Σ^-2)(Σ^·), i=i J=i where the inequality above comes from the Cauchy-Schwarz inequality. Suppose u, ν € V are such that ||u||=3, ||и + «|=4, ||u-«||=6. What number must ||υ|| equal? Solution: Prom the parallelogram equality, we have |M|2 = \\u + v\\2 + \\u-v\\2-2\\u\\2 _ 16 + 36- 18 2 = 17.
52 Chapter 6. Inner-Product Spaces Thus ||v|| = ν/Ϊ7. 5. Prove or disprove: there is an inner product on R2 such that the associated norm is given by ||(xi,x2)|| = |xi| + |x2| for all (xi,X2) € R2· SOLUTION: We will show that there does not exist an inner product on R2 such that the associated norm is given by the formula above by showing that the parallelogram equality is violated. Let и = (3,2) and υ =(1,3). Then u + v = (4., 5) and u — υ = (2, — 1). Using the formula above, we then have ||u + u||2 + ||u-t;||2 = 81+9 = 90 and 2(N|2 + H2) = 2(25 + 16) = 82. Thus the parallelogram equality fails, as desired. 6. Prove that if V is a real inner-product space, then Цц + г,Ц2-1|и-г;112 {и, ν) = for all и, υ € V. Solution: Suppose V is a real inner-product space and u, ν € V. Then \\u + v\\2 — ||u - v\\2 _ (u + v, и + ν) — (и — ν, и — ν) 4 = 4 [[и»2 + 2(ti, υ) + HI2 - ([[и»2 - 2(ц, υ) + IN2) 4 _ 4(u, ν) = 4~ = (u,v),
53 Chapter 6. Inner-Product Spaces as desired. 7. Prove that if V is a complex inner-product space, then . . ||u + t;||2- ||u - v||2 + ||u + tt;||2i - ||u - гг;||2г {и, ν) = for all и, υ € V. Solution: Suppose V is a complex inner-product space and u,v € V. Then Ци + υΙΙ2 = (u + v,u + v) = \\u\\2 + (u,v) + (v,u) + \\v\\2 and — \\u — v\\2 = —(u — v,u — v) = -\\u\\2 + (u,v) + (v,u)-\\v\\2 and i\\u + iv\\2 = i(u + iv, и + iv) = i\\u\\2 + (u,v)-(v,u)+i\\v\\2 and —i\\u — iv\\2 = —i(u — iv,u — iv) = -г||и||2 + (и, ν) - (υ,и) - i\\vf. Adding the four equations, we have ||и + г;||2 - ||ιι-υ||2 -(-гЦи + гг;^ - i||u - iv\\2 = 4.(u,v), as desired. 8. A norm on a vector space U is a function || ||: U —* [0, oo) such that ||u|| = 0 if and only if и = 0, \\au\\ = |a|||u|| for all a € F and all u € 17, and II" + υ|| < ||u|| + ||υ|| for all u,v € U. Prove that a norm satisfying the parallelogram equality comes from an inner product (in other words, show that if || || is a norm on U satisfying the parallelogram equality, then there is an inner product ( , ) on U such that ||u|| = (u,u)1/2 for all и € 17).
54 Chapter 6. Inner-Product Spaces Comment: This is among the hardest exercises in the book. Instructors may want to simplify this exercise slightly by allowing students to consider only the case where F = R. Solution·. Suppose that U is a vector space and || || is a norm on U satisfying the parallelogram equality. We want to find an inner product (, ) on U such that ||u|| = (u,u)1/2 for all и € U. First consider the case where F = R. For u, υ € U, define (u, v) by \\u+v\\2-\\u-v\\2 {u,v) = . This definition is motivated by Exercise 6 of this chapter, which gives a formula for the inner product in terms of norms. For и € U we have \\u + u\\2-\\u-u\\2 (u, u) = - Pull2 - 11Q112 4 = INI2- Thus ||u|| = (u, u)1/2, as desired. However, we still must show that ( , ) satisfies the properties required of an inner product. Because (u, u) = \\u\\2 (as shown above), we have (u, u) > 0 for all u € 17, with equality if and only if и = 0; these properties follow from the properties of a norm. Thus ( , ) satisfies the positivity and definiteness properties required of an inner product. To prove that ( , ) is additive in the first slot, let u, v,w € U. Then 4((u + v, w) — (u, w) - (v, to)) = \\u + v + io||2 -\\u + v- io||2 - ||u + w||2 + ||u - wf - \\v + w\\2 + \\v - wf = \\u + v + w\\2 + (||u - w\\2 + \\v - HI2) -\\u + v- w\\2 - (||t* + wf + \\v + wf), where the first equality comes from the definition of ( , ). In the last equality above, the parentheses indicate groupings to which we will apply
55 Chapter 6. Inner-Product Spaces the parallelogram equality, which asserts that the sum of the norms squared of two vectors x,y € U can be computed from the formula Μ|.+μ. = Ιξ+ϊΙ! + 1-ιϊ1!. Applying the parallelogram equality to the two terms in parentheses above (take χ = и — го, у = ν — го for the first sum in parentheses, then χ = u + w,y = υ + w for the second term in parentheses) gives 4((u + v, w) — (u, го) — {υ, го)) 2 ||u-Ы> - 2го||2 ||u-i>||2 \u + v — го 2 2 2 ||и + г; + 2го||2 ||и-г;||2 /ιι н2 и н2ч llu + υ — 2го||2 = (||u + v + wf + HI2) + ^ ϋ- /н n2 ιι н2\ ||u + г; +2го||2 -(||и + г;-го||2 + ||го||2)-^^ U-. Applying the parallelogram equality to the two terms in parentheses above (take χ = u + г; + го, у = го for the first sum in parentheses, then χ = u + v — w,y = w for the second term in parentheses) gives 4((u + v, w) - (u, w) — (v, го)) ||u + v + 2го||2 ||и + г;||2 ||u + υ - 2го||2 2 + ~^_ + 2 ||и + г;||2 ||ц + ν - 2гоЦ2 ||u + v + 2го||2 = 0. Thus (u + v, w) = (и, го) + (ν, го), completing the proof that ( , ) is additive in the first slot. To prove that ( , ) is homogeneous in the first slot, let u,v € U. If η is a positive integer, then
56 Chapter 6. Inner-Product Spaces (nu,v) = (u + · · · + u,v) η times = (u,v) -\ \-(u,v) N ν ' η times = n(u,v), where the second equality comes from additivity in the first slot, which we have already verified. Replacing it with u/n in the equality above gives (u,v) = n(u/n,v), which implies that (-») = ~(u,v). η η Let 77i be another positive integer, and replace и with mu in the equality above, getting τη . 1 (—u, v) = —lmu,v) η η 771. . = — {u,v}, η where the second equality holds because we have already shown that ( , ) is homogeneous in the first slot with respect to positive integers. We have now shown that (, ) is homogeneous in the first slot with respect to positive rational numbers. Prom the definition of ( , ), we have (-u,v) = \\-u + v\\2 \\u + v\\2 -(u,v). — 4 4 ||-u ll«- -«II2 «II2 Combining this with the result from the previous paragraph, we can now conclude that (, ) is homogeneous in the first slot with respect to all rational numbers. Now suppose that λ € R. There exists a sequence η, гг,... of rational numbers such that Ηπίπ-,α, r„ = λ. Thus
57 Chapter 6. Inner-Product Spaces X(u, v) = lim rn(u,v) n—»oo = lim (rnu,v) = Шпд-- '2 2 |r„u + t;r- \\rnu-v\ In the next paragraph we will show that limn_*oo |jr„it + г>|| = ||Au + v|| and ϋπΐη-,οο ||rnu —υ|| = ||Au —υ||. Combining this with the last equation above we can conclude that \\Xu+v\\2-\\ku-v\\2 X(u, v) = = (Au,«), which will complete the proof that ( , ) is homogeneous in the first slot. If x, у € U, then Ы = \\у + (х-у)\\ <Ы\ + \\х-у\\ and thus INI-||y||<l|x-y||- Interchanging the roles of χ and y, we get IMI-||*|| < II*-y||. Because |||x|| - \\y\\\ equals ||x|| - \\y\\ or ||y|| - ||x||, we can now conclude that |IMI-llvll|<ll*-vl|. With χ = rnu + ν and у = ku + υ, this inequality gives |||r„u + v||-||Au + v||| < ||r„u - Au|| = |r„-A|H|. Because Ηπίπ-,α, r„ = λ, this shows that lim ||r„u + v|| = ||λ« + υ||. n—*oo Replacing ν with —v, we have
58 Chapter 6. Inner-Product Spaces lim ||rnu - υ|| = ||Au —υ||. η—»oo The last two equations are the promised ingredients that were needed for the proof that ( , ) is homogeneous in the first slot. Finally, we must show that (u, v) = (v,u) (recall that we are considering the case where F = R). This last step is easy: (u, v) = ||u + t;||2-||u- 4 ll« + u||2-||t;- -Hl2 -«||2 4 = (v,u). This completes the proof that ( , ) is an inner product when F = R. Whew! Now consider the case where F = C. For u,υ € U, define (u,v) by (u, v) = llu + υ||2 - llu - v||2 + llu + iv\\2i - llu - iv\\2i 4 This definition is motivated by Exercise 7 of this chapter, which gives a formula for the inner product in terms of norms. For и € U we have (u, u) = \\u + u||2 - ||u - u||2 + ||u + ttt||2z - ||u - iu||2i 4 ||2и||2 + |1 + г|2Н|2г-|1-г-|2Н2г 4 4||и||2+2|Ы|2г-2|Ы|2г = IMI2. Thus ||u|| = (u, u)1/2, as desired. However, we still must show that ( , ) satisfies the properties required of an inner product. Because (u, u) = \\u\\2 (as shown above), we have (u, u) > 0 for all и € U, with equality if and only if и = 0; these properties follow from the properties of a norm. Thus ( , ) satisfies the positivity and definiteness properties required of an inner product. For convenience, let's define ( , )r by («,f>R = ||и + г;||2-||и-г;||2
59 Chapter 6. Inner-Product Spaces Here the subscript R reminds us that (, ) r was the inner product we defined when considering the case F = R. Now we are assuming that F = C, but ( , )r is still well defined. Note that (u,v) = (u,i>)r + (u,iv)Ri. We have already proved that ( , )r is additive in the first slot, and now we use that information. Let u, v, w € U. Then (u + v, w) = (u + v, w)n + (u + v, iw)ni = (u,w)R + (v,w)K + (u,iw)Ki + (v,iw)Ki = {(u, w)R + (u, iw)Ki) + ((v, w)R + (v, iw}ni) = (u,w) + (υ, го). Thus ( , ) is additive in the first slot. To prove that (, ) is homogeneous in the first slot, let u, υ € U. If λ € R, then (\u,v) = (λιι, v)r + (Au, £i>)rz = A(u,v)R + X(u,iv)Ki = X(u,v), where we have used the homogeneity of ( , )r in the first slot. The last equation above shows that ( , ) is homogeneous in the first slot with respect to all real numbers. We must still extend this result to complex numbers. Note that .. . ||t'u + i>||2 — \\iu — i>||2 + Цш + гирг — \\iu — iv\\2i (ги, υ) = _ \\г(и + г;)||2г - \\г(и - v)\\2i - \\г(и + iv)\\2 + |К(ц - гг;)||2 _ ||ц + г> ||2г - ||ц - г> ||2г - ||ц + tf||2 + ||u - if||2 4 = i(u,v). Combining this result with additivity and homogeneity with respect to real numbers, we get that ((a + bi)u, v) = (a + bi)(u, v)
60 Chapter 6. Inner-Product Spaces for all a, b € R. In other words, ( , ) is homogeneous in the first slot with respect to all complex numbers. Finally, we must show that (u,v) = (v,u). This last step is easy: llu + v\\2 - llu - v\\2 + llu + iv\\2i - llu - iv\\2i (u, υ) = 4 \\υ + u\\2 - \\υ - u\\2 + \\i{-iu +.«)||2t - ||(-i)(w + v)\\2i 4 llv + u\\2 - \\v - u\\2 + \\v + iu\\4 - llv - iu\\2i = (υ, it). This completes the proof that ( , ) is an inner product when F = C. Suppose η is a positive integer. Prove that /1 sinx sin2x sinnx cosx cos2x cosnx\ is an orthonormal list of vectors in C[—π, π], the vector space of continuous real-valued functions on [—ж, ж] with inner product (f,9) = / f{x)g(x)cb:. J -τ Comment: This orthonormal list is often used for modeling periodic phenomena such as tides. Solution: First we need to show that each element of the list above has norm 1. This follows easily from the following formulas: / / 4, , 2jt — s\n2jt (sin jt)2dt= Aj J (cos jtfdt 2 2jt + sin 2jt 4j Next we need to show that any two distinct elements of the list above are orthogonal. This follows easily from the following formulas, valid when ЗФк:
61 Chapter 6. Inner-Product Spaces / / / (sin jf) (sin kt)dt = js'm(j — k)t + ks'm(J — k)t — jsin(j + k)t + ks\n(j + k)t 20' - k)U + k) (sin j t) (cos kt) dt = j cos(j — k)t + kcos(j — k)t + j cos(J + k)t — kcos(j + k)t 2(k-j)(j + k) (cos j t) (cos kt) dt = js\n(J — k)t + fcsin(j — k)t + js'm(j + k)t — fesin^" +k)t (sin jf) (cos jt) dt = W-k){j + k) (cos jt)2 2i 10. On V2(R), consider the inner product given by (P> Я) = / P(x)q(x) dx- Jo Apply the Gram-Schmidt procedure to the basis (Ι,χ,χ2) to produce an orthonormal basis of ^(R-)· Solution: Applying the Gram-Schmidt procedure to (1, x, x2) produces (using elementary calculus and some arithmetic) the following orthonormal basis of V2(R): (1, V3(-l + 2x),\/5(l - 6x + 6x2)). 11. What happens if the Gram-Schmidt procedure is applied to a list of vectors that is not linearly independent? Solution: Suppose (v\,..., vm) is a linearly dependent list of vectors in V. livi = 0, then at the first step of applying the Gram-Schmidt procedure to (v\,..., vm) we will be dividing by 0 when trying to set ei = υι/||υι||. If v\ Φ 0, then by the linear dependence lemma (2.4), some Vj is in span(ui,.. ,,Vj-i); here we choose j to be the smallest positive integer with this property. If we apply the Gram-Schmidt procedure to produce (ei,..., ej-i) at the end of step j, then span(ui,... ,Vj-X) = span(eb ... ,ej-i).
62 Chapter 6. Inner-Product Spaces Thus Vj € span(ei,..., e_,_i). By 6.17, this implies that Vj = (vj,ei)ei + · · · + (vj, ej_i)ej_i. Thus the Gram-Schmidt formula 6.23 for ej includes a division by 0, which is not allowed. 12. Suppose V is a real inner-product space and (v\,..., vm) is a linearly independent list of vectors in V. Prove that there exist exactly 2m orthonormal lists (ei,..., em) of vectors in V such that span(ui,..., Vj) = span(ei,..., e_,·) for all j € {1,.. .,m}. Solution: For j = 1, the condition above states that span(ui) = span(ei). Because there are only two vectors in span(ui) with norm 1 (these two vectors are υι/||υι|| and —vi/|jvi ||), we have only these two choices for e\. Now suppose that j > 1 and that an orthonormal list (ei,..., e_j_i) has been chosen such that span(ui ,...,Vj-i) = span(ei,..., e,-_i). The Gram-Schmidt procedure produces e_, € V such that (ei,...,ej) is an orthonormal list and span(ui,..., Vj) = span(ei,..., ej). Suppose ej' € V is another vector with these properties, meaning that (ei,..., ej-i, ej) is an orthonormal list and span(ui,... ,Vj) = span(ei,... ,ej_i,e/). The last two equations show that span(ei,..., ej-i,ej') = span(ei) · · · ι ey)- In particular, ej' € span(ei,... ,ej), which implies that €/ = (β/ι ei>ei + · · · + (ej', ej)ej = {ej , ej)ej, where the first equality comes from 6.17 (with span(ei,..., ej) replacing V and j replacing n) and the second equality holds because (ei,... ,e_j_i,e/)
63 Chapter 6. Inner-Product Spaces is an orthonormal list. Taking norms of both sides of the last equation, and recalling that ej and e/ both have norm 1, we see that |(e/,e_/)| = 1. Thus (ej',ej) = 1 or (е/,е/} = —1. Hence the last equation above implies that e/ = ej or e/ = —ej. We have shown that there are exactly two possible choices for each ej. As j ranges from 1 to m, this gives us exactly 2Ш possible choices for (βι,.-.,βπ,). 13. Suppose (ei,...,em) is an orthonormal list of vectors in V. Let υ € V. Prove that IMI2 = K«,ei>|a + ... + |<«iem>|2 if and only if υ € span(ei,..., em). Solution: Extend (βι,.,.,βπ,) to an orthonormal basis (βι,.,.,βη) of V. Then ν = (v, ei)ei -i \-(v, en)e„ and N2 = |(t;1e1>|2 + -.. + |(t;)en>|2; see 6.17. Prom the last equation, we see that 1Н12 = |<»,е1>|а + - + |<»1ет>|а if and only if (υ,β,η+ι) = · · - = (υ, e„) = 0. From the first equation above, this happens if and only if υ = (v, ei)ei + - - · + (v, eTO>em, which happens if and only if υ € span(ei,..., em). 14. Find an orthonormal basis of 7*2(R) (with inner product as in Exercise 10) such that the differentiation operator (the operator that takes ρ to p7) on 7b(R) has an upper-triangular matrix with respect to this basis. Solution: Because l' = 0, x' = 1, and (x2)' = 2x, the differentiation operator on 7*2(R) has an upper-triangular matrix with respect to the basis (Ι,χ,χ2). However, (Ι,χ,χ2) is not an orthonormal basis. But, as can be seen from the proof of 6.27, if the Gram-Schmidt procedure is applied to this
64 Chapter 6. Inner-Product Spaces basis, we will get an orthonormal basis with respect to which the differentiation operator has an upper-triangular matrix. As we saw in Exercise 10 of this chapter, the Gram-Schmidt procedure applied to (Ι,χ,χ2) gives (1, V3(-l + 2x),\/5(l - 6x + 6x2)). which is our desired orthonormal basis of ^(R)· 15. Suppose U is a subspace of V. Prove that dimJ7-L = dimVr-dimJ7. Solution.- Prom 6.29, we know that V = U@UL. Thus by Exercise 17 in Chapter 2, we have dim V = dim U + dim J7X, which implies that dim J7X = dim V — dim U. 16. Suppose U is a subspace of V. Prove that UL = {0} if and only if U = V. Solution: From 6.29, we know that v = ueu-L. This clearly implies that UL = {0} if and only if U = V. 17. Prove that if Ρ € C(V) is such that P2 = Ρ and every vector in null Ρ is orthogonal to every vector in range P, then Ρ is an orthogonal projection. Solution: Suppose Ρ € C{V) is such that Ρ2 = Ρ and every vector in null Ρ is orthogonal to every vector in range P. Let U = range P. We will show that Ρ equals the orthogonal projection Рц. То do this, suppose υ € V. Then υ = Pv + (v- Pv). Clearly Pv € range Ρ = U. Also, P(v — Pv) = Pv — P2v = 0, which means that ν — Pv € null P. Thus ν — Pv is orthogonal to every vector in U. In other words, ν — Pv € J7X. Thus the equation above writes ν as the sum of a vector in U and a vector in UL. In this decomposition, the vector in U equals, by definition, Pyv. Hence Pv = Puv, as desired.
65 Chapter 6. Inner-Product Spaces 18. Prove that if Ρ G C(V) is such that P2 = Ρ and \\Pv\\ < imi for every ν G V, then Ρ is an orthogonal projection. Solution: Suppose и G range Ρ and w g null P. If we can show that (u,w) = 0, then by the previous exercise we can conclude that Ρ is an orthogonal projection. Because и G range P, there exists u' G V such that и = Pu'. Applying Ρ to both sides of this equation, we have Pu = PV = Pu' = u. Because w G null P, this implies that P(u + aw) = и for every α G F. Thus \\u\\2 = \\P(u + aw)\\2 < ||u + ato||2 for every a G F, where the second line follows from our hypothesis that ||Ρυ|| < ||υ|| for every υ € V. The inequality above implies (see Exercise 2 of this chapter) that (u, w) = 0, as desired. 19. Suppose Τ G £>{V) and U is a subspace'of V. Prove that U is invariant under Τ if and only if РцТРц = ТРи. Solution: First suppose that U is invariant under T. Let υ G V. Then P(jv G U and hence T(Puv) € U (because U is invariant under T). Thus Pu(T(Puv)) = T(P(jv). Because υ was an arbitrary vector in V, this implies that РцТРи = ТРу, as desired. To prove the implication in the other direction, now suppose that PuTPu = ТРи.
66 Chapter 6. Inner-Product Spaces Suppose и € U. Then P(ju = u, so applying both sides of the equation above to и gives P(j(Tu) = Tu, which implies that Tu € U. Because и was an arbitrary vector in U, this implies that Τ is invariant under U, as desired. 20. Suppose Τ € C{V) and U is a subspace of V. Prove that U and UL are both invariant under Τ if and only if РцТ — ТРц. Solution: First suppose that U and J7X are both invariant under T. By the previous exercise, this implies that PuTPu = TPu and Ρί/χΓΡί/χ=ΓΡί/χ. But Рц± = I — P(j, so the last equation becomes (/ - Pu)T(I - Pu) = T(I - Pv). Expanding both sides of the equation above and rearranging terms, we get ΡυΤΡυ = PuT. Combining this with the first equation above, we get РцТ = TPu, as desired. To prove the implication in the other direction, suppose now that ΡυΤ = ΤΡυ. Then PuTPu = (ΡυΤ)Ρυ = (TPu)Pu = TPu2 = TPu, which implies (by the previous exercise) that U is invariant under T, as desired. Also, Ρί/χΓΡί/χ = ((/-Ρί/)Γ)Ρί/χ = {Τ-ΡυΤ)Ρυ± = (Τ-ΤΡυ)Ρυ± = T(l - Ρυ)Ρυ± = TPu J = ΤΡυ±,
67 Chapter 6. Inner-Product Spaces which implies (by the previous exercise) that £/ is invariant under T, as desired. 21. In R4, let tf = span((l, 1,0,0), (1,1,1,2)). Find и &U such that \\u - (1,2,3,4)|| is as small as possible. Solution: First we find an orthonormal basis of U by applying the Gram-Schmidt procedure to ((1,1,0,0), (1,1,1,2)), getting «'(js-js·0·0) e2 = (0·0·^'^)- Thus with ei,62 as above, (еьбг) is an orthonormal basis of U. By 6.36 and 6.35, the closest point и € U to (1,2,3,4) is ((l,2,3,4),ei)ei + ((l,2,3,4),e2)e2, which equals /3 3 11 22\ V2'2' 5' δ/' 22. Find ρ e P3(R.) such that p(0) = 0, ?/(0) = 0, and f |2 + 3r-p(x)|2iix Jo is as small as possible. Solution: Define an inner product on ^(R) by (/. 9)= I f(x)9(x) <&· Jo Let q(x) = 2 + 3x, and let U = {ρ € V3(K) : p(0) = 0,p'(0) = 0}. With this notation, our problem is to find the closest point ρ € U to q. To do this, first we find an orthonormal basis of U.
68 Chapter 6. Inner-Product Spaces A polynomial ρ satisfying p(0) = 0, p'(O) = 0 has constant term 0 and first degree term also equal to 0. Thus a basis of U is (x2,x3). Apply the Gram-Schmidt procedure to this basis, getting ei = VEx2 e2 = V7(-5x2 + 6x3). Thus with е\,в2 as above, (еьвг) is an orthonormal basis of U. By 6.36 and 6.35, the closest point ρ € U to q is given by the formula p= (q,ei)ei +(q,e2)e2. A short computation now shows that p(x) = 24x2 - —x3. 23. Find ρ € Vs(R) that makes /| sin χ - p(x)|2dx -7Γ as small as possible. (The polynomial 6.40 is an excellent approximation to the answer to this exercise, but here you are asked to find the exact solution, which involves powers of ж. A computer that can perform symbolic integration will be useful.) Solution: Let C[-ir, π] denote the real vector space of continuous real- valued functions on [—π, π] with inner product (f,g)= Г f(x)g(x)dx. J—π Let υ € C[—π, π] be the function defined by v(x) = sinx. Let U denote the subspace of C[-n, π] consisting of the polynomials with real coefficients and degree at most 5. We need to find ρ € U such that ||υ — p\\ is as small as possible. First find an orthonormal basis of U by applying the Gram-Schmidt procedure (using the inner product above) to the basis (l,x,x2,x3,x4,x5) of U, producing the orthonormal basis (61,62,63,64,65,ее), where
69 Chapter 6. Inner-Product Spaces ei v/2^' 62 ~ π·3/2 ' ез e4 = e5 = 3/2 ' y/Ifr2 - 3x2) 2тг5/2 ^/f (Зтг2* - 5x3) 2^72 ' 3(3тг4 - 30тг2х2 + 35x4) 8V2tt9/2 у/Ц{\ЪжАх - 70тг2х3 + 63х5) 66 8тг»/2 Now compute Р{/*> using 6.35 (with m = 6), getting 105(1485 - 1537Г2 + тг4) 315(1155 - 125тг2 + тг4) 3 PuV= 8^ x ^ x 693(945 - 1057Γ2 + π4) 5 + 8тг>° Х ■ Finally, 6.36 and the discussion following 6.42 show that the function above is the one we seek. 24. Find a polynomial q € V2 (R) such that 1 fl for every ρ G ^(R). Solution: We will need an orthonormal basis of ^(R), where the inner product of two polynomials in ^(R) is defined to be the integral from 0 to 1 of the product of the two polynomials. An orthonormal basis of ^(R) was already computed in Exercise 10 of this chapter. Specifically, let ei(x) = 1 e2(x) =V3(-l+2x) e3(x) = \/5(l-6x + 6x2).
70 Chapter 6. Inner-Product Spaces Then (61,62,63) is an orthonormal basis of ^(R). Define a linear functional φ on T-^R) by φ(ρ) = ρ{^)· We seek q e 7MR) such that φ(ρ) = (p,q) for every ρ € Яг (R)· By the formula given in the proof of 6.45, we have q = <p(ei)ei + v?(e2)e2 + <£>(e3)e3. Evaluate the right side of the equation above to get 3 q(x) = -- + 15x - 15x2. 25. Find a polynomial q € ^(R) such that / p(x)(coswx)dx = J p(x)q(x)dx Jo Jo for every ρ € ЯгСН.)- Solution: Define a linear functional φ on ^(R) by φ{ρ) — J p(x)(costtx)<Ix. Jo We seek q 6 7MR) such that φ{ρ) = (p,q) for every ρ G ^(R), where the inner product on ^(R) is defined as in the previous exercise. Letting βι,β2,β3 be as in the previous exercise, but using our new definition of φ, we again have q = <p(ei)ei + <p(e2)<a + ¥>(ез)ез- Evaluate the right side of the equation above to get 12-24x q(x) 7Г2 26. Fix a vector υ e V and define Τ e C(V, F) by Tu = (u, v). For α 6 F, find a formula for T*a. Solution: Because Τ e C(V,F), we know that T* G £(F,f). Fix α 6 F. Then T*a is the unique vector in V such that
71 Chapter 6. Inner-Product Spaces (a) (Tu, a) = (u, T*a) for all ύ € U. The inner product on the right is the inner product in V, but the inner product on the left is the usual inner product on F: the product of the entry in the first slot with the complex conjugate of the entry in the second slot. Thus (Tu,a) = {Tu)a = (u,v)a (b) = (u,av). Comparing (a) with (b) gives (u, T*a) = (u, av) for all и € U. Thus T*a = av. 27. Suppose η is a positive integer. Define Τ € £(Fn) by T{zi, . . . , Z„) = (0,2l,...,2n-l). Find a formula for T*{z\,..., Zn). Solution: Fix (z\,..., z„) € Fn. Then for every {w\,..., wn) € Fn, we have ((u>i,..., ton), Τ*{ζχ,..., Zn)) = (T{wi,..., wn), (zi,..., Zn)) = ((0,Wi, . . . ,Wn^),(zi, . . . , Zn)) = WiZl-\ l-lUn-iZn" = ({wi,... ,wn),{z2,... ,zn,0)). Thus T*(zi,...,zn) = (z2,...,zn,0). 28. Suppose Τ € C(V) and λ € F. Prove that λ is an eigenvalue of Γ if and only if λ is an eigenvalue of T*. Solution: We have
72 Chapter 6. Inner-Product Spaces λ is an not eigenvalue of Τ <=ϊ Τ — XI is invertible <=ϊ S(T - XI) = (T- XI)S = / for some S € C(V) <=ϊ(Τ- XI)*S* = S*(T - XI)* = I for some S € C(V) <=ϊ (Τ - XI)* is invertible <=ϊ Τ* - XI is invertible ·<=>· λ is not an eigenvalue of T*. Thus λ is an eigenvalue of Τ if and only if λ is an eigenvalue of Γ*. 29. Suppose Τ € C(V) and U is a subspace of V. Prove that U is invariant under Τ if and only if J7X is invariant under T*. Solution: First suppose that U is invariant under T. To prove that UL is invariant under T*, let ν € UL. We need to show that T*v € J7X. But (u,T*v) = (Tu,v) -0 for every и € U (because if и € U, then Tu € U and hence Tu is orthogonal to v, an element of UL). Thus T*v € J7X, and hence J7X is invariant under T*, as desired. To prove the other direction, now suppose that J7X is invariant under T*. Then by the first direction, we know that (U-1)-1 is invariant under (T1*)*. But (U^ = U (by 6.33) and (T*)* = T, so U is invariant under T, completing the proof. 30. Suppose Τ € C(V, W). Prove that (a) T is injective if and only if T* is surjective; (b) Τ is surjective if and only if T* is injective. Solution: First we prove (a): Τ is injective <=*► null Τ = {0} <=ϊ (rangeΓ*)χ = {0} <=ϊ rangeT* = W <=$■ Τ* is surjective ,
73 Chapter 6. Inner-Product Spaces where the second line comes from 6.46(c). Now that (a) has been proved, (b) follows immediately by replacing Τ with T* in (a). 31. Prove that dim null T* = dim null Τ + dim W - dim V and dim range T* = dim range Τ for every Τ € C(V, W). Solution: Let Γ € C(V, W). Then dim null Γ* = dim(rangeT)-1- = dim W — dim range Τ = dim null Τ + axmW - dim V, where the first equality comes from 6.46(a), the second equality comes from Exercise 15 of this chapter, and the third equality comes from 3.4. This proves the first equality that we seek. To prove the second equality, note that dim range T* = dim W - dim null T* = dim V - dim null Τ = dim range T, where the first and third equalities come from 3.4 and the second equality comes from the first part of this exercise. This proves the second equality that we seek. 32. Suppose Λ is an m-by-n matrix of real numbers. Prove that the dimension of the span of the columns of A (in Rm) equals the dimension of the span of the rows of A (in Rn). Solution: Let Γ € £(Rn,Rm) be such that the matrix of Τ (with respect to the standard bases) equals A. Then range Γ equals the span of the columns of A. Thus
74 Chapter 6. Inner-Product Spaces dimension of the span of the columns of A = dim range Τ — dim range Τ1* = dimension of the span of the columns of λΛ{Τ*) = dimension of the span of the columns of the transpose of A = dimension of the span of the rows of A, where the second equality comes from the previous exercise and the fourth equality comes from 6.47.
Chapter 7 Operators on Inner-Product Spaces 1. Make ^(R·) into an inner-product space by defining (Р,Я)= / p{x)q(x)dx. Jo Define Τ € £(-p2(R)) by Т(<ю + ац -f- a2x2) = ац. (a) Show that Τ is not self-adjoint. (b) The matrix of Τ with respect to the basis (ΐ,χ,χ2) is " 0 0 0 0 1 0 0 " 0 0 This matrix equals its conjugate transpose, even though Τ is not self- adjoint. Explain why this is not a contradiction. Solution: (a): Note that but (Γ1,χ> = (0,χ> = 0 (1,Гх) = (1,х> _ 1 ~ 2* 75
76 Chapter 7. Operators on Inner-Product Spaces Thus (ΤΊ,χ) φ (Ι,Τχ), which shows that Τ is not self-adjoint. (b): The result stating that the matrix of T* is the conjugate transpose of the matrix of Τ has as a hypothesis that we are working with orthonormal bases (see 6.47). Because (ΐ,χ,χ2) is not an orthonormal basis of ^(R), we cannot compute the matrix of T* with respect to this basis by taking the conjugate transpose of the matrix of T. Prove or give a counterexample: the product of any two self-adjoint operators on a finite-dimensional inner-product space is self-adjoint. Solution: Let S,T € £(F2) be the operators whose matrices (with respect to the standard basis) are given by M(S) 1 0 0 2 and M{T) 0 1 1 0 Each of these matrices obviously equals its conjugate transpose, and hence S, Τ are self-adjoint. Now M{ST) = M(S)M(T) 0 1 2 0 Because M{ST) does not equal its conjugate transpose, ST is not self- adjoint. Thus we have an example of two self-adjoint operators whose product is not self-adjoint. Of course there are also many other examples. Comment: Suppose S,T € C{V) are self-adjoint. Then ST is self- adjoint if and only if ST = TS (as is easy to see). (a) Show that if V is a real inner-product space, then the set of self-adjoint operators on V is a subspace of C(V). (b) Show that if V is a complex inner-product space, then the set of self- adjoint operators on V is not a subspace of C(V). Solution: (a): Suppose V is a real inner-product space. Obviously the zero operator is self-adjoint. Furthermore, if S, Τ € C(V) are self-adjoint, then (S + T)* = S* + T* = S + T,
77 Chapter 7. Operators on Inner-Product Spaces ала thus S+T is self-adjoint. Finally, if Г € C(V) is self-adjoint and a € R, then (aT)* = aT* = aT, and thus aT is self-adjoint. We have shown that the set of self-adjoint operators on V contains the zero operator and that it is closed under addition and scalar multiplication. Thus the set of self-adjoint operators on V is a subspace of C(V). (b): Suppose now that V is a complex vector space. The identity operator / is self-adjoint, but (il)* = -il so il is not self-adjoint. Thus the set of self-adjoint operators on V is not closed under scalar multiplication and hence it is not a subspace of C(V). 4. Suppose Ρ € C(V) is such that P2 = P. Prove that Ρ is an orthogonal projection if and only if Ρ is self-adjoint. Solution: First suppose that Ρ is an orthogonal projection. Thus there is a subspace U of V such that Ρ = Рц. Suppose v\, щ. € V. Write V\ = U\+W\, V2 = U2 + U>2, where u\,U2 € U and w\,W2 € J7X (see 6.29). Now (Pvl,v2) = (ui,u2 + w2) = (ltl,U2>+ (U1,W2) = (ltl,U2> = (ttl,U2> + (lui,U2> = (щ +WX,U2) = (vi,Pvi). Thus Ρ = P*, and hence Ρ is self-adjoint. To prove the implication in the other direction, now suppose that Ρ is self-adjoint. Let υ € V. Because P(v — Pv) = Pv — P2v = 0, we have ν - Pv € null Ρ = (range P*)x = (range P)x, where the first equality comes from 6.46(c). Writing v = Pv + (v- Pv),
78 Chapter 7. Operators on Inner-Product Spaces we have Pv € range Ρ and (v — Pv) € (range P)x. Thus Pv = Prangcpv. Because this holds for all ν € V, we have Ρ = PI&ngcp, which shows that Ρ is an orthogonal projection. 5. Show that if dim V > 2, then the set of normal operators on V is not a subspace of C(V). Solution: Suppose dimV > 2. Let (ei,...,e„) be an orthonormal basis of V. Define S, Τ € C(V) by 5(αιβι + 1- Οηβη) = α.2&\ — α\&ι and T{aiei Η 1- a„en) — α^&\ + α\&ι. A simple calculation verifies that 5*(αιβι + 1- a„en) = —α.2&ι + а\&ч.· From this formula, another simple calculation shows that SS* = S*S. Yet another simple calculation shows that Τ is self-adjoint. Thus both S and Τ are normal. However, S + Τ is given by the formula (5 + Γ)(αιβι+ ··· + αηβη) = 2a2ei. A simple calculation verifies that {S + T)* (aid + ··· + a„e„) = 2axe2. A final simple calculation shows that (S + T)(S + Τ)* φ (S + T)*(S + T). In other words, S + Τ is not normal. Thus the set of normal operators on V is not closed under addition and hence is not a subspace of C(V). 6. Prove that if Γ € C{V) is normal, then range Τ = range T*. Solution: Suppose Τ is normal. Then range Γ = (null Γ*)x = (nullT)x = rangeT*,
79 Chapter 7. Operators on Inner-Product Spaces where the first equality comes from 6.46(d), the second equality comes from 7.6 (see especially the marginal comment at 7.6), and the third equality comes from 6.46(b). 7. Prove that if Γ € C(V) is normal, then null Tk = null Τ and range Tk = range Τ for every positive integer k. Solution: Suppose Τ € C(V) is normal and that к is a positive integer. Obviously we can assume that к > 2. First we will prove that null 7* = nullT. If υ € null Γ, then Tkv = Tk-l(Tv) = Tk'l0 = 0, and so υ € nullTfc. Thus null Г С nullT*. To prove an inclusion in the other direction, suppose now that ν € null Tk. Then (T*Tk-lv,T*Tk-lv) = (TT*Tk-lv,Tk-lv) = (TTkv,Tk-lv) = (0,Tk-lv) = 0, where the second equality holds because T*T = TT*. The last equality above implies that Т*Тк~1у = О. Thus 0 = (Γ*3*~Ιν>Γ*~2«> = (Tk-lv,Tk-lv). Hence Tk~lv = 0. In other words, υ € null Τ*-1. The same argument, with к replaced with к — I, shows that υ € null Τ*-2. Repeat this process until reaching the conclusion that υ € null Г. This shows that null Т* С null Г, completing that proof that nullT* = nullT. Now we will show that range Tk = range T. If υ € range Tk, then there exists и € V such that υ = Tku = T(Tk~l)u, which implies that υ € range T. Thus range Tk С range T. Note that
80 Chapter 7. Operators on Inner-Product Spaces dim range Tk = dim V - dim null Tk = dim V- dim null Τ = dim range Τ, where the first and third equalities come from 3.4 and the second equality comes from the first part of this exercise. Because range Tk and range Γ have the same dimension and one of them is contained in the other, these two subspaces of V must be equal, completing the proof. 8. Prove that there does not exist a self-adjoint operator Τ € £(R.3) such that r(l,2,3) = (0,0,0)andT(2,5,7) = (2,5,7). Solution: Suppose Τ € £(R3) is such that T(l,2,3) = (0,0,0) and T(2,5,7) = (2,5,7). Obviously (1,2,3) is an eigenvector of Γ with eigenvalue 0 and (2,5,7) is an eigenvector of Τ with eigenvalue 1. If Γ were self-adjoint, then eigenvectors corresponding to distinct eigenvalues would be orthogonal (see 7.8). Because (1,2,3) and (2,5,7) are not orthogonal, Τ cannot be self-adjoint. 9. Prove that a normal operator on a complex inner-product space is self- adjoint if and only if all its eigenvalues are real. Comment: This exercise strengthens the analogy (for normal operators) between self-adjoint operators and real numbers. Solution: Suppose V is a complex inner product space and Τ € C(V) is normal. If Γ is self-adjoint, then by 7.1 all its eigenvalues are real. Conversely, suppose that all the eigenvalues of Τ are real. By the complex spectral theorem (7.9), there is an orthonormal basis (ei,..., e„) of V consisting of eigenvectors of T. Thus there exist real numbers λι,..., λη such that Tej = Xjej for j = 1,..., n. .The matrix of Τ with respect to the basis (ei,..., e„) is the diagonal matrix with λι,..., λη on the diagonal. This matrix equals its conjugate transpose. Thus Τ = T*. In other words, Τ is self-adjoint, as desired. 10. Suppose V is a complex inner-product space and Τ € C(V) is a normal operator such that T9 = T8. Prove that Τ is self-adjoint and T2 = T. Solution: By the complex spectral theorem (7.9), there is an orthonormal basis (ei,..., e„) of V consisting of eigenvectors of T. Let λι,..., A„ be the corresponding eigenvalues. Thus
81 Chapter 7. Operators on Inner-Product Spaces Tej = Xjej for j = 1,..., n. Applying Τ repeatedly to both sides of the equation above, we get l^ej = Xj9ej and T^ej = Xj8ej. Thus λ,·9 = А_Д which implies that Xj equals 0 or 1. In particular, all the eigenvalues of Τ are real. This implies (by the previous exercise) that Τ is self-adjoint. Applying Τ to both sides of the equation above, we get I ej = Xj ej = Xjtj = Tej, where the second equality holds because Xj equals 0 or 1. Because T2 and Τ agree on a basis, they must be equal. 11. Suppose V is a complex inner-product space. Prove that every normal operator on V has a square root. (An operator S € C(V) is called a square root of Γ € C(V) if S2 = T.) Solution: Suppose Τ € C(V) is normal. By the complex spectral theorem (7.9), there is an orthonormal basis (ei,... ,βη) of V consisting of eigenvectors of T. Thus there exist complex numbers Ai,..., A„ such that Tej = Xjej for j = l,...,n. Define S to be the operator on V such that Sej = Xj1'2ej for j = l,...,n; here Xj1'2 denotes a complex square root of Xj (every nonzero complex number has two square roots—it does not matter which one is chosen). Then, as is easy to verify, S2 = T. Thus S is a square root of T. 12. Give an example of a real inner-product space V and Τ € C(V) and real numbers α, β with a2 < 4/3 such that T2 + aT + βΐ is not invertible. Comment: This exercise shows that the hypothesis that Τ is self-adjoint is needed in 7.11, even for real vector spaces. Solution: Let Τ e C(R2) be the counterclockwise rotation on R2; so T{x,y) = (~У,х) for (г.У) € R2· Thus T2 = -/. Taking α = 0 and β = 1, we have a2 < 4/3 and Τ2 + αΤ + βΙ = T2 + I = 0. In particular, T2 + aT + βΐ is not invertible.
82 Chapter 7. Operators on Inner-Product Spaces 13. Prove or give a counterexample: every self-adjoint operator on V has a cube root. (An operator S € C(V) is called a cube root of Γ € C(V) if S3 = T.) Solution: Suppose Τ G C(V) is self-adjoint. By the spectral theorem (7.13), there is an orthonormal basis (ei,...,en) of V consisting of eigenvectors of T. The corresponding eigenvalues must be real (by 7.1). Thus there exist real numbers Ai,..., An such that Tej = Xjej for j = 1,..., n. Define S to be the operator on V such that Se3· = Aj1'3e;· for j = 1,..., n. Then, as is easy to verify, 53 = T. Thus S is a cube root of T, completing the proof that every self-adjoint operator on V has a cube root. 14. Suppose Τ € C(V) is self-adjoint, λ € F, and e > 0. Prove that if there exists υ € V such that ||w|| = 1 and ||Γυ-λι/||<6, then Τ has an eigenvalue A' such that |A — A'| < e. Solution: By the spectral theorem (7.13), there is an orthonormal basis (ei,..., en) of V consisting of eigenvectors of T. Let Ai,..., An be the corresponding eigenvalues. Suppose ν G V is such that ||w|| = 1 and ||Γυ — λυ|| < e. From 6.17 we have ν = (υ, ei)ei 4- · · · + (υ, e,i)en, and so Τυ = Αι (υ, ei)ei + ■ ■ ■ + λ„(υ, en)e„. Thus £2>||Γι/-λι/||2 = ΙΙ(λι - λ)(υ,ei)ei + · · · + (A„ - AXt/.e^e»!!2 = |A1-A|2|(l;1e1>|2 + .-- + |An-A|2Kl;,en>|2 >(min{|A1-A|2,...,|An-A|2})(|(l;,e1>|2 + ... + |(l;,eIl>|2) = min{|A,-A|2,...,|An-A|2} Thus 6 > |Aj — A| for some j. In other words, there is an eigenvalue whose distance from A is less than e, as desired.
83 Chapter 7. Operators on Inner-Product Spaces 15. Suppose U is a finite-dimensional real vector space and Τ € C(U). Prove that U. has a basis consisting of eigenvectors of Τ if and only if there is an inner product on U that makes Τ into a self-adjoint operator. Solution: First suppose that U has a basis (ei,..., en) of eigenvectors of T. Because (ei,..., e„) is a basis of J7, every element of U can be uniquely written as a linear combination of (ei,...,en). Thus we can define an inner product on U by (aiei -I 1- a„en, biei -I 1- bne„) = a\bi -\ 1- anbn. It is easy to verify that this is indeed an inner product on U and that (ei,..., en) is on orthonormal basis of U with respect to this inner product. Because each ej is an eigenvector of Г, the operator Τ has a diagonal matrix with respect to the orthonormal basis (ei,..., en). Thus Τ is self-adjoint. Conversely, now suppose that there is an inner product on U that makes Τ into a self-adjoint operator. Then by the spectral theorem (7.13), U has a basis consisting of eigenvectors of T. 16. Give an example of an operator Τ on an inner product space such that Τ has an invariant subspace whose orthogonal complement is not invariant under T. Comment: This exercise shows that 7.18 can fail without the hypothesis that Τ is normal. Solution: Define Τ G £(F2) by T(to, ζ) = (ζ, 0). Then T(w, 0) = (0,0) for all to € F. Thus the subspace U defined by U = {(го,0) : w € F} is invariant under T. However, UL = {(0, z) : G F}, which is not invariant under Τ because (0,1) e Ux but Γ(0,1) = (1,0) φ UL. Of course there are also many other examples. 17. Prove that the sum of any two positive operators on V is positive. SOLUTION: Suppose S and Τ are positive operators on V. Because S and Τ are self-adjoint, so is S + T. Furthermore, ((S + 7>, υ) = (St/, υ) + (Τυ, υ) >0. Thus S + Τ is a positive operator, as desired. 18. Prove that if Γ € £{V) is positive, then so is Tk for every positive integer k.
84 Chapter 7. Operators on Inner-Product Spaces Solution: Suppose Τ € C(V) is positive and к is a positive integer. Then Tk is self-adjoint (because Τ is self-adjoint). First consider the case where к is an even integer. Then we can write к = 2m for some positive integer m. Now (Tkv,v) = (T2mv,v) = (Tmv,Tmv) >0 for every υ e V, where the second equality holds because Τ is self-adjoint. The inequality above shows that Tk is positive, as desired. Now consider the case where к is an odd integer. Then we can write к = 2m + 1 for some nonnegative integer m. Now (Tkv,v) = (T2m+lv,v) = (T(Tmv),Tmv) >0 for every ν € V, where the second equality holds because Τ is self-adjoint and the inequality holds because Τ is positive. The inequality above shows that Tk is positive, as desired. 19. Suppose that Γ is a positive operator on V. Prove that Τ is invertible if and only if (Tv, v)>0 for every иб^\ {О}. Solution: First suppose that Τ is invertible. By 7.27, there exists an operator S € C(V) such that Τ = S*S. Suppose ν € V \ {0}. Then Sv φ 0 because otherwise we would have Tv = S*Sv = 0, which would contradict the invertibility of T. Now (Tv, v) = (S'Sv, v) = (Sv, Sv) >0, as desired. Now suppose that (Tv, v) > 0 for every ν € V \ {0}. In particular, this means that Tv φ 0 for every ν € V \ {0}. Thus Τ is injective, and hence Τ is invertible (see 3.21), as desired.
85 Chapter 7. Operators on Inner-Product Spaces 20. Prove or disprove: the identity operator on F2 has infinitely many self- adjoint square roots. SOLUTION: For each t € [—1,1], the operator whose matrix (with respect to the standard basis) equals " t ν/Γ^ί5 ■ ч/ПЙ? -t is self-adjoint and a square root of the identity operator, as can be verified by squaring the matrix above. Thus the identity operator has infinitely many self-adjoint square roots. 21. Prove or give a counterexample: if S € C(V) and there exists an orthonormal basis (ei,..., en) of V such that ||5е;|| = 1 for each ej, then S is an isometry. Solution: Define S e £(F2) by S(w,z) = (to+ 2,0). With the usual inner product on F2, the standard basis ((1,0), (0,1)) is an orthonormal basis of F2. Note that ||5(1,0)|| = ||5(0,1)|| = 1. However, S is not an isometry because ||5(1, —1)|| = 0. Of course there are also many other examples. 22. Prove that if S € £(R3) is an isometry, then there exists a nonzero vector χ e R3 such that S2! = x. Solution: Suppose S € £(R3) is an isometry. Then there is a basis of R3 with respect to which S has a block diagonal matrix, where each block on the diagonal is a 1-by-l matrix containing 1 or —1 or is a 2-by-2 matrix (see 7.38). Because R3 has odd dimension, at least one of these blocks must be a 1-by-l matrix. In other words, either 1 or —1 must be an eigenvalue of S. Thus there is a nonzero vector χ € R3 such that Sx = Xx, where λ = ±1. Hence S2x = S(Sx) = S(Xx) = XSx = \2x = x. 23. Define Τ € £(F3) by T{zi,z2,z3) = (23,2.21,3.22). Find (explicitly) an isometry S € £(F3) such that Τ = Sy/T'T.
86 Chapter 7. Operators on Inner-Product Spaces 24. 0 2 0 0 0 1 0 0 3 2 0 0 1 0 0 _ 0 3 0 , which e< ' 4 0 0 0 9 0 0 0 1 Solution: With respect to the standard basis of F3, we have M(T) = Thus M{T) = Computing the product M(T*)M(T), which equals M{T*T), we get M(TT) = Prom the matrix above, we see that {T*T){z\, z2l zz) = (4.21,9.22,23). Thus Vf^T(zi1z2, 23) = (2.2i, З22,23). Hence if we define S € £(F3) by 5(21,22,23) = (23.21.22), then S is an isometry and Τ = SVT*T. Suppose Τ € C(V), S € C(V) is an isometry, and R G C(V) is a positive operator such that Τ = SR. Prove that R = \/T*T. Comment: This exercise shows that if we write Τ as the product of an isometry and a positive operator (as in the polar decomposition), then the positive operator must equal y/T*T. Solution: Taking adjoints of both sides of the equation Τ = SR, we have T* = R*S* = RS\ where the last equation holds because R is positive (and hence self-adjoint). Multiplying together our formulas for T* and T, we get T*T = RS'SR = R\
87 Chapter 7. Operators on Inner-Product Spaces where the last equation holds because S is an isometry (and hence S*S = I by 7.36). The equation above asserts that R is a square root of T*T; because R is positive, this implies that R = VT*T. 25. Suppose Τ G C{V). Prove that Τ is invertible if and only if there exists a unique isometry S G C(V) such that Τ = SVT'T. Solution: First suppose that Τ is invertible. The polar decomposition (7.41) states that there exists an isometry S G C(V) such that Τ = SVT*T. Because Τ is invertible, this implies that y/T*T is invertible (see Exercise 22 in Chapter 3). Thus the equation above implies that S = T(y/T*T) . Because S must be given by this formula, we see that there is a unique operator S G C(V) such that Τ = Sy/T'T, as desired. Now suppose that there exists a unique isometry S € C(V) such that T = SVT*T. This means that the linear map 5г in the proof of the polar decomposition (7.41) must be 0 because otherwise we could replace 5г with —S2 and get another choice for S. But range S2 equals (range T)x, and hence (range T)x = {0}. This implies that range Г = V, which implies that Τ is invertible (by 3.21), as desired. 26. Prove that if Τ G C(V) is self-adjoint, then the singular values of Τ equal the absolute values of the eigenvalues of Τ (repeated appropriately). Solution: Suppose Τ G C{V) is self-adjoint. There exists an orthonor- mal basis (ei,..., e„) of V consisting of eigenvectors of T. Thus Tej = Xjej for each j, where Ai,..., λη 6 R are the eigenvalues of T. Thus TTej = T2ej = (Α,)4· for each j. The equation above implies that y/T*Tej = |Aj|e_j for each j. Thus the singular values of Τ are |Ai|,..., |λη|, as desired. 27. Prove or give a counterexample: if Τ G C(V), then the singular values of T2 equal the squares of the singular values of T. Solution: DeEne Τ e £(F2) by
88 Chapter 7. Operators on Inner-Product Spaces T(zuz2) = (z2,0). Then T*T{zi,z2) = (0,z2) and hence VT*T(z1,z2) = {0,z2). Thus the eigenvalues of \/T*T are 0,1. Hence the singular values of Τ are 0,1. However, T2 = 0, so the singular values of T2 are 0,0. Thus for this operator T, the singular values of T2 do not equal the squares of the singular values of T. Of course there are also many other examples. 28. Suppose Τ € C(V). Prove that Τ is invertible if and only if 0 is not a singular value of T. Solution: If S e C{V) and ST=TS = /, then taking adjoints we get T*S* = S'T* = I. Thus if Γ is invertible, then so is T\ Now Τ is invertible <=> Τ and T* are invertible <=> T*T is invertible <=> y/T*TVT*T is invertible <=> VT'T is invertible <=> 0 is not an eigenvalue of VT*T <=> 0 is not a singular value of T, where the second and fourth equivalences follow from Exercise 22 in Chapter 2. 29. Suppose Τ € C(V). Prove that dim range Γ equals the number of nonzero singular values of T. Solution: By the singular value decomposition (7.46), there exist or- thonormal bases (u\,..., Un) and (w\,..., wn) of V such that Tv = si (υ, ui)wi -I 1- sn(i>, un)wn for every υ € V, where s\,... ,sn are the singular values of T. For each j, we have Tu3· = SjWj. Thus each wj corresponding to a nonzero Sj is in rangeT. The equation above also shows that the toy's corresponding to nonzero Sj's span range T. Thus dim range Τ equals the number of nonzero singular values of T. 30. Suppose S Ε C(V). Prove that S is an isometry if and only if all the singular values of S equal 1.
89 Chapter 7. Operators on Inner-Product Spaces Solution: We have S is an isometry <=> S*S = I <=> all the eigenvalues of VS*S equal 1 <=> all the singular values of S equal 1, where the first equivalence comes from 7.36 and the third equivalence comes from the spectral theorem (7.9 or 7.13) applied to the self-adjoint operator 31. Suppose Γι, T2 € C(V). Prove that Γι and T2 have the same singular values if and only if there exist isometries Si,S2 € C(V) such that Γι = SiT2S2. Solution: First suppose that Γι and Γ2 have the same singular values s\,..., sn. By the singular-value decomposition (7.46), there exist orthonor- mal bases (eb ... ,e„), (/1,... ,/n), {е\,... ,e'n), (/{,..- ,f'n) of V such that Txv = si(v,ei)fi + --- + sn(v,en)fn, T2v = si(v, e\)f[ + ■■■ + sn(v,e'n)f'n for every υ 6 V. Define Si,S2€ C(V) by Si (αι/ί + · · · + anf'n) = 01/1+··- + anfn, S2(aiei +■■■ + a„e„) = сце\ + ■■■ + One'n. Then ||51(α1/ί + ··· + αη/;)||2 = ||α1/1+..-+αη/η||2 = |&1|2 + ..· + |αη|2 = ||α1/ί+.-·+αη/;||2, and thus Si is an isometry. Similarly, S2 is an isometry. This implies that 52* = S2-1 (see 7.36). In particular, Sfe'j = ej. Now for υ € V we have T2(S2v) = si^t/.e'jj/i + · · · + sn(S2v,e'n)f'n = si(v, S2*e\)f[ +■■■ + sn(v, S2*e'n)f'n = si(v,ei)f[ + ■■■ +sn(v,en)f'n.
90 Chapter 7. Operators on Inner-Product Spaces Thus Si{T2S2v) = si(v,ei)Sif[ +·■· + SnfaejSifn = si(v,ei)fi +--- + sn(v,en)fn = Τιυ for every υ eV. Hence S1T2S2 = ΤΊ, as desired. To prove the implication in the other direction, now suppose that there exist isometries 5ь5г € С-(У) such that ΤΊ = S1T2S2· Using 7.36, we have 1\ 1\ — 02 -12 &1 011202 = 1S2 72*72ιί?2· This implies that T\*Ti and ТУТг have the same eigenvalues (and that the corresponding spaces of eigenvectors have the same dimensions). Thus T\ and T2 have the same singular values. 32. Suppose Τ € £(^) has singular-value decomposition given by Τυ = 5ι(υ,βι)/ι + ··· + sn(v1en)fn for every ν € V, where s\,... ,sn are the singular values of Τ and (ei,..., en) and (/1,..., /„) are orthonormal bases of V. (a) Prove that T'v = si (υ, /i)ei + · · · + sn(v, fn)en for every υ € V. (b) Prove that if Τ is invertible, then T-i„ <",/i)ei <«,/η)βη 1 υ = + · · · Η Sl Sn for every υ € V. Solution: (a): Fix υ e V. Then (to, T*v) = (Tw, υ) = (si (w, ei)/i + · · · + sn(w, e„)/n, v) = si(w,ei)(fi,v) + --- + sn(w,en)(fn,v) = (w,si(v,fi)ei +--- + sn(v,fn)en)
91 Chapter 7. Operators on Inner-Product Spaces for all w G V. This implies that T*v = si (υ, /i)ei + --- + sn(v, /n)e„, as desired. (b): Suppose Τ is invertible. Let υ €V and let ("■/i)ei , , (v,fn)en w — 1 Λ ; si sn none of the singular values si,...,sn equals 0 (see Exercise 28 of this chapter), so this makes sense. Now T (v,fi)Tei , , («,/η)Γβ» lw = 1 -i Si 8n (υ, fi)sifi , , (v,fn)snfn — 1- ·· · Η Si Sn = <«,/l>/l +··· + <«,/n>/n = V. Thus ш = Γ-1υ, as desired. 33. Suppose Τ G C(V). Let s denote the smallest singular value of T, and let s denote the largest singular value of T. Prove that sibll < \\Tv\\ < s\\v\\ for every υ €V. Solution: Let ν G V. By the singular value decomposition (7.46), there exist orthonormal bases (ui,..., Un\ and {w\,..., wn) of V such that Tv = si(v, ui)wi -1 1- sn(v,un)wn, where si,..., sn are the singular values of T. Because (ui,..., un) and (toi,..., wn) are both orthonormal bases of V, we have ё2\М2 = з2(\(ь,щ)\2 + ... + \(ь,ип)\2) <si2\(v,u1)\2 + -- + sn2\(v,un)\2 = \\Tvf,
92 Chapter 7. Operators on Inner-Product Spaces giving the first desired inequality. Also, \\Tv\\2 = s12\(v,u1)\2 + --- + sn2\(v,un)\2 <*2(|<υ,"ι>|2 + ··· + Κ^η>|2) = s2\\v\\2, giving the second desired inequality. 34. Suppose T", T" 6 C(V). Let s' denote the largest singular value of T", let s" denote the largest singular value of T", and let s denote the largest singular value of T" + T". Prove that s < s' 4- s". Solution: Let Τ = Τ" + Τ". Because s is a singular value of T, we know that s is an eigenvalue of VT*T. Thus there exists a vector υ Ε V such that ||υ|| = 1 and y/T*Tv = sv. Now s = \\sv\\ = Hv^ft/Ц = \\Tv\\ = \\rv + r'v\\ < \\rv\\ + \\Г'ь\\ <s'\\v\\+s"\H = s' + s", where the third line above comes from 7.42 and the sixth line above comes from the previous exercise.
Chapter 8 Operators on Complex Vector Spaces 1. Define Τ € £(C2) by T(w,z) = (z,0). Find all generalized eigenvectors of T. Solution: Suppose λ is an eigenvalue of T. For this particular operator, the eigenvalue-eigenvector equation T{w, z) = A(tu, z) becomes the system of equations ζ = Xw 0 = Xz. If λ φ 0, then the second equation implies that ζ = 0, and the first equation then implies that w = 0. Because an eigenvalue must have a nonzero eigenvector, this shows that 0 is the only possible eigenvalue of T. For λ = 0, the equations above show that ζ must equal 0, but w can be arbitrary. Thus 0 is indeed an eigenvalue of T, and the set of eigenvectors corresponding to this eigenvalue is {(w)0):weF}. Note that T2 = 0. Thus every vector in C2 is a generalized eigenvector of Τ (corresponding to the eigenvalue 0). 93
94 Chapter 8. Operators on Complex Vector Spaces 2. Define Τ G £(C2) by T(w,z) = (-z,w). Find all generalized eigenvectors of T. Solution: On page 78 of the textbook we saw that the eigenvalues of Τ are i and -i. Note that T2 = -I. The set of generalized eigenvectors of Τ corresponding to the eigenvalue г equals ηιι11(!Γ — il)2 (by 8.7). To compute this, note that (T - il)2 = T2-2iT-I = -21 - 2iT = -2i(T - il). Thus пи11(!Г — il)2 equals the set of eigenvectors of Τ corresponding to the eigenvalue i. On page 78 of the textbook we noted that this equals {(a,-ia) :a€ C}. The set of generalized eigenvectors of Τ corresponding to the eigenvalue —i equals n\i\\(T + H)2 (by 8.7). To compute this, note that (T + il)2 = T2 + 2iT - I = -21 + 2iT = 2i{T + iI). Thus пи11(!Г + il)2 equals the set of eigenvectors of Τ corresponding to the eigenvalue —i. On page 78 of the textbook we noted that this equals {(a,ia) : a G CK 3. Suppose Τ G C{V), m is apositive integer, and υ G V is such that Tm~1v φ Ο but T^-v = 0. Prove that (υ,Γυ,Γν..,"!™-1!/) is linearly independent. Solution: Suppose ao, a.\, аг,..., am_i G F are such that oqv + aiTv + a2T2v + ■ ■ · + am-1Tn-lv = 0. Because Τ^υ = 0, if we apply Τ™-1 to both sides of the equation above, we get oqI^-^v = 0. Because Γ171-^ φ 0, this implies that Go = 0. Thus the equation above can be rewritten as
95 Chapter 8. Operators on Complex Vector Spaces aiTv + a2T2v + ■■■+ am-iT"1-1!/ = 0. Applying Τ™-2 to both sides of this equation, we get αιΤ™-^ = 0. Thus a\ = 0. Continuing in this fashion, we have clq = αϊ = a2 = · · · = Om-i = 0, which means that (υ, Tv, T2v,..., Τ"1- ιυ) is linearly independent. 4. Suppose Τ € £(C3) is denned by T(zi, z2, zz) = (z2, z3,0). Prove that Τ has no square root. More precisely, prove that there does not exist S € £(C3) such that S2 = T. Solution: Note that Γ3 = 0. Suppose there exists S € £(C3) such that S2 = T. Then S6 = Γ3 = 0, so S is nilpotent. By 8.8, this implies that S3 = 0. Thus T2 = S4 = SS3 = 0. But Tl2(,zi,Z2,Z3) = (23,0,0), so T2 is not the 0 operator, contradicting the equation above. This contradiction shows that our supposition that there exists S € £(C3) such that S2 = Τ must have been false. 5. Suppose S,T € £(V). Prove that if ST is nilpotent, then TS is nilpotent. Solution: Suppose ST is nilpotent. Thus there exists a positive integer η such that (ST)n = 0. Now (TS)n+l = (TS)(TS)... (TS) = T(ST)(ST)...(ST)S = T(ST)nS = CO(o)(S) = 0, and thus TS is nilpotent. 6. Suppose ./V G £(V) is nilpotent. Prove (without using 8.26) that 0 is the only eigenvalue of N. Solution: There is a positive integer m such that Nm = 0. This implies that ./V is not injective, so 0 is an eigenvalue of N. Conversely, suppose A is an eigenvalue of N. Then there exists a nonzero vector υ G V such that
96 Chapter 8. Operators on Complex Vector Spaces λυ = Nv. Repeatedly applying N to both sides of this equation shows that Xmv = Ν™υ = 0. Thus λ = 0, as desired. 7. Suppose V is an inner-product space. Prove that if TV € C(V) is self-adjoint and nilpotent, then N = 0. Solution: Suppose ./V g £(V) is self-adjoint and nilpotent. Because ./V is self-adjoint, there is an orthonormal basis (ei,... , en) of V consisting of eigenvectors of ./V (by the spectral theorem). Because N is nilpotent, 0 is the only eigenvalue of ./V (see Exercise б of this chapter). Thus the eigenvalue corresponding to each ey must equal 0. In other words, Ne3- = 0 for each j. Because (ei,..., en) is a basis of V, this implies that N = 0. 8. Suppose N G C(V) is such that nullWdimV/-1 φ nullNdimV. Prove that N is nilpotent and that dim null N3 = j for every integer j with 0 < j < dim V. Solution: Because null Ndim v~l φ null Ndim v, we know (by 8.5) that null Ν3~λ φ null Nj whenever 0 < j < dim V. Thus {0} = nullJV°c шШЛ^С ...С шШЛГВт1'-1с null7VdimV/. 1 J + + + + At each of the strict inclusions in the chain above, the dimension must increase by at least 1. However, if the dimension increases by more than 1 at any step, we would end up with dim null Ndim v > dimV, a contradiction because a subspace of V cannot have dimension larger than dim V. Thus the dimension increases by exactly one at each step. In other words, dim null N] = j for every integer j with 0 < j < dimV". In particular, taking j = dimV, we have dim null 7Vdim v = dimK. This means that nullNdimV = у Thus jy-dimV = 0) ша SQ N is nilpotent. 9. Suppose Τ G C(V) and m is a nonnegative integer such that range Τ"1 = range T"*l. Prove that range Tk = range T™ for all к > τη.
97 Chapter 8. Operators on Complex Vector Spaces Solution: Suppose и G rangeTm+l. Thus there exists a vector υ in V (the domain of T) such that и = T"1*1^ Now Ί^υ is in range Ί™, which by our hypothesis equals range Tm+1. Thus there exists w G V such that Ί^υ = Гт+1ш. Putting all this together, we have u = г"И-1,, = Г(Т,т+1го) = t^+V Thus и G rangeT"171"*"2. Because и was an arbitrary vector in rangeT'm+1) we have shown that range Т4™"1 С rangeT"n+2. We also have an easy inclusion in the other direction, so we conclude that range Tm+l = range T"171"*"2. In the paragraph above, we showed that rangeT™ = rangeT,m+1 implies rangeT"171"*"1 = range Τ™*2. Apply that result, with m replaced with m + 1, to conclude that range T"1*2 = range J™*3. Continuing in this fashion, we see that range Τ"1 = range Τ""-*-1 = range Τ"η+2 as desired. 10. Prove or give a counterexample: if Τ G £(У), then V = null Τ Θ range Т. Solution: Define Τ G £(F2) by T(w,z) = (z,0). Thus null Τ = rangeT = {(to,0) : to G F}, which clearly implies that F2 is not the direct sum of null Τ and range T. Of course there are also many other examples. 11. Prove that if Τ G C(V), then Vr = nullT,r*©rangeTn, where η = dim V. Solution: Let Τ G £(V). First we show that V = null 7™ + range T".
98 Chapter 8. Operators on Complex Vector Spaces To do this, let υ € V. Then v = (v-Tnu)+Tnu for any vector и € V. Obviously Г"и € range Τ". Thus looking at the equation above, we see that we need to show that there exists и € V such that υ — T^u € null Τ". In other words, we want a vector и € V such that T^iv - T^u) = 0, which is equivalent to Τ"υ = T2nu. But T^v € range?"1, and ranger" = rangeT2" (by 8.9), so Τ"υ € rangeT2". Thus there indeed exists и € V such that T^v = T2nu, completing our proof υ € null T" + range T™. Because υ was an arbitrary vector in V, this implies that V = nullT" + range Γ". For any linear map (and in particular for I"1), the dimension of the domain equals the sum of the dimensions of the null space and range (by 3.4). In other words, dim V = dim nullT" + dim range T". This equation, along with the equation V = null IT1 + range T™, implies that V = nullT" Φ range Τ" (by 2.19). 12. Suppose V is a complex vector space, N € C(V), and 0 is the only eigenvalue of N. Prove that N is nilpotent. Give an example to show that this is not necessarily true on a real vector space. Solution: Because 0 is the only eigenvalue of N, 8.23(a) implies that every vector in V is a generalized eigenvector of Τ corresponding to the eigenvalue 0. This implies that N is nilpotent. Define Τ € £(R3) by T(x,y,z) = (-j/,x,0). Then 0 is an eigenvalue of Τ because Γ(0,0,1) = (0,0,0). As can be verified from the definition of eigenvalue, Τ has no other eigenvalues (which must be in R, because Γ is an operator on a real vector space). However, T^x, y, z) = (y, —x, 0). In particular, Τ3 φ 0. Thus Τ is not nilpotent. Of course there are also many other examples. 13. Suppose that V is a complex vector space with dim V = η and Τ € C(V) is such that nullTn-2^nullTn-1. Prove that Τ has at most two distinct eigenvalues.
99 Chapter 8. Operators on Complex Vector Spaces Solution: Because nullT"-2 Φ nullT"-1, we see that dimnullTJ' is at least 1 more than dim null !P'"l for j = I,...,n - 1 (by 8.5). Thus dim null T"-1 > n- 1. In particular, 0 is an eigenvalue of Γ with multiplicity at least η - I. Because the sum of the multiplicities of all the eigenvalues of Τ equals η (by 8.18), this implies that Τ can have at most one additional eigenvalue. 14. Give an example of an operator on C4 whose characteristic polynomial equals (z - 7)2(z - 8)2. Solution: Define Τ G £(C4) by T(zu Z2,z3,Zi) = (7ζι,7ζ2,8ζ3,8ζ4). Then ηιι11(!Γ - 71) is the two-dimensional subspace {(21,22,0,0): 21,22 GC} and пи11(!Г - 81) is the two-dimensional subspace {(0,0,23,24): 23,24 GC}. Thus 7 is an eigenvalue of Τ with multiplicity at least 2 and 8 is an eigenvalue of Γ with multiplicity at least 2. Because 2+2 = 4 = dim C4, there can be no other eigenvalues of Τ and the eigenvalues 7 and 8 must have multiplicity 2 (by 8.18). Thus the characteristic polynomial of Γ equals (2 - 7)2(2 — 8)2. Of course there are also many other examples. 15. Suppose V is a complex vector space. Suppose Τ G C(V) is such that 5 and 6 are eigenvalues of Τ and that Τ has no other eigenvalues. Prove that (T-5I)n-l(T-6I)n-1 =0, where η = dim V. Solution: Because 5 and 6 are eigenvalues of Τ and Τ has no other eigenvalues, the characteristic polynomial of Τ must be of the form (z-5)d>(z-6)d\ where 1 < di and 1 < d^. Because di + d2 = n, we must also have di < n- 1 and d2 < η — 1. The Cayley-Hamilton theorem implies that (T-5I)dl(T-6l)d2 =0.
100 Chapter 8. Operators on Complex Vector Spaces Because di < η - 1 and ά-ι < η - 1, we can multiply the equation above by appropriate powers of Τ -51 and T- 61 to get (Γ-5/)η_1(Γ-6/)η_1 = 0. 16. Suppose V is a complex vector space and Τ € C{V). Prove that V has a basis consisting of eigenvectors of Τ if and only if every generalized eigenvector of Τ is an eigenvector of T. Comment: For complex vector spaces, this exercise adds another equivalence to the list given by 5.21. Solution: First suppose that V has a basis consisting of eigenvectors of T. Thus there exists a basis («i,..., vn) of V and Ai,..., An € С such that Tvj = XjVj for each j. Suppose υ € V is a generalized eigenvector of Τ corresponding to an eigenvalue A. Because («i,..., υη) is a basis of V, there exist αϊ,..., an € С such that ν = αινι Η 1- anvn. Thus (Γ - λ/)υ = (Αι - λ)αιυι + ■ ■ ■ + (λη - λ)αηυη. Applying Τ — XI repeatedly to both sides of this equation, we get (Γ - λ/)ηυ = (λ! - Χ)ηαιυι + .. - + (λ„ - A)na„t;n. The left side of the equation above equals 0 (because υ is a generalized eigenvector of Τ corresponding to the eigenvalue A). Thus the right side of the equation above equals 0. Thus (Ay — A)nay = 0 for each j. This implies that the indices j such that a.j φ 0 must all have all satisfy Ay = A, which means that υ is a linear combination of the Vj's that correspond to eigenvalue A, which means that υ is itself an eigenvector corresponding to eigenvalue A, as desired. To prove the other direction, now suppose that every generalized eigenvector of Γ is an eigenvector of T. By 8.25, there exists a basis of V consisting of generalized eigenvectors of T. Because every generalized eigenvector of Τ is an eigenvector of T, this gives a basis of V consisting of eigenvectors of T, as desired. 17. Suppose V is an inner-product space and N € C(V) is nilpotent. Prove that there exists an orthonormal basis of V with respect to which N has an upper-triangular matrix.
101 Chapter 8. Operators on Complex Vector Spaces Solution: By 8.26, there is a basis of V with respect to which N has an upper-triangular matrix. By 6.27, this basis can be chosen to be orthonormal. (If V is a complex inner-product space, then we can just use 6.28 directly, so the hypothesis that N is nilpotent is not needed.) 18. Define N € £(F5) by N(xi,x2,X3,X4,Xb) = (2х2,Зхз,-Х4,4х5,0). Find a square root oi I + N. Solution: Note that N2(xi,X2,X3,X4,Xb) = (6X3,-3X4,-4X5,0,0) iV3(xi,X2,X3,X4,X5) = (-6X4,-12X5,0,0,0) iV4(xi, X2, X3, хл, хь) = (-24x5,0,0,0,0) Ν5(χ1,χ2,Χ3,χ4,Χ5) = (0,0,0,0,0). Because TV5 = 0, the proof of 8.30 shows that is a square root oi I + N. Using the formulas above, we calculate that the operator S € £(F5) defined by 5(X1,X2,X3,X4,X5) = / Зхз 3X4 15X5 Зхз 3X4 ЗХ5 (x1+x2- —- —+—, x2 + —+ —-—, Хз _ ~2~ + ~2~' Xa + 2X5' Xb) is a square root of / + N. 19. Prove that if V is a complex vector space, then every invertible operator on V has a cube root. Solution: First suppose that ./V € £(V) is nilpotent. We will show that I + N has a cube root by imitating the proof of 8.30. Specifically, we guess that there is a cube root of / + N of the form / + aiN + a2N2 + ■··+ (hn-iN
102 Chapter 8. Operators on Complex Vector Spaces where m is such that Nm = 0. Having made this guess, we can try to choose αϊ, α2,..., Om-ι so that the operator above has its cube equal to I+N. Now (I + alN + a2N2 + a3N3 + ---+am-iNm-1)3 = I + 3<nN + (За2 + 3ai 2)W2 + (3a3 + 6ai a2 + ai 3)W3 + · · · + (За™-! + terms involving ai,... ,Οπ,^ίΓ1"1. We want the right side of the equation above to equal I + N. Hence choose αϊ so that 3αι = 1 (thus ai = 1/3). Next, choose a2 so that За2 + 3ai2 = 0 (thus a2 = —1/9). Then choose аз so that the coefficient of N3 on the right side of the equation above equals 0 (thus аз = 5/81). Continue in this fashion for j = 4,..., m— 1, at each step solving for ay so that the coefficient of TV·7 on the right side of the equation above equals 0. Actually we don't care about the explicit formula for the ay's. We need only know that some choice of the ay's gives a cube root of / + N. Having shown that the identity plus a nilpotent always has a cube root, we now look at the proof that every invertible operator on a complex vector space has a square root (see 8.32). In that proof, if we replace the words "square root" with "cube root", we get a proof that every invertible operator on a complex vector space has a cube root. 20. Suppose Τ € C(V) is invertible. Prove that there exists a polynomial ρ € V(F) such that T~l = p(T). Solution: Let ao + a\z + · ■ · + Om-iz"1-1 + zm denote the minimal polynomial of T. This is the monic polynomial of smallest degree such that aQI + aiT + --+am-iTm-l+Tm = 0. If clq were equal to 0, then we could multiply both sides of the equation above by T~l to get a1I + a2T + --+am-iTm-2 + Tm-1 =0, which would give a monic polynomial q of smaller degree such that q(T) = 0, contradicting the definition of the minimal polynomial. Thus clq φ 0. Because αο φ 0, we can solve the first equation above for the identity operator /, getting r _ _aJ.<Ti _ . . . _ a"1~1 rpm-l Lt™ ao ao ao
103 Chapter 8. Operators on Complex Vector Spaces Now multiply both sides of the equation above by Γ ι, getting -1 _ _alr_a2T атп-1ггт-2 1 Setting rp-l _ _ZLj _ ^_ψ _ ... _ am-lrpm-2 _rpm-l CLQ CtQ Oq CLQ we thus have T~l = p(T). 21. Give an example of an operator on C3 whose minimal polynomial equals z2. Solution: DeHne Τ € £(C3) by T(wuW2yW3) = (ш3)0,0). Clearly T2 = 0. In other words, the polynomial z2 when applied to Τ gives 0. Thus the minimal polynomial of Γ is a divisor of z2 (by 8.34). But the only monic polynomials that divide z2 are 1, z, and z2. The polynomial 1 applied to Τ gives the identity operator, which is not 0, and the polynomial ζ applied to Τ gives T, which is also not 0. Thus the minimal polynomial of Τ must hez2. Of course there are also many other examples. 22. Give an example of an operator on C4 whose minimal polynomial equals Φ-i)2· SOLUTION: Define Τ € £(C4) by T(wi, io2, юз, wA) = (0, w-i + to4, to3, wA). Then (T - I)(wi,W2,W3iW4) = (-tub 104,0,0), which implies that (Τ - Ι)2{υ>ι.,υ)2,υ)3,ιυΑ) = {wu 0,0,0), which implies that T{T - I)2 = 0.
104 Chapter 8. Operators on Complex Vector Spaces In other words, the polynomial z(z - l)2 when applied to Τ gives 0. Thus the minimal polynomial of Γ is a divisor of z(z - l)2 (by 8.34). Note that 0 is an eigenvalue of Γ because T(l, 0,0,0) = (0,0,0,0) and 1 is an eigenvalue of Τ because T(0,1,0,0) = (0,1,0,0). Thus 0 and 1 must both be roots of the minimal polynomial of Τ (by 8.36). The only monic polynomials that divide z(z — l)2 and have 0,1 as roots are z{z - 1) and z(z — l)2. Because T(T — Ι) φ 0, as is easy to check, this implies that z{z — l)2 is the minimal polynomial of T. Of course -there are also many other examples. 23. Suppose V is a complex vector space and Τ € C[V). Prove that V has a basis consisting of eigenvectors of Τ if and only if the minimal polynomial of Τ has no repeated roots. Comment: For complex vector spaces, this exercise adds another equivalence to the list given by 5.21. Solution: First suppose that there is a basis («i,..., vn) of V consisting of eigenvectors of T. Let Ai,..., Am be the distinct eigenvalues of T. Then for each vj, there exists At with (T - \kl)vj = 0. Thus {T-\iI)...{T-^1)4 = 0 for each j (because all the operators in sight commute, for each j the appropriate Τ — \kl can be moved to the last position in the product above). An operator that sends each vector in a basis to the 0 vector is the 0 operator, so (T-\lI)...(T-\mI) = 0. Thus the polynomial (z — Ai)... (z - Am) when applied to Τ gives 0. Thus the minimal polynomial of Γ is a divisor of {z — \\)... (z — Am) (by 8.34). Because (z — Ai)... (z - Am) has no repeated roots, this implies that the minimal polynomial of Τ has no repeated roots, as desired. (Because each eigenvalue of Τ must be a root of the minimal polynomial of T, the minimal polynomial of Τ actually equals (z - Ai)... (z - Am).) To prove the implication in the other direction, now suppose that the minimal polynomial of Τ has no repeated roots. Letting Ai,..., Am denote the distinct eigenvalues of T, this means that the minimal polynomial of Τ equals (ζ- Χι)...(ζ- Am).
105 Chapter 8. Operators on Complex Vector Spaces Thus (T-XlI)...(T-XmI) = 0. Let Um be the subspace of generalized eigenvectors corresponding to the eigenvalue Am. Recall that Um is invariant under Τ (see 8.23(b)). Suppose υ € Um- Let и = (Τ - Xml)v, so и € Um. The equation above implies that (T\Um - λι J)... {T\Um - Xm-JJu ={T- λι/)... (Г - XmI)v = 0. Because (T — XmI)\(jm is nilpotent (see see 8.23(c)), 0 is the only eigenvalue of (Г - XmI)\Um (this follows from 8.26 and 5.18). Thus T\Um - Ay/ is invertible (as an operator on Um) for j = 1,..., m — 1. The equation above thus implies that и = 0. In other words, υ is an eigenvector of T. We have shown that every generalized eigenvector of Τ corresponding to the eigenvalue Am is an eigenvector of T. There is nothing special about the eigenvalue Am—we could have relabeled the eigenvalues so that any of them was called Am. Thus we can conclude that every generalized eigenvector of Τ is actually an eigenvector of T. Because there is a basis of V consisting of generalized eigenvectors of Τ (see 8.25), this means that there is a basis of V consisting of eigenvectors of T, as desired. 24. Suppose V is an inner-product space. Prove that if Γ € £>{V) is normal, then the minimal polynomial of Τ has no repeated roots. Solution: Suppose Τ € £(V) is normal. Let ρ denote the minimal polynomial of T. Suppose A € F is an eigenvalue of T. We can write ρ(ζ) = (ζ-ΧΓς(ζ) where m is a positive integer and q is a mojaic polynomial such that д(А) φ 0. Our goal is to prove that m = 1, which implies that ρ has no repeated roots. Because p[T) = 0, we know that (T - XI)mq(T) = 0. This is equivalent to the statement that range q(T) С null(T - A/)m. Because Τ is normal, so is Τ - XI, and thus null(T - A/)m = null(T - XI) (by Exercise 7 in Chapter 7). Hence the set inclusion above becomes range9(T)CnuU(T-A/),
106 Chapter 8. Operators on Complex Vector Spaces which implies that (T—\I)q(T) = 0. Letting pi(z) = (z—\)q(z), this means that pi-is a monic polynomial with the property that Pi(T) =0. If m > 1, then the degree of pi would be less than the degree of p, contradicting the definition of minimal polynomial. Thus we can conclude that m = 1, as desired. Comment: The proof given above works on both real and complex vector spaces. If V is a complex vector space, then this exercise can be done by using the complex spectral theorem (7.9) and Exercise 23 of this chapter. 25. Suppose Τ € C(V) and ν € V. Let ρ be the monic polynomial of smallest degree such that p(T)v = 0. Prove that ρ divides the minimal polynomial of T. Solution: Let q denote the minimal polynomial of T. By the division algorithm (4.5), there exist polynomials s,r € V(F) such that q = sp + r and deg г < deg p. Thus q(T)v = s(T)p(T)v + r(7>. Because q(T) = 0 and p(T)v = 0, the equation above shows that r(T)v = 0. This implies that г = 0 (otherwise we could multiply г by a scalar to get a monic polynomial with degree smaller than deg ρ that when applied to Τ gives an operator having υ in its null space, which would contract the definition of ρ as the monic polynomial of smallest degree with this property). Using the information that г = 0, rewrite the formula above for q as q = sp. Thus ρ divides q, the minimal polynomial of T. 26. Give an example of an operator on C4 whose characteristic and minimal polynomials both equal z(z — l)2(z — 3). Solution: Define Τ € £(C4) by T(wi, W2,W3,Wi) = (Q,W2 + W3,W3,3W4).
107 Chapter 8. Operators on Complex Vector Spaces An easy computation shows that T(T — I)2{T — 31) = 0. Thus the minimal polynomial of Γ is a divisor of z(z — l)2(z — 3) (by 8.34). Note that 0 is an eigenvalue of Τ because T(l,0,0,0) = (0,0,0,0) and 1 is an eigenvalue of Τ because T(0,1,0,0) = (0,1,0,0) and 3 is an eigenvalue of Τ because T(0,0,0,1) = (0,0,0,3). Thus 0, 1, and 3 must be roots of the minimal polynomial of Τ (by 8.36). The only monic polynomials that divide z(z — l)2(z—3) and have 0,1,3 as roots are z(z-l)(z-3) and z(z-l)2(z-Z). BecauseT(T- 1)(Τ-3) φ 0, as is easy to check, this implies that z(z — l)2(z — 3) is the minimal polynomial of T, as desired. Because Τ is an operator on a four-dimensional complex vector space and the minimal polynomial of Τ has degree 4, the characteristic polynomial of Τ (which is a monic polynomial of degree 4 that is divisible by the minimal polynomial of T) must equal the minimal polynomial of T. Of course there are also many other examples. 27. Give an example of an operator on C4 whose characteristic polynomial equals z(z — l)2(z - 3) and whose minimal polynomial equals z(z - l)(z - 3). Solution: Define Τ € £(C4) by T{wi,W2,WZ,Wi) = (0,tu2,tu3,3tu4). An easy computation shows that T(T—I)(T—31) = 0. Thus the minimal polynomial of Τ is a divisor of z{z — l)(z — 3) (by 8.34). Note that 0 is an eigenvalue of Τ because T(l,0,0,0) = (0,0,0,0) and 1 is an eigenvalue of Τ because Γ(0,1,0,0) = (0,1,0,0) and 3 is an eigenvalue of Г because T(0,0,0,1) = (0,0,0,3). Thus 0, l,and3 must be roots of the minimal polynomial of Τ (by 8.36). The only monic polynomial that is a divisor of z(z — l)(z — 3) and has 0,1,3 as roots is z(z — \)(z — 3). Thus z(z — l)(z — 3) is the minimal polynomial of T, as desired. Note that every vector in {(0,102,103,0) : 102,103 € C) is an eigenvector of Τ corresponding to the eigenvalue 1. Thus the eigenvalue 1 of Γ has multiplicity at least 2. The eigenvalues 0 and 3 of Γ have multiplicity at least 1. Because Γ is an operator on a four-dimensional complex vector space, the sum of the multiplicities of all the eigenvalues equals 4 (by 8.18). Thus the each use of the phrase "at least" in the previous paragraph can be replaced by "equal" because if any of the eigenvalues had larger multiplicity, the sum of the multiplicities of all the eigenvalues would exceed 4.
108 Chapter 8. Operators on Complex Vector Spaces Because 0 and 3 are eigenvalues of Τ with multiplicity 1 and 1 is an eigenvalue of Τ with multiplicity 2 and Τ has no other eigenvalues (the multiplicities of the eigenvalues mentioned already sum to 4), the characteristic polynomial of Τ equals z(z - l)2(z - 3), as desired. Of course there are also many other examples. 28. Suppose Go,... ,αη-ι € C. Find the minimal and characteristic polynomials of the operator on Cn whose matrix (with respect to the standard basis) is 0 —ao 1 0 -ai 1 '" · -a2 0 -a„_2 1 -an_! Comment: This exercise shows that every monic polynomial is the characteristic polynomial of some operator. Solution: Suppose that Τ € £(Cn) has matrix as above with respect to the standard basis (ei,..., e„) of C". Thus Tex = e2 T2ei = Te2 = e3 r'-'ei =Te„_i = e„ T^ei = Ten = -aoei — a\ei Οη-ιβη. Thus (еьГеьГ2еь...,Гп-1е1) = (еье2,ез,...,еп). In particular, (ei,Te\,T2ei,... ,Tn~le\) is linearly independent. Thus if ρ is a monic polynomial with degree less than n, then p{T)e\ φ 0. Thus the minimal polynomial of Τ must have degree n. Writing l^ei as a linear combination of (ei,T'ei,T2ei,...,Tn_1ei) is possible in only one way: "icy = — clqBi — a\Te\ — · · · — On-iT" e\. Thus setting
109 Chapter 8. Operators on Complex Vector Spaces p(z) =go +a\z-\ |-ап_1.гп-1 + zn, we see that ρ is the only monic polynomial of degree η such that p(T)e\ = 0. Hence ρ must equal the minimal polynomial of T. Because Τ is an operator on an η-dimensional complex vector space and the minimal polynomial of Τ has degree n, the characteristic polynomial of Τ (which is a monic polynomial of degree η that is divisible by the minimal polynomial of T) must equal the minimal polynomial of T. Thus the characteristic polynomial of Τ also equals p. 29. Suppose N € C(V) is nilpotent. Prove that the minimal polynomial of N is zm+l, where m is the length of the longest consecutive string of l's that appears on the line directly above the diagonal in the matrix of N with respect to any Jordan basis for N. Solution: Suppose (vi,...,vn) is a Jordan basis for N and that m equals the length of the longest consecutive string of l's that appears on the line directly above the diagonal of M(N, (v\,...,υη)). The diagonal of Λ4(Ν, (v\, ... ,vn)) contains only 0's (by Exercise 6 of this chapter). Thus λ4(Ν, (υι,... ,vn)) is a block diagonal matrix whose blocks have the form "01 0 * "·. 1 0 0 and the largest such block in an (τη + l)-by-(m + 1) matrix. For each Vj, we see that Nm+lVj = 0. Because Nm+l equals 0 on a basis of V, we conclude that Nm+l = 0. Thus the minimal polynomial of N most divide zm+i ^y 8.34) and hence must be of the form zk for some к < m + 1. But there is a basis vector Vj such that N^-Vj ч= Vj-m φ 0. Thus Nm φ 0, which implies that the minimal polynomial of ./V equals zm+l. 30. Suppose V is a complex vector space and Τ € C[V). Prove that there does not exist a direct sum decomposition of V into two proper subspaces invariant under Τ if and only if the minimal polynomial of Τ is of the form (z-\)d'mV for some λ € С. Solution: First suppose that there does not exist a direct sum decomposition of V into two proper subspaces invariant under T. Thus Τ has only one eigenvalue (by 8.23), which we will call λ.
110 Chapter 8. Operators on Complex Vector Spaces There is a Jordan basis of Τ (by 8.47), meaning that with respect to this basis Τ has a block diagonal matrix Ai 0 0 Am where each Aj is an upper-triangular matrix of the form λ 1 0 Aj = 0 1 λ If m > 1, then we could let U denote the span of the basis vectors corresponding to A\ and let W denote the span of the basis vectors corresponding to U2, ■ ■., Um; we would have V = U® W, where U and W would be proper subspaces of V invariant under T. This contradiction shows that m = 1. Thus the matrix of Τ with respect to our Jordan basis is just A\. In other words, there is a basis (yi,..., v<am y) of V such that the matrix of Τ - XI with respect to this basis is 0 0 0 1 0 The previous problem now implies that the minimal polynomial of Τ - XI equals zdim v. This clearly implies that the minimal polynomial of Τ equals (z -X)dimV, as desired. To prove the implication in the other direction, suppose that the minimal polynomial of Τ equals (z - X)dimV. Suppose that there exist two proper subspaces J7i, U2 of V, each invariant under T, such that V = U\ φ С/г- Let pi denote the minimal polynomial of T\u1 and рг denote the minimal polynomial of T\u2. If ui € U\, then (ΡΙΡ2)(Ϊ>1=Ρ2(Ϊ>1(2>1 = 0. Similarly, if ич € U2, then
Ill Chapter 8. Operators on Complex Vector Spaces 31. {piP2)(T)u2 = Pi(T)p2(T)u2 = 0. Because every vector in V can be written in the form щ +U2, where щ € U\ and U2 € U2, the equations above imply that (p\P2)(T) = 0. Thus the minimal polynomial of T, which equals (z - X)dmxV, is a divisor of рда (by 8.34). The degree of the monic polynomial p\p2 equals the degree of pi plus the degree of P2, which is less than or equal to dimJ7i + dim[/2, which equals dim V. Because (z - X)dhaV is a divisor of P1P2, this implies that pl(z)P2(z) = (z-\f™v. This implies that Pl(z) = (z - X)dlmUi and p2(z) = (z - X)dimU\ Let m = max{dimJ7i,dimJ72}. Let p(z) = (z - X)m. The equations above show that ρ(Τ)\υι = 0 and p(T)\u2 = 0. Thus p(T) = 0. Because the degree of ρ is less than dim V, this contradicts our hypothesis that (z-X)dlmV is the minimal polynomial of T. This contradiction means that our assumption that there exists a direct sum decomposition of V into two proper subspaces invariant under Τ must have been false, as desired. Suppose Τ € C{V) and (υι,...,υη) is a basis of V that is a Jordan basis for T. Describe the matrix of Τ with respect to the basis (υη,..,υι) obtained by reversing the order of the v's. Solution: The matrix of Τ with respect to (υι,, agonal matrix Ai 0 0 Am where each Aj is an upper-triangular matrix of the form Xj 1 0 . ,vn) is a block di- Aj = 1 Xj
112 Chapter 8. Operators on Complex Vector Spaces Temporarily fix j, and let (ui,..., Uk) be the part of (v\,..., vn) corresponding to the block Aj. Thus Tu\ = XjUi TU2 =U\ + \jU2 Tuz = u2 + Aju3 Тик = ujt_i +XjUk. Write the equations above in the form Тик = \jUk +Uk-i Tuk-l = XjUk-l +Uk-2 TU2 = AjU2 + Ul Tu\ = \jU\. Thus the matrix of Т^ра^щ,...,Ufc) ^vit-h respect to (ujt,... ,щ) is 0 1 Xj In other words, Bj is obtained from Aj by reflection across the diagonal, so in Bj the 1 's he below the diagonal instead of above it. Now we see that the matrix of Τ with respect to (vn,... ,v\) is the block diagonal matrix 0 Bi Bj =
Chapter 9 Operators on Real Vector Spaces Prove that 1 is an eigenvalue of every square matrix with the property that the sum of the entries in each row equals 1. Solution: Suppose that A is an n-by-n matrix such that the sum of the entries in each row of A equals 1. Let χ be the n-by-1 matrix all of whose entries equal 1. Then from the definition of matrix multiplication we see that the entry in row j, column 1 of Ax equals that sum of the entries in row j of A, which equals 1. Thus Ax = x, which implies that 1 is an eigenvalue of A. Consider a 2-by-2 matrix of real numbers A = а с b d Prove that A has an eigenvalue (in R) if and only if (a - d)2 + Abe > 0. Solution: A number λ € R is an eigenvalue of A if and only if there exist numbers x, у € R, not both 0, such that α с b d X . У . = λ χ . У . Q££ -U (ΥΠ The left side of this equation equals [ , ], so the equation above is equivalent to the system of equations 113
114 Chapter 9. Operators on Real Vector Spaces 3. (a - λ)χ + су = О Ъх + (d - Х)у = 0. It is easy to see that this system of equations has a solution other than x = У = 0 if and only if (a - X)(d - λ) = be, which is equivalent to the equation X2-(a + d)X + (ad-bc) = 0. There is a real number λ satisfying the equation above if and only if (a + d)2 - 4(ad - be) > 0. The left side of the inequality above equals (a - d)2 + 4bc, and thus we conclude that A has a real eigenvalue if and only if (a - d)2 + Abe > 0. Suppose A is a block diagonal matrix Ai 0 A = 0 where each Aj is a square matrix. Prove that the set of eigenvalues of A equals the union of the eigenvalues of A\,..., Am. Solution: Suppose that Aj has size rij-by-rij. Let x\ χ = If where each Xj is an rij-by-l matrix. Then Aixi Ax = sLmXri Thus the equation Ax = Xx is equivalent to the system of equations
115 Chapter 9. Operators on Real Vector Spaces A\X\ = Xxi A-mXxn — AXm. Suppose that λ is an eigenvalue of A. Then there is a nonzero vector χ satisfying the system of equations above. Because χ is nonzero, there exists к such that xjt is nonzero. Because A^Xk = X%k, this implies that λ is an eigenvalue of Ajt- Thus λ is in the union of the eigenvalues of A\,..., Am, as desired. To prove the implication in the other direction, suppose now that λ is in the union of the eigenvalues of Ai,..., Am. Then there exists к such that λ is an eigenvalue of A^. Thus there exists a nonzero n^-by-l vector Xk such that AfcXfc = Axjfc. For j φ к, define Xj to be the rij-by-1 matrix all of whose entries equal 0, and define χ to be the matrix determined by xi,... ,xm as above. Then χ is nonzero (because xjt is nonzero) and Ax = λχ, which shows that λ is an eigenvalue of A, as desired. 4. Suppose A is a block upper-triangular matrix Ai * A = 0 Am where each Aj is a square matrix. Prove that the set of eigenvalues of A equals the union of the eigenvalues of A\,..., Am. Comment: Clearly Exercise 4 is a stronger statement than Exercise 3. Even so, students may want to do Exercise 3 first because it is easier than Exercise 4. Solution: We will prove that 0 is an eigenvalue of A if and only if 0 is an eigenvalue of at least one of the Ak's. This is all we need to do, because for arbitrary λ € F, we can replace A with A - XI and each Ak with Ak - XI, concluding that λ is an eigenvalue of A if and only if λ is an eigenvalue of at least one of the A^'s. This last statement implies that the set of eigenvalues of A equals the union of the eigenvalues of A\,..., Am. Suppose that A has size n-by-n and each Aj has size ny-by-ny. We can write a typical n-by-1 matrix χ in the form Xl χ = Xt,
116 Chapter 9. Operators on Real Vector Spaces where each Xj is an ny-by-l matrix. The product Ax can be computed by multiplying together the block matrices of A and χ given above, using the same formula as one would use when multiplying matrices of numbers (this follows from the definition of matrix multiplication). First suppose that 0 is an eigenvalue of A. Thus there exists a nonzero n-by-1 matrix χ such that Ax = 0. Write χ in the form above, and let к be the largest index with nonzero x^; thus we can write ' x\ 0 (If к = τη, then the 0's shown above at the tail of χ do not appear.) If we break Ax into blocks of the same size as was done for x, then the fcth block of Ax will equal A^Xk', this follows from the block upper-triangular form of A and the 0's that appear in χ after the fcth block. But Ax = 0, so the fcth block of Ax equals 0, so A^Xk = 0. Because Xk φ 0, this implies that 0 is an eigenvalue of Ak, as desired. To prove the implication in the other direction, suppose now that 0 is an eigenvalue of some Ak- This means that the operator on Mat(n, 1,F) (the vector space of n-by-1 matrices) that sends Xk G Mat(n, 1, F) to AkXk is not injective. Thus this operator is not surjective (by 3.21). Thus the operator on Mat(ni Η l· Пк, 1, F) that sends X\ to A\ * _ 0 Ak is not surjective (because the last block in the product above will be AkXk, which cannot be an arbitrary n^-by-l matrix). Again using 3.21, this means that the last operator is not injective. In other words, there exists a nonzero vector χι Xk
117 Chapter 9. Operators on Real Vector Spaces x\ Xk €Mat(m +--- + пь 1,F) such that Αχ 0 A* x\ Xk = 0. Adjoining an appropriate number of 0's, this implies that Ak Ajt+i X\ Xk 0 = 0. In other words, 0 is an eigenvalue of A, as desired. Suppose V is a real vector space and Τ € C(V). Suppose α, β € R are such that T2 + aT + βΐ = 0. Prove that Τ has an eigenvalue if and only if a2 > 4/3. Solution: First suppose that Τ has an eigenvalue λ € R. Thus there exists a nonzero vector υ € V such that Tv = Xv. Applying Τ to both sides of the last equation, we get T^v — X2v. Thus 0 = (Γ2 + aT + βΙ)υ = X2v + aXv + βν = (Χ2 + αΧ + β)ν. Because ν φ 0, the last equation implies that λ2 + αλ + β = 0, which implies (recall that λ, α, and β are all real) that a2 > 4/3, as desired. To prove the implication in the opposite direction, suppose now that a2 > 4/3. Then the polynomial χ2 + αχ+β has two real roots, which means that we can write
118 Chapter 9. Operators on Real Vector Spaces x2 + ax + β = (χ - г\) (χ - гг) for some π, гг € R. Thus 0 = Τ2 + αΤ+βΙ = (Τ - ΓιΙ)(Τ - r2I). In particular, (Τ - riI)(T - Γ2Ι) is not injectivc, which implies that at least one of Τ — r\I and Τ — ril is not injective. In other words, at least one of r\, гг must be an eigenvalue of T. Thus Τ has an eigenvalue, as desired. 6. Suppose V is a real inner-product space and Τ € C{V). Prove that there is an orthonormal basis of V with respect to which Τ has a block upper- triangular matrix A\ * where each Aj is a 1-by-l matrix or a 2-by-2 matrix with no eigenvalues. Solution: We know that there is a basis (υι,..., vn) of V with respect to which the matrix of Τ has the block upper-triangular form above, where each Aj is a 1-by-l matrix or a 2-by-2 matrix with no eigenvalues (see 9.4). Apply the Gram-Schmidt procedure to (v\,..., vn), getting an orthonormal basis (ei,..., en) of V such that span(ui,... ,vj) = span(ei,...,ej) for j = Ι,.,.,η (see 6.20). This condition on the spans implies that the matrix of Τ with respect to (ei,...,en) is also a block upper-triangular of the form above, where each Aj is a 1-by-l matrix or a 2-by-2 matrix (these Aj's may differ from the Aj's corresponding to the matrix of Τ with respect to (υι,..., vn)). All that remains is to show that, if necessary, we can modify our orthonormal basis so that none of the 2-by-2 blocks on the diagonal have eigenvalues. Suppose that Aj is a 2-by-2 block on the diagonal of the matrix of Τ with respect to (ei,...,en) and that Aj has an eigenvalue λ. Thus there exist x,y € R, not both 0, such that X У = X У
119 Chapter 9. Operators on Real Vector Spaces Let ejt, ejt+i denote the basis vectors corresponding to Aj. Then, as is easy to verify, T(xek + yek+i) = u + X(xek + yek+i) for some и € span(ei,..., ejt-i). Let = xek + yek+i Jk ||xefc + yefc+ill and choose Д+ι € V such that (fk, fk+i) is orthonormal and span(/fc, fk+i) = span(efc,efc+i). Then the matrix of Τ with respect to the orthonormal basis (ei,. ..,ejt-i,/fc,/fc+iiefc+2i··· a) will be a block upper-triangular matrix with the same entries on the diagonal as previously, except that Aj will be replaced by the 2-by-2 matrix λ * ' 0 * " In other words, where we had Ay on the diagonal, we can now think of having two 1-by-l matrices on the diagonal (and we still have a block upper- triangular matrix because of the 0 in the lower-left entry of the matrix above). Repeating, when necessary, the procedure described above, we obtain an orthonormal basis of V with respect to which Τ has a block upper-triangular matrix A\ * where each Aj is a 1-by-1 matrix or a 2-by-2 matrix with no eigenvalues. 7. Prove that if Γ € C(V) and j is a positive integer such that j < dim V, then Τ has an invariant subspace whose dimension equals j — 1 or j. SOLUTION: Suppose Τ € C(V) and j is a positive integer such that j < dim V. If V is a complex vector space, then Τ has an invariant subspace
120 Chapter 9. Operators on Real Vector Spaces whose dimension equals j (by Exercise 17 in Chapter 5). So we can assume that V is a real vector space. By 9.4, there is a basis (υι,..., vn) of V with respect to which Τ has a block upper-triangular matrix A\ * where each A^ is a 1-by-l matrix or a 2-by-2 matrix. Either Vj-\ or Vj must be the last vector in a block of vectors corresponding to some A^. Thus either span(ui,..., i>j-i) or span(ui,..., Vj) must be invariant under T. 8. Prove that there does not exist an operator Τ € £(R7) such that Τ2 +Τ + Ι is nilpotent. Solution: Suppose Τ € £(R7). Prom part (b) of 9.9 (combined with 9.4, which insures that Τ has a matrix of the form 9.10 with respect to some basis of R7), we see that dimnuU(r2 + T + /)7 must be an even integer. In particular, dimnuu(!r2 + Τ + Ι)7 φ 7, which implies that null(T2 + Τ +1)7 φ R7, which implies that (Τ2 + Τ +1)7 φ 0, which implies that Τ is not nilpotent (by 8.8). 9. Give an example of an operator Τ € £(C7) such that T2 + T+I is nilpotent. Solution: Let λ = (-1 + 3i)/2. Define Τ € £(C7) by Τ = XI. Then T2 + Τ + / = (λ2 + λ + 1)/ = 0. In particular, Τ2 + Τ + I is nilpotent. 10. Suppose V is a real vector space and Τ € C(V). Suppose α, β € R are such that a2 < 40. Prove that null(r2 + aT + /3/)fc has even dimension for every positive integer k. Solution: Let к be a positive integer, and let U = mill(T2+aT+fiI)k. We need to prove that dim U is even.
121 Chapter 9. Operators on Real Vector Spaces Because U is invariant under Τ (by 8.22, with p(x) = (x2 + ax + /3)fc), we can define S € C(U) by S = Τ\υ· Clearly (S2 + aS + pi)k is the 0 operator on U. Thus S2 + aS + βΐ is a. nilpotent operator on U, which implies that nuU(52 +aS +pi)dlmU = U (by 8.8). Now part (b) of 9.9, applied to S and U instead of Τ and V, shows that dirnC/ is an even integer, as desired. 11. Suppose V is a real vector space and Τ € C(V). Suppose α, β € R are such that a2 < 4/3 and T2 + aT + βΐ is nilpotent. Prove that dim V is even and (T2+aT+/3/)dimV//2 = 0. Solution: Let S = T2 + aT + βΐ. Because S is nilpotent, there is a smallest positive integer m such that S™1 = 0. Thus {0} = nullS0 С nullS С nullS2 С ...С nuUSm = V, 1 J + + + + where the proper inclusions come from 8.5. The previous exercise states that each nullSfc has even dimension (in particular V, which equals nullS"1, has even dimension). Hence the dimension must increase by at least 2 in all the proper inclusions above. Thus dim S"1 > 2m, which clearly implies that m < (dim V)/2. Because S™ = 0, this implies that S6™ vl2 = 0, as desired. 12. Prove that if Г € £(R3) and 5,7 are eigenvalues of T, then Τ has no eigenpairs. Solution: Suppose that Τ € £(R3) and 5,7 are eigenvalues of T. Of course each of these two eigenvalues must have multiplicity at least 1. By 9.17, the sum of the multiplicities of all the eigenvalues of Τ plus twice the sum of the multiplicities of all the eigenpairs of Τ equals 3. Because the sum of the multiplicities of all the eigenvalues of Τ is at least 2, there is no room for an eigenpair, which would add at least 2 more to the sum (because we take twice the sum of the multiplicities of all the eigenpairs). Thus Τ has no eigenpairs. 13. Suppose V is a real vector space with dim V = η and Τ € C{V) is such that nulir^VnullT"-1. Prove that Τ has at most two distinct eigenvalues and that Τ has no eigenpairs. Solution: Because пи]1Тп~2 ф null Τ"-1, we see that dim null Tj is at least 1 more than dim null T·7-1 for j = 1,... ,n - 1 (by 8.5). Thus
122 Chapter 9. Operators on Real Vector Spaces dim null T"1-1 > η - 1. In particular, 0 is an eigenvalue of Τ with multiplicity at least η — I. Because the sum of the multiplicities of all the eigenvalues of Τ plus the sum of twice the multiplicities of all the eigenpairs of Τ equals η (by 9.17), this implies that Τ can have no eigenpairs and at most one additional eigenvalue. 14. Suppose V is a vector space with dimension 2 and Τ € C(V). Prove that if а с b d is the matrix of Τ with respect to some basis of V, then the characteristic polynomial of Τ equals (z - a)(z - d) - be. Comment: As usual unless otherwise specified, here V may be a real or complex vector space. Solution: Let q(z) = (z - a)(z - d) - be. If {νγ,υ-ΐ) is the basis of V with respect to which Τ has the matrix above, then Tv\ = av\ + bi)2 and Тщ = evi + dvi. Prom these equations you can easily verify that g(T)vi = 0 and q(T)t)2 = 0. Because q(T) is 0 on a basis of V, we conclude that q{T) = 0. Because q is a monic polynomial of degree 2 and q(T) = 0, we conclude that the minimal polynomial of Τ has degree 1 or 2. Suppose first that the minimal polynomial of Τ has degree 2. Because the minimal polynomial of Γ is a divisor of q (by 8.34), and because a monic polynomial of degree 2 can be a divisor of another monic polynomial of degree 2 only if the two polynomials are equal, we conclude that q is the minimal polynomial of T. The Cayley-Hamilton theorem now implies that q is a divisor of the characteristic polynomial of T, which is also a monic polynomial of degree 2. This implies that q is the characteristic polynomial of 7\ as desired. Now consider the only remaining possible case, which is that the minimal polynomial of Τ has degree 1, meaning that it equals ζ — λ for some λ € F. This implies that Τ = λ/, which implies that the characteristic polynomial of Τ equals (ζ—λ)2. Because Τ = XI, we must have α = d = X and b = с = 0. Thus q(z) = (z - λ)2. In particular, q is the characteristic polynomial of T, as desired. Comment: Note that we did not need to find the eigenvalues of Γ to do this exercise.
123 Chapter 9. Operators on Real Vector Spaces 15. Suppose V is a real inner-product space and S € C(V) is an isometry. Prove that if {α, β) is an eigenpair of S, then /3=1. Solution: There is a basis of V with respect to which S has a block diagonal matrix, where each block on the diagonal is a 1-by-l matrix or a 2-by-2 matrix of the form cos θ -sin θ sin θ cos θ ' with θ € (0, π) (see 7.38). The characteristic polynomial of the matrix above is (x - cos Θ)2 + sin2 Θ, which equals x2 — 2(cos θ)χ + 1. If (α, β) is an eigenpair of S, then dim null(52 + aS + /3/)<Um v > 0, which implies that x2 + ax + β is the characteristic polynomial of a 2-by-2 matrix of the form displayed above (see 9.9). Thus /3=1.
Chapter 10 Trace and Determinant 1. Suppose that Τ € C{V) and (vi,...,vn) is a basis of V. Prove that Л4(Т, (v\,... ,vn)) is invertible if and only if Τ is invertible. Solution: First suppose that M(T) is an invertible matrix (because the only basis is sight is (v\,..., vn), we can leave the basis out of the notation). Thus there exists an n-by-n matrix В such that M(T)B = BM{T) = I. There exists an operator S € C(V) such that M(S) = В (see 3.19). Thus the equation above becomes M(T)M(S) =M(S)M(T) = I, which we can rewrite as M{TS) = M{ST) = M(I), which implies that TS = ST = I. Thus Τ is invertible, as desired, with inverse S. To prove the implication in the other direction, suppose now that Τ is invertible. Thus there exists S € C(V) such that TS = ST = I. This implies that 124
125 Chapter 10. Trace and Determinant M{TS) = M{ST) = M(I), which implies that M(T)M(S) = M(S)M(T) = I. Thus M(T) is invertible, as desired, with inverse M(S). 2. Prove that if A and В are square matrices of the same size and AB = I, then В A = I. Solution: Suppose that A and В are n-by-n matrices and AB = I. There exist S, Τ € £(Fn) such that M{S) = A and M(T) = B; here we are using the standard basis of Fn (the existence of S, Τ € £(Fn) satisfying the equations above follows from 3.19). Because AB = I, we have M(S)M(T) = I, which implies that M(ST) = M(I), which implies that ST = I, which implies that TS = I (by Exercise 23 in Chapter 3). Thus В А = M(T)M(S) = M(TS) = M(I) = 1. 3. Suppose Τ € C(V) has the same matrix with respect to every basis of V. Prove that Γ is a scalar multiple of the identity operator. Solution: We begin by proving that (υ, Τν) is linearly dependent for every υ € V. To do this, fix υ € V, and suppose that (υ, Τυ) is linearly independent. Then (v, Tv) can be extended to a basis (v, Tv, щ,..., u„.) of V. The first column of the matrix of Τ with respect to this basis is ' 0 " 1 0 _ 0 _ Clearly (2v, Tv, щ,..., Un) is also a basis of V. The first column of the matrix of Τ with respect to this basis is
126 Chapter 10. Trace and Determinant 0 2 0 0 _ Thus Τ has different matrices with respect to the two bases we have considered. This contradiction shows that (υ, Τυ) is linearly dependent for every ν € V. This implies that for every vector in V is an eigenvector of T. This implies that Γ is a scalar multiple of the identity operator (by Exercise 12 in Chapter 5). 4. Suppose that (щ,..., un) and (υι,..., vn) are bases of V. Let Τ € £(V) be the operator such that Tvk = Uk for к = 1,..., η. Prove that Μ (Τ, (υι,..., vn)) = Μ (I, (щ,..., tin), (v\,. · ·, vn)). Solution: Fix к. Write uk = αιυι Η hanvn, where a\,...,On € F. Because Tv^ = щ, the fcth column of the matrix M. (T, (v\ ,..., vn)) consists of the numbers a\,..., an. Because 1щ = «jt> the fcth column of Μ (I, (щ,..., un), (υι,..., vn)) also consists of the numbers αϊ,... ,αη. Because M(T, (νι,...,νη)) and M(I, {щ,... ,un),{vi, ...,vn)) have the same columns, these two matrices must be equal. 5. Prove that if β is a square matrix with complex entries, then there exists an invertible square matrix A with complex entries such that A~lBA is an upper-triangular matrix. SOLUTION: Suppose β is an n-by-n matrix with complex entries. Let (ei,... ,en) denote the standard basis of Cn. There exists Τ € £(Cn) such that M(T, (ei,..., e„)) = В (see 3.19). There is a basis (υι,..., vn) of V such that M. (T, (v\,..., vn)) is an upper- triangular matrix (see 5.13). Let A = Λ4((ν\,. ..,vn), (e\,... ,en))- Then A is invertible (by 10.2) and A-lBA = A-lM{T,{ei,...,en))A = M(T,(vu...,vn)),
127 Chapter 10. Trace and Determinant where the second equality comes from 10.3. Thus A~XBA is an upper- triangular matrix. 6. Give an example of a real vector space V and Τ € C(V) such that trace(T2) < 0. Solution: Define Τ € £(R2) by T(x,y) = (-y,x). Then Γ2 = -/, so traceiT2) = -2. 7. Suppose V is a real vector space, Τ 6 C(V), and V has a basis consisting of eigenvectors of T. Prove that trace(T2) > 0. Solution: Let (v\,..., vn) be a basis of V consisting of eigenvectors of T. Thus there exist λι,...,λη 6 R such that Tv3- = XjVj for each j. Clearly the matrix of T2 with respect to the basis (v\,..., vn) is the diagonal matrix "λ,2 0 " _ 0 λη2 Thus trace Τ2 = λι2 + · · · + λ„2 > 0. 8. Suppose V is an inner-product space and v, w € C{V). Define Τ € C(V) by Tu = (u, v)w. Find a formula for trace T. SOLUTION: First suppose that υ φ 0. Extend (тглт) to an orthonormal basis (A,ei,... ,en) of V. Note that for each j, we have Tej = 0 (because (ej,v) = 0). The trace of Τ equals the sum of the diagonal entries in the matrix of Τ with respect to the basis (Л, ei,..., en). Thus tracer = (Пщ), щ) + (Τβχ,βχ) + ■■■ + (Te^e») = (w,v). If ν = 0, then Τ = 0 and so trace Γ = 0 = (w,v). Thus we have the formula
128 Chapter 10. Trace and Determinant traceT = (го, υ) regardless of whether or not ν = 0. 9. Prove that if Ρ 6 C(V) satisfies P2 = P, then trace Ρ is a nonnegative integer. Solution: Suppose that Τ € C{V) satisfies P2 = P. Let (щ,... ,ιι™) be a basis of range Ρ and let {v\,..., vn) be a basis of null P. Then (Ul,...,lim,l'i,...,l'n) is a basis of V (this holds because V = range Γ θ null Γ; see Exercise 21 in Chapter 5). For each Uj we have Pu3- = щ and for each Vk we have Pi>k = 0. Thus the matrix of Ρ with respect to the basis above of V is a diagonal matrix whose diagonal contains m l's followed by η 0's. Thus trace Ρ = m, which is a nonnegative integer, as desired. In fact, we have shown that trace Ρ = dim range P. 10. Prove that if V is an inner-product space and Τ € C(V), then traceT* = traceT. Solution: Suppose that V is an inner-product space and Τ € £{V). Let (ei,..., en) be an orthonormal basis of V. The trace of any operator on V equals the sum of the diagonal entries on the matrix of the operator with respect to this basis. Thus trace Γ* = (Γ·βι,βι> + ··· + (Γ*βη, en) = (ei,Tei>+--- + (en,ren> = (Tei,ei)+·*·· +(Ten, e„) = (rei,ei>+--- + (Ten,en> = traceT1. 11. Suppose V is an inner-product space. Prove that if Τ € C(V) is a positive operator and trace Τ = 0, then Τ = 0. Solution: Suppose Τ € C{V) is a positive operator and traceT = 0. There exists an operator S € C{V) such that Τ = S*S (by 7.27). Let (ei,... ,en) be an orthonormal basis of V. Then
129 Chapter 10. Trace and Determinant 0 = traceT = (Tel,el)+--- + (Ten,en) = (S*Sel,el) + --- + (S*Sen,en) = \\Sel\\2 + --- + \\Sen\\2. The equation above implies that Sej = 0 for each j. Because S is 0 on a basis of V, we have 5 = 0. Because Τ = S*S, this implies that Τ = 0. 12. Suppose Τ € £(C3) is the operator whose matrix is " 51 -12 -21 ' 60 -40 -28 . 57-68 1 Someone tells you (accurately) that -48 and 24 are eigenvalues of T. Without using a computer or writing anything down, find the third eigenvalue of T. Solution: The sum of the eigenvalues of Τ equals the sum of the diagonal terms of the matrix above (both quantities equal trace T). The sum of the diagonal terms of the matrix above equals 12. The sum of two of the eigenvalues of T, -48 and 24, equals -24. Because the sum of all three eigenvalues of Τ must equal 12, the third eigenvalue of Τ must be 36. 13. Prove or give a counterexample: if Τ € C(V) and с € F, then trace(cT) = с trace T. Solution: Suppose Τ € C{V) and с € F. To prove that trace(cT) = с trace T, consider a basis of V. Then trace Τ equals the sum of the diagonal terms of the matrix of Τ with respect to this basis. The matrix of cT, with respect to the same basis, equals с times the matrix of T. Thus the sum of the diagonal terms of the matrix of cT equals с times the sum of the diagonal terms of the matrix of T. In other words, trace(cT') = с trace T. 14. Prove or give a counterexample: if S,Τ € C(V), then trace(5T) = (trace S) (trace T). Solution: Define S, Τ € £(F2) by S(x,y) = T(x,y) = (-y,x). Then with respect to the standard bases the matrix of S (which of course equals the matrix of T) is
130 Chapter 10. Trace and Determinant 0 -1 1 0 ' Thus traceS = trace Γ = 0. However, ST = -I, so trace ST = -2. Thus for this choice of S and T, we have trace(ST) φ (trace S) (trace T). Of course there are also many other examples. 15. Suppose Τ € C(V). Prove that if trace(5TJ = 0 for all S € C(V), then T = 0. Solution: Suppose that trace(5T') = 0 for all S € C(V). Then trace(TS) = 0 for all S € C(V) (by 10.12). Suppose that there exists υ € V such that Tv φ 0. Then (Tv) can be extended to a basis (Tv, щ,..., Un) of V. Define S € C(V) by S(aTv + b\u\ Η 1- bnun) = av. Thus S(Tv) = υ and Suj = 0 for eachj. Hence (TS)(Tv) = T(S{Tv)) = Tv and (TS)(uj) = 0 for each j. This implies that with respect to the basis (Tv,u\,... ,un), the matrix of TS consists of all 0's except for a 1 in the upper-left corner. Thus trace(r5) = 1. This contradiction shows that our assumption that Tv φ 0 must have been false. Thus Tv = 0 for every ν € V, which means that Τ = 0. 16. Suppose V is an inner-product space and Τ € C(V). Prove that if (ei,..., e„) is an orthonormal basis of V, then trace(r*r) = ||rei||2 + --- + ||ren||2. Conclude that the right side of the equation above is independent of which orthonormal basis (ei,..., en) is chosen for V. Solution: Suppose that (ei,..., en) is an orthonormal basis of V. Then trace T*T = (T*Tei,ei) +■■■ + (T'Ten,^) = (Te1,Te1) + --- + (Ten,Ten) = \\Te1\\2 + --- + \\Ten\\2. Because trace T*T does not depend upon the choice of a basis of V, the formula above shows that ||Tei||2 + ■ ■ ■ + ||Ге„||2 is independent of the orthonormal basis (ei,...,en).
131 Chapter 10. Trace and Determinant 17. Suppose V is a complex inner-product space and Τ € C(V). Let Ai,..., λη be the eigenvalues of T, repeated according to multiplicity. Suppose αι,ι <4,n On,l ··· an,n is the matrix of Τ with respect to some orthonormal basis of V. Prove that \Μ\2 + - + \\η\2<ΥΣ\α^\2· Solution: Suppose that (ei,...,en) is the orthonormal basis with respect to which Τ has the matrix above. Thus for each k, we have which implies that Thus Тек = alike\ -i 1- anifce„, ||Tefc||2 = |a1,^ + ..- + |anifc|^. ||Γβι||2 + ·.· + ||Γ^||2 = 53Σ|αΜ|2. fc=l j=l By the previous exercise, the left side of this equation equals trace(T'*T'). This reduces the exercise at hand to proving that |А1|2 + --- + |А„|2<1гасе(ГГ). There is an orthonormal basis (/i,..., /n) with respect to which Τ has an upper-triangular matrix (by 6.28). The diagonal entries of the matrix of Τ with respect to (/i,. ·., /n) axe precisely Ai,..., λη (by 8.10), where we can relabel the eigenvalues of Τ so that they appear in the order λι,..., A„ along the diagonal. In other words, M{T,(fu- .·,/„)) = λι 0 A„ From the matrix above, we see that
132 Chapter 10. Trace and Determinant μ*|2 < ЦГЛЦ2 for each k. Thus |λ,|2 + ··· + |λη|2<||Τ/,||2 + ··· + ||Γ/η||2 = trace(r*T), as desired (here the last equality comes from the previous exercise). 18. Suppose V is an inner-product space. Prove that (S,T) =trace(5T*) defines an inner product on C(V). Solution: Suppose that (·,·) is defined as above and R,S,T € C(V). Then (T,T) = trace(TT*), and thus (T,T) > 0 (by the formula given in Exercise 16 of this chapter, with Τ replaced with T*). Because TT* is a positive operator (see 7.27), we also see that (T, T) = 0 if and only if Τ = 0 (by Exercise 11 of this chapter). Now (R + S,T) = trace((i? + S)T*) = trace(flT* + ST) = trace(RT*) + trace(5T*) = (R,T) + (S,T), where the third equality comes from 10.12. For с € F, we have (cS, T) = trace(c5T*) = с trace(ST*) = c(S,T), where the second equality comes from Exercise 13 of this chapter. Finally, (S,T) = trace(Sr*) = trace((Sr*)*) = trace(T5*)
133 Chapter 10. Trace and Determinant where the second equality comes from Exercise 10 of this chapter. We have shown that (·, ·) satisfies all the properties required of an inner product. Comment: Suppose (ei,... ,βη) is an orthonormal basis of V and 0.1,1 ■■■ ai,n αη,ι ... αΠιη is the matrix of Τ with respect to this basis. Then (T,T> = trace(3T*) = trace(T*T) k=l j=\ where the second equality comes from 10.12 and the third equality comes ■from Exercise 16 of this chapter. Thus the norm on C(V) induced by (-,-) is the same as the standard norm on Fn (here we are identifying each operator with its matrix, which has n2 entries). Because norms determine the inner product (see Exercises 6 and 7 in Chapter 6), this means that the inner product (·, ■) is the same as the standard inner product of Fn (again using the identification via matrices with respect to the orthonormal basis (ei,...,e„)). 19. Suppose V is an inner-product space and Τ € C(V). Prove that if llT'till < ΙΙΓ11ΙΙ for every υ € V, then Τ is normal. Solution: Suppose that ||Γ*ν|| < ||Γυ|| for every υ € V. Suppose и € V with ||u|| = 1. Extend (u) to an orthonormal basis (u,ei,...,en) of V. Then traced) = ||7"u||2 + ||Г'в11|2 + · · · + ||T*en||2 <||Ти||2 + ||Ге1||2 + --- + ||Геп||2 = trace(T*T) = trace(7T*),
134 Chapter 10. Trace and Determinant where the first and third lines come from Exercise 16 of this chapter and the last line comes from 10.12. Because the first and last lines above are equal, we must have equality throughout. Thus ||T*u|| = \\Tu\\. This clearly implies that \\T'(au)\\ = \\T(au)\\ for every a G F. Because every vector in V can be written in the form au for some α G F and some и G V with ||u|| = 1, this implies that \\T*v\\ = \\Tv\\ for every υ G V. This implies that Τ is normal (by 7.6). Comment: This exercise fails on infinite-dimensional inner-product spaces, leading to what are called hyponormal operators, which have a well- developed theory. 20. Prove or give a counterexample: if Τ G C[V) and с G F, then det(cT) = cdimUdetT. Solution: Let η = dim V. If A is an n-by-n matrix, then det(cA) = cndetA for every с G F (this follows immediately from the definition 10.25). Now suppose that Τ G C(V). Because det Τ equals the determinant of the matrix of Τ with respect to any basis (see 10.33), the equation above implies that for every с G F we have det(cT) = detM(cT) = dei{cM[T)) = cn det M(T) = c"detT, as desired. 21. Prove or give a counterexample: if S, Τ G £(V), then det(5 + T) = det S + det T. Solution: Define S, Τ G £(F2) by S(x,y) = (x,0) and T(x,y) = (0,y). Then S and Τ are both not invertible. Thus det S = det Γ = 0 (by 10.14). However, S + Τ = I and det/ = 1, so det(5 + Τ) φ detS + detT.
135 Chapter 10. Trace and Determinant Of course there are also many other examples 22. Suppose A is a block upper-triangular matrix A\ * A = 0 where each Aj along the diagonal is a square matrix. Prove that detA = (detAi)...(detAm). Solution: First consider the case where m = 2, so we can write A in the form A = В 0 * С where В is an n-by-n matrix В = and С is a p-by-p matrix C = bi.i bn,l cl,l °РЛ Ί,η bn,n C1,P °P,P Let Qjjt denote the entry in row j, column к of A. Note that a3·^ = 0 if j > η and k<n (this follows from the block upper-triangular form of A). Because A is an (n +p)-by-(n +p) matrix, to compute det A we need to consider a typical permutation (mi,..., nin+p) £ perm(n + p). If any of mi,..., пц is greater than n, then Thus in computing det A we need only consider permutations in which the first η coordinates all come from {Ι,.,.,η}, which means that the last ρ coordinates all come from {n + 1,..., η + ρ}. We can break any such permutation (mi,... ,тпп+р) into two pieces: a permutation of {1,... ,n} and (relabeling η + 1,..., η + ρ as 1,... ,p to correspond to the labeling of the
136 Chapter 10. Trace and Determinant entries of C) a permutation of {1,..., p}. Clearly the sign of (mi,..., mn+P) will equal the product of the signs of these two permutations. Putting all this together, we have detA= 22 (sign(mi,...,m„+p))amiil ...amn+pin+p (mi ,...,тпп+р)ерстт(п+р) Σ [(signO"i,...,yn))(sign(fci1...,fcP))· (Λ·—Jn)€permn (fci,...,fcp)€permp bjul ■ ■ -bjn.nCki.l ■ ■ -Ck^p] Σ (sign(ii,...,in))bj,,i--.bjnin· C7b--Jn)€pcrmn ^ (signCX:! *p))cfclli - - - cj^ (fcl,...,fcp)6permp = (det5)(detC), completing the proof when m = 2. Suppose now that m > 2 and P.P Writing we have Thus A = В = Αι 0 Αι 0 * "m * Άτη— 1 A = В * 0 Ат detA = (det#)(detAm) = (detAi)...(detAm_i)(detAm),
137 Chapter 10. Trace and Determinant where the first equality holds by the m = 2 case proved above and the second equality comes from induction on m (meaning that we can assume the desired result holds when m is replaced with m — 1). 23. Suppose Λ is an n-by-n matrix with real entries. Let S € £(Cn) denote the operator on Cn whose matrix equals A, and let Τ € £(Rn) denote the operator on Rn whose matrix equals A. Prove that trace S = trace Τ and det S = detT. Solution: The formulas defining the trace and determinant of a matrix do not depend upon whether we think of the matrix entries as real or complex numbers. We have traceA = traceS and traceA = tracer (by 10.11). Thus trace S = trace T. Similarly, we have det A = det S and det A = det Τ (by 10.33). Thus det S = detT. 24. Suppose V is an inner-product space and Τ € C(V). Prove that det Γ* = detT. Use this to prove that |det T\ = det y/T*T, giving a different proof than was given in 10.37. Solution: Let η = dim V. If λ € F, then ((T - XI)n)* = {T* - XI)n, which implies that dim nuU(T - λ/)η = dim null(T* - AJ)n, where we have used Exercise 31 in Chapter 7. The equation above shows that the eigenvalues of T* are precisely the complex conjugates of the eigenvalues of T, with the same multiplicities. If F = C, the determinant equals the product of the eigenvalues, counting multiplicity, so we have detT* = detT (so far just on complex vector spaces). Now suppose F = R, so we must consider eigenpairs. If α, β € R, then dim null(T2 + aT + βΙ)η = dim null((T*)2 + aT* + βΙ)η, again by Exercise 31 in Chapter 7. Thus Τ and T* have the same eigenpairs with the same multiplicities. From the paragraph above, Τ and T* have the same eigenvalues with the same multiplicities. Because the determinant equals the product of the eigenvalues (counting multiplicity) times the product of the second coordinates of the eigenpairs (counting multiplicity), this implies that det T* = det Τ = detT.
138 Chapter 10. Trace and Determinant At this point, we know that det T* = det Τ regardless of whether F = С or F = R. Thus (det yfFff = (det VT*T) (det y/T*T) = det(T*T) = (det T) (det T) = (ditT)(detT) = |det7f. Taking square roots (and recalling that the positive operator y/T*T has a nonnegative determinant), we have det VT*T = |det!T|, as desired. 25. Let a,b, с be positive numbers. Find the volume of the ellipsoid 2 2 2 {(s,y,s)€R3:^ + ^ + ^<l} by finding a set Ω с R3 whose volume you know and an operator Τ € £(R3) such that Γ(Ω) equals the ellipsoid above. Solution: Let Ε denote the ellipsoid defined above and let Ω be the ball with radius 1 defined by Ω = {(x,y, z) € R3 : x2 + y2 + z? < 1}. Of course volume Ω = |тг. Define Τ € £(R3) by T{x,y,z) = {ax,by,cz). If (x, y, ζ) € Ω, then Μ2 , (by)2 , (c*)2 · 2 о г <1, which shows that T(x, y, z) € E. Thus Γ(Ω) С Е. Conversely, if (χ, у, ζ) € Ε, then /Χ\2 /y\2 fZ\2 Χ2 у2 Ζ2 (α) +(!) +(с) =^ + Ь + 1* <1,
139 Chapter 10. Trace and Determinant which shows that (|, £, f) € Ω. Because T(§, £, f) = (х.у.г), this implies that Ε с Τ(Ω). The last two paragraphs show that Ε = Τ(Ω). Thus volume Ε = volume Τ(Ω) = |detT| (volume Ω) 4жаЬс where the second equality comes from 10.38.