Section 32.4 Concepts
In this section.
Subsection 32.4.1 Moving past triangular block form
In Discovery 32.1, we first reminded ourselves that similarity is transitive, and applied this fact with triangular block form as an intermediate form: if a matrix is similar to a triangular block matrix, and that triangular block matrix is similar to another (hopefully simpler) matrix, then the first matrix is also similar to the third matrix. So in our quest to determine a uniform answer to Question 28.1.1, we can attempt to move past triangular block form by concentrating on that form itself. As in Discovery 32.1, consider triangular block form matrix
T=[T1T2β±Tβ],
where each block Tj is a scalar-triangular form. We would like to construct a transition matrix Q so that Qβ1TQ is somehow even simpler in form than T. There are aspects of triangular block form that make it particularly simple, and we don't want to mess those up. But if we restrict ourselves to working with transition matrices
Q=[Q1Q2β±Qβ]
that are also in block-diagonal form, with the same number and sizes of blocks as T, then the algebra of block-diagonal matrices (Statement 1 of Proposition 28.6.1) tells us that
Qβ1TQ=[Qβ11T1Q1Qβ12T2Q2β±Qβ1βTβQβ].
So instead of focusing on how to make triangular block form simpler, we can focus on how to make each scalar-triangular block simpler.
A scalar-triangular matrix S can be decomposed as a sum of a scalar matrix and an upper triangular matrix,
S=Ξ»I+N,
where the upper triangular matrix has zeros down the diagonal. (See (β) in Subsection 31.3.2.) If Q is any transition matrix, then
Qβ1SQ=Qβ1(Ξ»I+N)Q=Ξ»Qβ1IQ+Qβ1NQ=Ξ»I+Qβ1NQ,
so again we can narrow our focus onto how to make matrices of the form of N simpler through similarity. And since an upper triangular matrix of this form is always nilpotent (Lemma 31.5.1), we should turn our attention to the similarity patterns of nilpotent matrices.Subsection 32.4.2 The similarity pattern of elementary nilpotent form
As uncovered in Discovery 32.2, nilpotent matrices can only be similar to other nilpotent matrices. And as discussed in Section 32.2, it might be fruitful to first examine the case of nilpotent matrices that are similar to elementary nilpotent form. We'll follow the example of Discovery 32.4 and consider the 4Γ4 case. Suppose that A is similar to
N=[0000100001000010].
Using Pattern 26.3.1, we know that Pβ1AP=N can only be true if
Ap1=0p1+1p2+0p3+0p4,Ap2=0p1+0p2+1p3+0p4,Ap3=0p1+0p2+0p3+1p4,Ap4=0p1+0p2+0p3+0p4,
where the pj are the columns of P, as usual. Rewriting this to make it a little more clear, we have
Ap1=p2,Ap2=p3,Ap3=p4,Ap4=0.
The effect of multiplication by A on the columns of P seems to follow a cyclic pattern, though the pattern ends in the zero vector instead of wrapping back around to p1. This pattern becomes more evident if we repeatedly apply A to p1.
Ap1=p2,A2p1=A(Ap1)=Ap2=p3,A3p1=A(A2p1)=Ap3=p4,A4p1=A(A3p1)=Ap4=0.
Of course, the pattern ended at A4p1=0 because A is 4Γ4 nilpotent (Statement 4 of Theorem 31.5.3). In particular, the fact that it ended in the zero vector tells us that the vector before that in the cycle must be in the null space of A.
It should be clear that the pattern will unfold in a similar manner for an nΓn matrix A that is similar to an elementary nilpotent form matrix.
Pattern 32.4.1. Similarity with an elementary nilpotent matrix.
For nΓn matrix A to be similar to an elementary nilpotent matrix N via similarity relation Pβ1AP, the columns p1,p2,β¦,pn of the transition matrix P must satisfy the pattern
pj+1=Apj=Ajp1,
where the last vector pn=Anβ1p1 must be in the null space of A.
Subsection 32.4.3 Elementary nilpotent procedure
According to Pattern 32.4.1, choosing the first column p1 of a transition matrix P automatically chooses all the other columns by
pj+1=Ajp1.
However, none of the columns of P may be zero, since we need P to be invertible. In particular, we need
pn=Anβ1p1β 0.
So while A must be nilpotent in order to be similar to a nilpotent matrix, and so will satisfy An=0 (Statement 4 of Theorem 31.5.3), we must have Anβ1β 0. This echoes Task 32.3.b, where we recognized that an nΓn elementary nilpotent matrix has maximum degree of nilpotency n. It follows that for A to be similar to an elementary nilpotent matrix, it must also have maximum degree of nilpotency (Proposition 31.5.2).
Note that condition (β) is enough to guarantee that all the other columns of P are nonzero, since if
pj=Ajβ1p1=0
were true for some j<n, then
Anβ1p1=Anβj(Ajβ1p1)=0
would be true as well.
Procedure 32.4.2. For realizing a similarity relation between a nilpotent matrix of maximum degree of nilpotency and an elementary nilpotent matrix.
Suppose A is a nilpotent nΓn matrix with Anβ1β 0. To determine a transition matrix P so that Pβ1AP is in elementary nilpotent form, choose a vector v in Rn (or Cn, as appropriate) so that Anβ1vβ 0, then form transition matrix P with columns
p1=v,p2=Av,p3=A2v,β¦,pn=Anβ1v.
A suitable choice can be to choose v to be a standard basis vector ej when the jth column of Anβ1 is nonzero.
Remark 32.4.3.
- Such a vector v as in the procedure must exist because the only nΓn matrix whose null space is all of Rn (or Cn, as appropriate) is the zero matrix. Since we have assumed Anβ1β 0, there must exist vectors that are not in the null space of Anβ1. In particular, if the jth column of Anβ1 is nonzero, then we could take v to be ej, the jth standard basis vector.
- While we have justified above that choosing v so that Anβ1vβ 0 will guarantee that none of the columns of P with be zero, this is not enough to guarantee that those columns will be linearly independent, which is required for P to be invertible. We leave this to be verified in Section 32.6.
Subsection 32.4.4 Cyclic subspaces
In Procedure 32.4.2, for the transition matrix P to be invertible it must be true that the vectors
v,Av,A2v,β¦,Anβ1v
are linearly independent and so form a basis of Rn (or Cn, as appropriate). It is reasonable to turn the βcyclicβ pattern exhibited by these basis vectors into a general concept, for all matrices instead of just for nilpotent ones.
Given an nΓn matrix A, a subspace W of Rn (or Cn, as appropriate) that can be described by a βcyclicβ spanning set as
W=Span{w,Aw,A2w,β¦}
is called an A-cyclic subspace, and the generating vector w is called an A-cyclic vector for W. We will explore the theory of A-cyclic subspaces in Section 32.6, but we will say a couple of things here.
First, being contained in a finite-dimensional space, the infinite spanning set for W cannot be linarly independent. So there will be a point in the sequence
w,Aw,A2w,β¦
where Akw is dependent with all the previous vectors, and then we will have
W=Span{w,Aw,A2w,β¦,Akβ1w}.
(See Lemma 32.6.2.)
Second, an A-cyclic subspace is always A-invariant. Every vector in such a subspace W is a linear combination of the cyclic spanning vectors Ajw, and multiplying such a linear combination by A will obviously still be a linear combination of those cyclic spanning vectors, just with each power of A in the linear combination increased by one.
Warning 32.4.4.
A subspace of Rn (or Cn, as appropriate) does not necessarily contain any A-cyclic vectors, even if it is A-invariant.
Subsection 32.4.5 Properties of elementary nilpotent matrices
In Discovery 32.3, we explored properties of elementary nilpotent matrices and their powers. Here we will simply summarize our two main discoveries, which are both a consequence of a simple observation: in powers of an elementary nilpotent matrix, the diagonal of ones marches down the sub-diagonals of the matrix, taking one step down per increase in exponent. Suppose N is an nΓn matrix in elementary nilpotent form.- Matrix N has maximum degree of nilpotency n. That is, the first power of N that becomes the zero matrix is the nth power.
- The rank of N is one less than full, nβ1, and ranks of powers of N decrease from there:rankN=nβ1,rankN2=nβ2,rankN3=nβ3,β¦,until we reach rankNn=0.