Section 34.3 Concepts
Subsection 34.3.1 Properties of Jordan normal form
First, let's visualize the overall form of a matrix in Jordan normal form:
As a Jordan normal form matrix is lower triangular, its diagonal entries are precisely the eigenvalues of the matrix, and the algebraic multiplicity of each eigenvalue is equal to the number of times that eigenvalue appears on the diagonal. In other words, the algebraic multiplicity of each eigenvalue is equal to the sum of the sizes of the Jordan blocks \(J_i(\lambda_j)\) for that eigenvalue, corresponding to the size of a “macro” block enclosed in a box in the example matrix above. If we started with a matrix in triangular block form, these “macro” blocks would be the “simplified” form of each block in the triangular block form. That is, a block in triangular block form will break up further into smaller Jordan blocks. (See Example 34.4.1 for an example.) Going further in this direction, notice that each Jordan block is in scalar-triangular form, and so can be decomposed into the sum of a scalar matrix and a matrix in elementary nilpotent form: \begin{equation*} \begin{bmatrix} \lambda \\ 1 & \lambda \\ & \ddots & \ddots \\ & & 1 & \lambda \end{bmatrix} = \lambda I + \begin{bmatrix} 0 \\ 1 & 0 \\ & \ddots & \ddots \\ & & 1 & 0 \end{bmatrix}\text{.} \end{equation*}
So an elementary nilpotent matrix is just the special case of a Jordan block that has eigenvalue \(0\text{.}\)
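To make this decomposition concrete, here is a minimal Python sketch (standard library only) that builds a lower-triangular Jordan block and checks that it is the sum of a scalar matrix and an elementary nilpotent matrix. The helper names `jordan_block` and `mat_mul`, and the sample size and eigenvalue, are our own choices for illustration and are not notation from this text.

```python
def jordan_block(eigenvalue, size):
    """Lower-triangular Jordan block: eigenvalue on the diagonal, 1s just below it."""
    return [[eigenvalue if r == c else 1 if r == c + 1 else 0
             for c in range(size)] for r in range(size)]

def mat_mul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

size, lam = 4, 5                      # illustrative values only
J = jordan_block(lam, size)
N = jordan_block(0, size)             # elementary nilpotent = Jordan block with eigenvalue 0
scalar = [[lam if r == c else 0 for c in range(size)] for r in range(size)]

# Entry-by-entry check that J = lam * I + N.
decomposes = all(J[r][c] == scalar[r][c] + N[r][c]
                 for r in range(size) for c in range(size))

# N is nilpotent of degree `size`: N^(size-1) is nonzero, but N^size is zero.
power = N
for _ in range(size - 2):
    power = mat_mul(power, N)         # after the loop, power = N^(size-1)
next_power = mat_mul(power, N)        # N^size

print(decomposes)
print(any(x != 0 for row in power for x in row))
print(all(x == 0 for row in next_power for x in row))
```

Since the scalar part \(\lambda I\) commutes with everything, all of the interesting structure of a Jordan block lives in its nilpotent part.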
Subsection 34.3.2 Uniqueness of Jordan normal form
Every matrix is similar (over \(\C\)) to a Jordan normal form matrix (see Theorem 34.5.1). We would like to think of a matrix as having one and only one (unique) Jordan normal form, so that we may use the Jordan normal form to represent and distinguish similarity classes. However, if a matrix \(A\) has more than one distinct eigenvalue, we could mix up the order of the eigenvalues to obtain a different matrix in Jordan normal form to which \(A\) is also similar. Since there isn't any “natural” way to put the eigenvalues of a given matrix into a specific order, we will have to settle for the fact that a matrix can have several Jordan normal forms. In fact, our requirement that the Jordan blocks for a given eigenvalue \(\lambda_j\) be arranged in descending size isn't really “natural” either. Why not ascending size? Why not a random order of sizes? We will just have to accept that a matrix can have many different Jordan normal forms. But at least each of these Jordan normal form matrices will look essentially the same, and all of them will be similar to one another (and to the original matrix).
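As a small illustration of this non-uniqueness, the following Python sketch (standard library only) takes a Jordan normal form matrix with two distinct eigenvalues and shows that listing the eigenvalues in the other order gives a similar matrix, with a permutation matrix as the transition matrix. The helper names and the sample eigenvalues \(2\) and \(5\) are hypothetical choices of ours.

```python
def jordan_block(eigenvalue, size):
    """Lower-triangular Jordan block: eigenvalue on the diagonal, 1s just below it."""
    return [[eigenvalue if r == c else 1 if r == c + 1 else 0
             for c in range(size)] for r in range(size)]

def block_diag(*blocks):
    """Square block-diagonal matrix with the given square blocks down the diagonal."""
    n = sum(len(b) for b in blocks)
    out = [[0] * n for _ in range(n)]
    offset = 0
    for b in blocks:
        for r, row in enumerate(b):
            for c, x in enumerate(row):
                out[offset + r][offset + c] = x
        offset += len(b)
    return out

def mat_mul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def permutation_matrix(order):
    """Matrix whose column c is the standard basis vector with index order[c]."""
    n = len(order)
    return [[1 if r == order[c] else 0 for c in range(n)] for r in range(n)]

J = block_diag(jordan_block(2, 2), jordan_block(5, 1))          # eigenvalue 2 listed first
J_swapped = block_diag(jordan_block(5, 1), jordan_block(2, 2))  # eigenvalue 5 listed first

# Reorder the basis: third basis vector first, then the first two (0-based 2, 0, 1).
P = permutation_matrix([2, 0, 1])

# P is invertible, so J P = P J_swapped is equivalent to P^{-1} J P = J_swapped.
same_class = mat_mul(J, P) == mat_mul(P, J_swapped)
print(J != J_swapped, same_class)
```

The two matrices are different as arrays of numbers, yet they represent the same similarity class, which is exactly the ambiguity described above.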
Subsection 34.3.3 Jordan normal form procedure
Procedure 34.3.1. Jordan normal form.
Given an \(n \times n\) matrix \(A\) whose characteristic polynomial factors completely, we can determine an invertible matrix \(P\) so that \(\inv{P} A P\) is in Jordan normal form as follows.
- Use Procedure 30.4.2 to obtain a transition matrix \(Q\) so that \(U = \inv{Q} A Q\) is in triangular block form, with upper triangular blocks \(U_1,\dotsc,U_k\text{,}\) each corresponding to a distinct eigenvalue \(\lambda_1,\dotsc,\lambda_k\text{,}\) respectively, of \(A\text{.}\)
- For each index \(i\text{,}\) form the nilpotent matrix \(N_i = U_i - \lambda_i I_{m_i}\text{,}\) where \(m_i\) is both the size of \(U_i\) and the algebraic multiplicity of the eigenvalue \(\lambda_i\text{.}\) Then use one of the procedures from Subsection 33.4.3 (or some other method) to obtain an \(m_i \times m_i\) transition matrix \(R_i\) so that \(\inv{R_i} N_i R_i\) is in triangular-block nilpotent form. Since a scalar matrix commutes with every matrix, using this same \(R_i\) we will also have \begin{equation*} U_i' = \inv{R_i} U_i R_i = \inv{R_i} (\lambda_i I_{m_i} + N_i) R_i = \lambda_i I_{m_i} + \inv{R_i} N_i R_i\text{,} \end{equation*} so that the matrix \(U_i'\) will be in Jordan normal form, with every one of its Jordan blocks having the same scalar part \(\lambda_i\text{.}\)
- Form the \(n\times n\text{,}\) block-diagonal transition matrix \(R\) with blocks \(R_1,\dotsc,R_k\text{,}\) in order, down the diagonal. Then set \(P = QR\text{,}\) so that \begin{equation*} \inv{P} A P = \inv{R} \inv{Q} A Q R = \inv{R} U R \end{equation*} will be in Jordan normal form.
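The final assembly step of this procedure can be sketched in Python (standard library only, with exact rational arithmetic). The matrices `A`, `Q`, `R1`, and `R2` below are a small made-up example: we simply assume that `Q` and the `R_i` have already been produced by the first two steps, and we only check that \(P = QR\) does what the procedure claims. The helper names are ours.

```python
from fractions import Fraction

def mat_mul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def block_diag(*blocks):
    """Square block-diagonal matrix with the given square blocks down the diagonal."""
    n = sum(len(b) for b in blocks)
    out = [[0] * n for _ in range(n)]
    offset = 0
    for b in blocks:
        for r, row in enumerate(b):
            for c, x in enumerate(row):
                out[offset + r][offset + c] = x
        offset += len(b)
    return out

def inverse(m):
    """Gauss-Jordan inverse over the rationals; raises ValueError if m is singular."""
    n = len(m)
    aug = [[Fraction(x) for x in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(m)]
    for col in range(n):
        pivot = next((r for r in range(col, n) if aug[r][col] != 0), None)
        if pivot is None:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        aug[col] = [x / aug[col][col] for x in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

# Hypothetical example data (assumed outputs of steps 1 and 2, not computed here).
A = [[2, 1, -1], [0, 2, 1], [0, 0, 3]]
Q = [[1, 1, 0], [0, 1, 1], [0, 0, 1]]
R1 = [[0, 1], [1, 0]]    # for eigenvalue 2, block size 2
R2 = [[1]]               # for eigenvalue 3, block size 1

U = mat_mul(mat_mul(inverse(Q), A), Q)   # triangular block form
R = block_diag(R1, R2)                   # step 3: block-diagonal R
P = mat_mul(Q, R)                        # step 3: P = QR
J = mat_mul(mat_mul(inverse(P), A), P)   # should be in Jordan normal form

for row in J:
    print([int(x) for x in row])
```

In this example \(U\) has an upper triangular block for the eigenvalue \(2\text{,}\) and conjugating by \(R\) converts it into a lower-triangular Jordan block without disturbing the block for the eigenvalue \(3\text{.}\)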
Subsection 34.3.4 Determining the form indirectly
Just as in Subsection 33.4.2, it's possible to use rank calculations to determine the precise Jordan normal form matrix to which a given matrix \(A\) is similar without actually determining a transition matrix. Since the first step in analyzing the form of a matrix is to determine its eigenvalues and their algebraic multiplicities by calculating the characteristic polynomial of the matrix, we assume we have this information at our disposal.
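Consider \(A\) to be similar to a Jordan normal form matrix \begin{equation*} J = \begin{bmatrix} \lambda_1 I + N_1 \\ & \lambda_2 I + N_2 \\ & & \ddots \\ & & & \lambda_k I + N_k \end{bmatrix}\text{.} \end{equation*}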
Note that each block above is not in general a Jordan block — it is itself a block-diagonal matrix made up of one or more Jordan blocks. And so each nilpotent part \(N_i\) of each eigenvalue-based block is in triangular-block nilpotent form. To know the exact form, we just need to know the form of each nilpotent part. We can approach this task using similar patterns to the ones described in Subsection 33.4.2, once we get those eigenvalues out of the way.
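Consider the matrix \begin{equation*} J - \lambda_1 I = \begin{bmatrix} N_1 \\ & (\lambda_2 - \lambda_1) I + N_2 \\ & & \ddots \\ & & & (\lambda_k - \lambda_1) I + N_k \end{bmatrix}\text{.} \end{equation*}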
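Since the eigenvalues \(\lambda_1,\dotsc,\lambda_k\) are distinct, the first block \(N_1\) of \(J - \lambda_1 I\) is the only one with zeros down the diagonal, and the same is true of any power of \(J - \lambda_1 I\text{.}\) Every other diagonal block is triangular with nonzero diagonal entries \(\lambda_i - \lambda_1\text{,}\) so it and all of its powers are invertible. So for every positive exponent \(p\) we will always have \begin{equation*} \operatorname{rank} \bigl( (J - \lambda_1 I)^p \bigr) = \operatorname{rank} (N_1^p) + m_2 + \dotsb + m_k\text{,} \end{equation*}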
where the \(m_i\) are the algebraic multiplicities of the other eigenvalues, as usual. In this way, we can investigate the exact form of the nilpotent submatrix \(N_1\) exactly as in Subsection 33.4.2, with the understanding that to get at the ranks of powers of \(N_1\text{,}\) we will have to subtract off the sum of the multiplicities of the other eigenvalues from the ranks of powers of \(J - \lambda_1 I\text{.}\) Similarly, we can investigate the form of any of the nilpotent \(N_i\) using the ranks of powers of \(J - \lambda_i I\text{.}\)
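This rank bookkeeping is easy to check on a small example. The Python sketch below (standard library only) uses a made-up Jordan normal form matrix with eigenvalue \(2\) of multiplicity \(3\) and eigenvalue \(5\) of multiplicity \(2\text{,}\) and compares the rank of each power of \(J - 2I\) against the rank of the same power of the nilpotent part plus the multiplicity of the other eigenvalue. All helper names and sample values are our own.

```python
from fractions import Fraction

def jordan_block(eigenvalue, size):
    """Lower-triangular Jordan block: eigenvalue on the diagonal, 1s just below it."""
    return [[eigenvalue if r == c else 1 if r == c + 1 else 0
             for c in range(size)] for r in range(size)]

def block_diag(*blocks):
    """Square block-diagonal matrix with the given square blocks down the diagonal."""
    n = sum(len(b) for b in blocks)
    out = [[0] * n for _ in range(n)]
    offset = 0
    for b in blocks:
        for r, row in enumerate(b):
            for c, x in enumerate(row):
                out[offset + r][offset + c] = x
        offset += len(b)
    return out

def mat_mul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def rank(m):
    """Rank by Gaussian elimination over the rationals (exact, no rounding)."""
    rows = [[Fraction(x) for x in row] for row in m]
    rk = 0
    for col in range(len(rows[0])):
        pivot = next((r for r in range(rk, len(rows)) if rows[r][col] != 0), None)
        if pivot is None:
            continue
        rows[rk], rows[pivot] = rows[pivot], rows[rk]
        for r in range(rk + 1, len(rows)):
            factor = rows[r][col] / rows[rk][col]
            rows[r] = [x - factor * y for x, y in zip(rows[r], rows[rk])]
        rk += 1
    return rk

# Eigenvalue 2 with blocks of sizes 2 and 1, eigenvalue 5 with one block of size 2.
J = block_diag(jordan_block(2, 2), jordan_block(2, 1), jordan_block(5, 2))
N1 = block_diag(jordan_block(0, 2), jordan_block(0, 1))   # nilpotent part for eigenvalue 2
other_multiplicities = 2                                  # multiplicity of eigenvalue 5

shift = [[J[r][c] - (2 if r == c else 0) for c in range(len(J))] for r in range(len(J))]
power_J, power_N = shift, N1
checks = []
for p in range(1, 4):
    # (exponent, rank of (J - 2I)^p, rank of N1^p plus the other multiplicities)
    checks.append((p, rank(power_J), rank(power_N) + other_multiplicities))
    power_J, power_N = mat_mul(power_J, shift), mat_mul(power_N, N1)

print(checks)
```

In each triple the last two entries agree, so subtracting the multiplicities of the other eigenvalues from the ranks of powers of \(J - \lambda_1 I\) recovers the ranks of powers of \(N_1\text{.}\)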
Now, if \(A\) is similar to \(J\text{,}\) then \(A - \lambda_i I\) is similar to \(J - \lambda_i I\) for each eigenvalue \(\lambda_i\text{.}\) (See the proof of Theorem 26.5.8.) So the ranks of powers of the matrices \(A - \lambda_i I\) will tell us the ranks of the powers of the matrices \(J - \lambda_i I\text{,}\) which means that we can learn the exact form of \(J\) by computing the ranks of powers of \(A - \lambda_i I\text{.}\)
See Example 34.4.1 for a simple example of using this kind of reasoning.
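The whole indirect method can be sketched in a few lines of Python (standard library only). The sketch relies on a standard counting fact, which we assume matches the patterns of Subsection 33.4.2: writing \(r_p\) for the rank of \((A - \lambda I)^p\) (with \(r_0 = n\)), the difference \(r_{p-1} - r_p\) is the number of Jordan blocks for \(\lambda\) of size at least \(p\text{.}\) Because only differences of ranks are used, the constant contribution of the other eigenvalues cancels automatically. The function name `jordan_block_sizes` and the sample matrix are hypothetical choices of ours, and the eigenvalues and multiplicities are assumed to be already known from the characteristic polynomial.

```python
from fractions import Fraction

def mat_mul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def rank(m):
    """Rank by Gaussian elimination over the rationals (exact, no rounding)."""
    rows = [[Fraction(x) for x in row] for row in m]
    rk = 0
    for col in range(len(rows[0])):
        pivot = next((r for r in range(rk, len(rows)) if rows[r][col] != 0), None)
        if pivot is None:
            continue
        rows[rk], rows[pivot] = rows[pivot], rows[rk]
        for r in range(rk + 1, len(rows)):
            factor = rows[r][col] / rows[rk][col]
            rows[r] = [x - factor * y for x, y in zip(rows[r], rows[rk])]
        rk += 1
    return rk

def jordan_block_sizes(a, eigenvalue, multiplicity):
    """Sizes of the Jordan blocks of `a` for one eigenvalue, largest first."""
    n = len(a)
    shift = [[a[r][c] - (eigenvalue if r == c else 0) for c in range(n)]
             for r in range(n)]
    ranks = [n]                       # rank of the 0th power (the identity)
    power = shift
    for _ in range(multiplicity + 1):
        ranks.append(rank(power))
        power = mat_mul(power, shift)
    # at_least[p - 1] = number of blocks of size at least p
    at_least = [ranks[p - 1] - ranks[p] for p in range(1, multiplicity + 2)]
    sizes = []
    for p in range(multiplicity, 0, -1):
        exactly_p = at_least[p - 1] - at_least[p]
        sizes.extend([p] * exactly_p)
    return sizes

# Made-up example: eigenvalue 2 (multiplicity 2) and eigenvalue 3 (multiplicity 1).
A = [[2, 1, -1], [0, 2, 1], [0, 0, 3]]
print(jordan_block_sizes(A, 2, 2))
print(jordan_block_sizes(A, 3, 1))
```

For this matrix the ranks of the powers of \(A - 2I\) are \(2, 1, 1\text{,}\) which forces a single Jordan block of size \(2\) for the eigenvalue \(2\text{,}\) and no transition matrix was needed to find it.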