Skip to main content

Section 32.2 Motivation

We continue on our quest to answer Question 28.1.1. If we are happy with triangular block form, then we have succeeded: every matrix is similar (over \(\C\)) to one in triangular block form. But could we make a triangular block form similar to some even simpler form?

Philosophy of inquiry: break the problem apart.

If a problem is made up of readily-identifiable smaller and/or simpler parts, concentrate on each individual part separately, if possible.

A matrix in triangular block form is immediately recognizable as being made out of parts that are both smaller and simpler: the blocks themselves. But we can break each block into parts as well.

Each block in triangular block form is in scalar-triangular form, and a scalar-triangular form matrix can be decomposed as a sum

\begin{equation*} \lambda I + N \end{equation*}

of a scalar matrix \(\lambda I\) and a nilpotent matrix \(N\text{.}\) As we realized in Discovery 32.1, attempting to simplify each block in triangular form is equivalent to simplifying each nilpotent part of the scalar-triangular blocks. However, just as we did when we considered scalar-triangular form as an initial special case before analyzing general triangular block form, we will consider first consider a special case of nilpotent matrix.

Philosophy of inquiry: analyze the “cusp” case.

A “cusp” is a boundary between two different kinds of behaviour, in analogy with a cusp point on the graph of a function.

Visualization of a cusp point

A cusp is often where interesting things happen or where special patterns are revealed.

In Chapter 24, we learned that a singular matrix must have \(\lambda = 0\) as an eigenvalue (Theorem 24.6.3). So in the triangular form for a singular matrix, one of the blocks will have zeros down the diagonal, with as many diagonal zeros as the algebraic multiplicity of eigenvalue \(\lambda = 0\) in the characteristic polynomial. We often think of a singular matrix as being “like zero”, since \(0\) is the only number that does not have an inverse. But an \(n \times n\) zero matrix has eigenvalue \(\lambda = 0\) with maximum algebraic multiplicity \(m_\lambda = n\text{.}\) So, in a way, the algebraic multiplicity of \(\lambda = 0\) for a singular matrix measures how “close” to being zero that matrix is.

Diagram of a partition of the set of \(n \times n\) matrices.
Figure 32.2.1. Partition of the set of \(n \times n\) matrices into invertible and singular, with the set of singular matrices further partitioned by multiplicity of the eigenvalue \(\lambda = 0\text{.}\)

But the zero matrix is not the sole example of a matrix with eigenvalue \(\lambda = 0\) of maximum algebraic multiplicity — every nilpotent matrix does as well (Statement 2 of Theorem 31.5.3). So in one sense, nonzero nilpotent is as close as you can get to being zero without actually being zero.

However, we can further partition the set of nilpotent matrices of a particular size by degree of nilpotency — the lowest positive exponent which takes the nilpotent matrix to zero. The zero matrix stands alone in this partition, as it is the only matrix with degree of nilpotency equal to \(1\) (since it already is zero).

Diagram of a partition of the set of nilpotent \(n \times n\) matrices.
Figure 32.2.2. Partition of the set of \(n \times n\) singular matrices by degree of nilpotency.

The “cusp” case we will consider is at the top of this partition: those nilpotent matrices with degree of nilpotency \(n\text{.}\) These matrices are “close” to zero in the sense that they are nilpotent (and so have eigenvalue \(\lambda = 0\) with maximum algebraic multiplicity). But of all nilpotent matrices they are as far from zero as possible, since it takes the maximum exponent \(n\) for them to reveal their nilpotency. They are on the cusp between being nilpotent and being singular non-nilpotent.

We have already encountered a prototypical example of this sort of nilpotent matrix that is “on the cusp”: the elementary nilpotent form matrices of Discovery 32.3. Our task in this chapter (that we have already begun in Discovery guide 32.1) is to determine the pattern of similarity that emerges when a nilpotent matrix is similar to an elementary nilpotent one.