Skip to main content

Section 40.4 Concepts

Subsection 40.4.1 Diagonalization of Hermitian and symmetric matrices

Our first test case, explored in Discovery 40.3, was of a Hermitian matrix. In that discovery activity, with a full slate of linearly independent eigenvectors provided, we were easily able to produce a transition matrix \(P\) so that \(\inv{P} H P\) was diagonal. And, since the provided eigenvectors were already orthogonal, to convert \(P\) into a unitary matrix \(U\) so that \(\adjoint{U} H U\) was diagonal, all that was required was to normalize the eigenvectors to unit vectors.

As considered in Discovery 40.3.c, the crucial fact to make this work was that the eigenvectors were orthogonal, since unit vectors are then easy to produce by normalization.

Now, it may seem that lack of orthogonality is also easy to “fix” using the Gram-Schmidt process, and indeed we could use that process to convert an eigenspace basis into an orthogonal one. But as we found in the non-Hermitian example of Discovery 40.4, it won't work to fix eigenvectors from different eigenspaces that are not orthogonal to each other. In trying to correct a lack of one crucial property (orthogonality of basis vectors), Gram-Schmidt may affect the other crucial property for diagonalization: basis vectors must be eigenvectors.

Diagram of a change of variables via transition matrix.
Figure 40.4.1. Applying the Gram-Schmidt process to eigenvectors from different eigenspaces may result in non-eigenvectors.

Just as the fact that eigenvectors from different eigenspaces are automatically linearly independent (Proposition 25.6.6) was crucial to the process for diagonalization, eigenvectors from different eigenspaces being orthogonal is crucial to the prospect of being able to orthogonally/unitarily diagonalize a given matrix. Not every matrix has this property, but in Discovery 40.5 we discovered that self-adjoint matrices do have this property. And, in fact, we will prove in Subsection 40.6.2 that every self-adjoint matrix is orthogonally/unitarily diagonalizable.

It turns out that this is the end of the story in the real case — a real matrix is orthogonally diagonalizable if and only if it is symmetric (Theorem 40.6.13). So we will give a procedure for the real case now. (And essentially the same procedure will work for Hermitian matrices in the complex case.)

See Example 40.5.1 for an example of carrying out this procedure on a real symmetric matrix, and see Example 40.5.2 for an example of carrying out this procedure on a complex Hermitian matrix.

Subsection 40.4.2 Diagonalization of normal matrices

As Discovery 40.6 demonstrated, not every unitarily diagonalizable complex matrix is Hermitian. One property that unitarily diagonalizable matrices possess that does not seem to be exclusive to Hermitian matrices is commutativity with its own adjoint (see Discovery 40.7). A complex matrix with this property is called a normal matrix. And we will prove that normal matrices have the crucial property of orthogonal eigenspaces (see Statement 2 of Item 2), so that every normal matrix is unitarily diagonalizable (see Theorem 40.6.14).

Since each self-adjoint matrix obviously commutes with itself, every Hermitian matrix is normal, and it seems like we have now completely characterized the class of unitarily diagonalizable matrices. And the procedure to unitarily diagonalize a complex normal matrix is essentially identical to the procedure to orthogonally diagonalize a real symmetric matrix.

See Example 40.5.4 for an example of carrying out this procedure.