Section 40.4 Concepts
In this section.
Subsection 40.4.1 Diagonalization of Hermitian and symmetric matrices
Our first test case, explored in Discovery 40.3, was of a Hermitian matrix. In that discovery activity, with a full slate of linearly independent eigenvectors provided, we were easily able to produce a transition matrix P so that P−1HP was diagonal. And, since the provided eigenvectors were already orthogonal, to convert P into a unitary matrix U so that U∗HU was diagonal, all that was required was to normalize the eigenvectors to unit vectors. As considered in Discovery 40.3.c, the crucial fact to make this work was that the eigenvectors were orthogonal, since unit vectors are then easy to produce by normalization. Now, it may seem that lack of orthogonality is also easy to “fix” using the Gram-Schmidt process, and indeed we could use that process to convert an eigenspace basis into an orthogonal one. But as we found in the non-Hermitian example of Discovery 40.4, it won't work to fix eigenvectors from different eigenspaces that are not orthogonal to each other. In trying to correct a lack of one crucial property (orthogonality of basis vectors), Gram-Schmidt may affect the other crucial property for diagonalization: basis vectors must be eigenvectors.Procedure 40.4.2. Orthogonal diagonalization of a symmetric matrix.
Given a real, n×n, symmetric matrix S, we can construct an orthogonal transition matrix P so that P−1SP=PTSP is diagonal follows.
- Compute the eigenvalues of S. For each eigenvalue, compute a basis for the corresponding eigenspace.
- To each eigenspace basis from the first step, apply the Gram-Schmidt orthogonalization process to produce an orthogonal basis for that eigenspace.
- Normalize each eigenspace basis vector from the previous step to produce an orthonormal basis for each eigenspace.
- Form the n×n matrix P whose columns are the orthonormal eigenspace vectors from the third step collected all together.
Since the columns of P are constructed to be an orthonormal basis of Rn, this transition matrix will be orthogonal (Statement 4 of Theorem 39.5.6). And since the columns of P also form a basis of Rn consisting of eigenvectors for S, we will have P−1SP=PTSP=D, a diagonal matrix with the eigenvalues of S down the diagonal (Theorem 25.6.3).
Subsection 40.4.2 Diagonalization of normal matrices
As Discovery 40.6 demonstrated, not every unitarily diagonalizable complex matrix is Hermitian. One property that unitarily diagonalizable matrices possess that does not seem to be exclusive to Hermitian matrices is commutativity with its own adjoint (see Discovery 40.7). A complex matrix with this property is called a normal matrix. And we will prove that normal matrices have the crucial property of orthogonal eigenspaces (see Statement 2 of Item 2), so that every normal matrix is unitarily diagonalizable (see Theorem 40.6.14). Since each self-adjoint matrix obviously commutes with itself, every Hermitian matrix is normal, and it seems like we have now completely characterized the class of unitarily diagonalizable matrices. And the procedure to unitarily diagonalize a complex normal matrix is essentially identical to the procedure to orthogonally diagonalize a real symmetric matrix.Procedure 40.4.3. Unitary diagonalization of a normal matrix.
Given a complex, n×n, normal matrix N, we can construct a unitary transition matrix U so that U−1NU=UTNU is diagonal follows.
- Compute the eigenvalues of N. For each eigenvalue, compute a basis for the corresponding eigenspace.
- To each eigenspace basis from the first step, apply the Gram-Schmidt orthogonalization process to produce an orthogonal basis for that eigenspace.
- Normalize each eigenspace basis vector from the previous step to produce an orthonormal basis for each eigenspace.
- Form the n×n matrix U whose columns are the orthonormal eigenspace vectors from the third step collected all together.
Since the columns of U are constructed to be an orthonormal basis of Cn, this transition matrix will be unitary (Statement 4 of Theorem 39.5.6). And since the columns of U also form a basis for Cn consisting of eigenvectors for N, we will have U−1NU=UTNU=D, a diagonal matrix with the eigenvalues of N down the diagonal (using the complex version of Theorem 25.6.3).