In Chapter 25, we were interested in determining the conditions under which a square matrix is similar to a diagonal matrix. We called such a matrix diagonalizable. In this discovery guide, we’ll explore similarity more generally, both geometrically and algebraically.
The equality defining similar matrices seems one-directional: it would seem more appropriate to say that matrix \(A \) is similar to matrix \(B \), rather than saying that they are similar together, because \(A \) can be “transformed” into \(B \) via the transition matrix \(P \text{.}\)
Convince yourself that this distinction is not important by verifying that if \(A \) is similar to \(B \text{,}\) then \(B \) is also similar to \(A \text{:}\)
\begin{equation*}
\inv{(\fillinmath{XXX})} B (\fillinmath{XXX}) = A \text{.}
\end{equation*}
Suppose \(A, B \) are similar via transition matrix \(P \text{,}\) and that \(B \) is also similar to a third matrix \(C \) via transition matrix \(Q \text{,}\) so that \(\inv{Q} B Q = C\text{.}\)
As usual, write \(\basisfont{S} \) for the standard basis \(\basisfont{S} = \{ \uvec{e}_1, \uvec{e}_2 \} \) of \(\R^2 \text{.}\) Also write \(\basisfont{B} \) for the basis \(\basisfont{B} = \{ \uvec{p}_1, \uvec{p}_2 \} \) of \(\R^2 \) formed by the columns of \(P \) (Statement 11 of Theorem 21.5.5).
On a set of \(xy \)-axes, plot the vectors \(\uvec{v} \) and \(A \uvec{v} \text{,}\) where \(\uvec{v} = \left[\begin{smallmatrix} 3 \\ -2 \end{smallmatrix}\right] \text{.}\)
What are the columns of the transition matrix \(\ucobmtrx{B}{S} \text{?}\) Do you know another matrix in this activity that has those same columns? Then use Statement 3 of Proposition 22.5.4.
Let’s call the \(\basisfont{B} \)-coordinate system the \(wz \)-coordinate system, with \(w \) on the horizontal axis and \(z \) on the vertical axis. On a new set of \(wz \)-axes (don’t erase your \(xy \)-axes from before!), plot the vectors \(\matrixOf{\uvec{v}}{B} \) and \(B \matrixOf{\uvec{v}}{B} \text{.}\)
When computing \(B \matrixOf{\uvec{v}}{B} \text{,}\) the \(4 \) in the upper left entry of \(B \) multiplied the \(w \)-component of \(\matrixOf{\uvec{v}}{B} \text{,}\) and the \(-2 \) in the lower right entry of \(B \) multiplied the \(z \)-component of \(\matrixOf{\uvec{v}}{B} \text{.}\)
Describe this pattern in geometric terms, by considering how the diagonal entries of \(B \) determined how the vector \(\matrixOf{\uvec{v}}{B} \) was transformed into the vector \(B \matrixOf{\uvec{v}}{B} \text{.}\)
Just as the standard basis vectors \(\uvec{e}_1, \uvec{e}_2 \) correspond to the \(x \)- and \(y \)-axes, respectively, the \(\basisfont{B} \)-basis vectors \(\uvec{p}_1, \uvec{p}_2 \) correspond to the \(w \)- and \(z \)-axes, respectively.
Plot vectors \(\uvec{p}_1, \uvec{p}_2 \) on your original set of \(xy \)-axes from Task a, and then extend each of them in both directions (maybe with dashed lines) to create a set of \(wz \)-axes superimposed on the \(xy \)-axes.
Try to determine if your geometric description of the transformation \(\matrixOf{\uvec{v}}{B} \mapsto B \matrixOf{\uvec{v}}{B} \) from Task d is consistent with the geometric transformation \(\uvec{v} \mapsto A \uvec{v} \) on your first diagram, but relative to the new superimposed \(wz \)-axes.
(but for now stop thinking of \(P \) as a collection of columns). Again using the pattern of (✶✶✶) in Subsection 4.3.7, write down an expression for the first column of the product matrix \(P B \text{.}\)
Let’s explore your expression from Task b a little further. Suppose the first column of \(B \) is the vector \(\uvec{b}_1 = \left[\begin{smallmatrix} 5 \\ 3 \\ -1 \end{smallmatrix}\right] \text{.}\) Use the matrix-times-vector pattern from (✶✶) in Subsection 22.3.2 to express the first column of \(P B \) as a linear combination.
For \(A P = P B \) to be true, we must at least have the first columns of \(A P \) and \(P B \) equal. Set your expressions from Task a and Task c to be equal to help you fill in the following:
If we analyzed and compared the second columns of \(A P \) and \(P B \) in the same fashion, would we come to the same pattern as in Task d? What words in the pattern would you change? What if we analyzed and compared the third columns of \(A P \) and \(P B \) in the same fashion?
In each of the remaining discovery activities, assume that square matrices \(A,B \) are similar via transition matrix \(P \text{,}\) with \(\inv{P} A P = B \text{.}\)
Demonstrate that the transition matrix \(P \) transforms eigenvectors of \(B \) for a particular eigenvalue into eigenvectors of \(A \) for that same (shared) eigenvalue.