Processing math: 100%
Skip to main content

Section 26.3 Concepts

Subsection 26.3.1 The geometry of similarity

In Discovery 26.2, we explored the geometric nature of similarity between matrices by examining how each matrix in a similar pair “transformed” vectors in Rn. In effect, the columns of the transition matrix P form a new basis B of Rn, making P and P1 transition matrices from B to the standard basis S and vice versa (justifying our double defintion of the term transition matrix in Section 22.2 and Section 25.2). The new basis B forms a new set of axes in Rn, and we then find that the geometric nature of how a standard-coordinate vector is transformed relative to the new axes by one matrix agrees with how the same vector is transformed by the other matrix after the vector is first converted to the new coordinates.

The goal is to understand the first matrix and how it transforms vectors in Rn by finding a second similar matrix that is significantly simpler to understand, and then to use the transition matrices P and P1 to convert information about the simpler matrix into information about the original matrix.

Relating this back to Discovery 26.2, when we plotted the vectors

[v]B=P1v,B[v]B=BP1v

on their own set of wz-axes, we observed that the simple diagonal form of B made it easy to determine how [v]B is transformed by B: a stretch by a factor of 4 in the w-direction and both a reflection and a stretch by a factor of 2 in the z-direction. After superimposing a set of wz-axes onto the xy-axes using the B vectors, we observed that v was transformed by A in exactly the same way relative to the new superimposed axes. This should not have been a surprise, since the formula

Av=PBP1v=PBSBPSBv

(where A=PBP1 by the reversal of the similarity relation P1AP=B) can be translated as:

to transform v by A, one can instead convert v to B-coordinates, transform the result by B, and then convert that transformed vector back to standard coordinates.

Subsection 26.3.2 The algebra of similarity

In Discovery 26.2, we explored the algebraic connections between similar matrices A,B and the transition matrix P that realizes the similarity by P1AP=B.

Since inverses are complicated to work with, we first converted the similarity relation into AP=PB, and then we analyzed this equality of matrices by comparing corresponding columns of each side. Using the pattern of () in Subsection 4.3.7, the columns of AP are

Ap1,Ap2,,Apn,

where the pj are the columns of P. On the other side, the same matrix multiplication pattern gives the columns of PB as

Pb1,Pb2,,Pbn,

where the bj are the columns of B. To have both sides expressed in terms of the columns of P, we used the matrix-times-vector pattern of () in Subsection 4.3.7 to write each Pbj as a linear combination of the columns of P, with coefficients from the column bj.

Putting both sides together, we must have

Apj=Pbj=b1jp1+b2jp2++bnjpn.

Since P is assumed invertible, its columns form a basis of Rn. And linear combinations of basis vectors are precisely how we determine coordinates relative to a basis, which leads from the above equality of columns to the following pattern.

Pattern 26.3.1. Similarity by columns.

Square matrices A and B are similar via P1AP=B if and only if each column of B is the coordinate vector relative to BP of the result of transforming the corresponding column of P by A, where BP is the basis of Rn formed by the columns of P. In symbols, we can write this condition as

bj=[Apj]BP

for each column bj of B and each corresponding column pj of P.

This pattern will guide our exploration of matrix forms in most of the remainder of Part III of this book.

Subsection 26.3.3 Similarity classes

In Discovery 26.1, we demonstrated that the similarity relation is

symmetric

that is, a matrix is not just similar to another matrix, a pair of matrices must be similar to each other;

reflexive

that is, every matrix is similar to itself; and

transitive

that is, similarity can be transferred from first to third in a chain of similarities ABC.

These three properties together demonstrate that similarity is an equivalence relation.

A consequence of this fact is that similarity partitions Mn(C) into similarity classes — subsets of matrices that are all similar to each other but not to other matrices. To see why this is so, consider a particular square matrix A. The similarity class of A is defined to be the collection of all matrices that are similar to A. By the reflexive property, this class contains A itself, so is always nonempty. Suppose the class also contains matrices B and C, so that both BA and CA are true. Then the symmetric property lets us turn this into a chain BAC, which in turn lets us conclude BC by the transitive property. So matrices in a particular similarity class are all equivalent to each other. Furthermore, a matrix in the similarity class of A cannot be similar to a matrix outside of that class, for if BD is true then again we can form a chain ABD, so that D must be in the class.

Subsection 26.3.4 The similarities of similar matrices

The remaining activities in Discovery guide 26.1 provided justification for using the adjective similar in the terminology similar matrices. Just as we were able to use the transition matrices P and P1 to relate the geometric action of A on Rn relative to one set of coordinates to the geometric action of B on Rn relative to another set of coordinates in Subsection 26.3.1, those same transition matrices allow us to relate algebraic properties of the similar matrices.

We will formally state and prove many of the statements from those remaining discovery activities in Subsection 26.5.2.