Processing math: 100%
Skip to main content

Section 4.2 Rotations in two and three dimensions

The very cool thing about Lie groups is that almost all of their structures is encoded in their linearization. This is the key insight of Lie. In calculus, you can approximate an analytic function f(x) by looking at its linearization (or tangent line) at a point, say x=0. In fact, if the function is analytic, Taylor tells you that you can completely reconstruct the function locally if you know all its derivatives at x=0. But you need to know all derivatives. A similar statement holds in more than one dimensions; you can approximate a manifold at a point by looking at its tangent space. But this is only a linearization of the space.

For Lie groups the situation is different. It turns out that you only need to know the linearization of a group near the origin to recover the whole group locally! This is because of the group structure. If you know the first derivative of the group elements (thought of as matrices) at the origin, you can recover the group elements through exponentiation. So we can study the structure of Lie groups by looking at their linearizations at the origin, which are called Lie algebras. In other words, the Lie algebra of a Lie group can be thought of as the linearization, or tangent space, of the group manifold at the origin.

To introduce the concept of Lie algebras, we will first study rotations in two and three dimensions, and then define the formal idea that applies to all Lie groups.

Subsection 4.2.1 Rotations in two dimensions

Let us start by looking at the Lie group SO(2) of rotations in two dimensions. Those are given by 2Γ—2 special orthogonal matrices. We know that we can write any such rotation as a matrix

(cosΞΈsinΞΈβˆ’sinΞΈcosΞΈ),

in terms of an angle of rotation θ∈[0,2Ο€). The deep idea of Lie is that we can recover all rotations by doing a rotation by a very small angle many many times. Doesn't sound very deep, but it is. :-) More precisely, we think of infinitesimal rotations. If we take the angle ΞΈ to be small, we can approximate cosΞΈβ‰…1+O(ΞΈ)2 and sinΞΈβ‰…ΞΈ+O(ΞΈ)3. Keeping only terms of first order in ΞΈ, we can thus approximate the rotation matrix as

(cosΞΈsinΞΈβˆ’sinΞΈcosΞΈ)β‰…(1ΞΈβˆ’ΞΈ1)=I+ΞΈ(01βˆ’10).

Conversely, we can do a similar linear approximation for special orthogonal matrices. If A is a 2Γ—2 special orthogonal matrix A which is very close to the identity matrix, we can write Aβ‰…I+M for some β€œinfinitesimal” M. The condition that ATA=I implies that

(I+MT)(I+M)β‰…I+M+MT=I,

where we kept only terms of first order in M. Thus M=βˆ’MT, that is, M is an antisymmetric matrix. But in two dimensions any real antisymmetric matrix is a multiple of X=(01βˆ’10), thus we can write

A≅I+θX

for some real parameter ΞΈ as before. So we have recovered the same first order expansion as for rotations. We call X the generator of the Lie group (more precisely, it is a matrix representation of the generator of the Lie group).

Starting from orthogonal matrices, we recovered the first order expansion of rotations. But this should only work for special orthogonal matrices. It looks like we never imposed the condition that \(\det A = 1\text{.}\) Why?

The key insight of Lie is that we can now recover finite rotations from the knowledge of the generator X alone. Let R(ΞΈ) be a rotation by a finite angle ΞΈ. Pick a large integer N. We can recover R(ΞΈ) by doing R(ΞΈ/N) N times. In the limit as Nβ†’βˆž, R(ΞΈ/N) becomes an infinitesimal rotation. Thus we get:

R(ΞΈ)=limNβ†’βˆžR(ΞΈ/N)N=limNβ†’βˆž(I+ΞΈNX)N.

If we naively use the relation ex=limNβ†’βˆž(1+xN)N for matrices, we would conclude that

R(ΞΈ)=eΞΈX.

In other words, we can recover finite rotations by exponentiating the infinitesimal generator! That is cool. This is due to the group structure, which says that we can recover rotations by successively repeating rotations of smaller angles.

Let us be a little more precise and prove this explicitly, which makes the relation with the group structure more explicit.

Let us start with a two-dimensional rotation \(R(\theta)\text{.}\) Since it is a Lie group, we can Taylor expand near the identity:

\begin{equation} R(\theta) = I + \frac{d R}{d \theta} \Big|_{\theta = 0} \theta + \frac{1}{2} \frac{d^2 R}{d \theta^2} \Big|_{\theta = 0} \theta^2 + \ldots\label{equation-rotation2d-taylor}\tag{4.2.1} \end{equation}

(By derivative of a matrix here we mean derivative of its entries.) Let us now identity the derivatives of the rotation matrix. Rotations form a Lie group. The group structure is given by composition of rotations, which can be written as the requirement that

\begin{equation*} R(\theta_1+\theta_2) = R(\theta_1) R(\theta_2). \end{equation*}

Taking the derivative on both sides with respect to \(\theta_1\text{,}\) and then setting \(\theta_1 = 0\text{,}\) we get

\begin{equation} \frac{d R(\theta_1+\theta_2)}{d \theta_1} \Big|_{\theta_1 = 0} = R(\theta_2) \frac{d R(\theta_1)}{d \theta_1} \Big|_{\theta_1 = 0}.\label{equation-rotation2d}\tag{4.2.2} \end{equation}

We can calculate the left-hand-side of (4.2.2) via the chain rule:

\begin{equation*} \left( \frac{d R(\theta_1+\theta_2) }{d (\theta_1+\theta_2)} \frac{d (\theta_1+\theta_2)}{d \theta_1} \right) \Big|_{\theta_1 = 0} = \frac{d R(\theta_2)}{d \theta_2}. \end{equation*}

For the right-hand-side of (4.2.2), we define the matrix

\begin{equation*} X := \frac{d R(\theta_1)}{d \theta_1} \Big|_{\theta_1 = 0} = \begin{pmatrix} 0 \amp 1 \\ -1 \amp 0 \end{pmatrix}. \end{equation*}

Thus (4.2.2) becomes

\begin{equation} \frac{d R(\theta)}{d \theta} = X R(\theta).\label{equation-rotation2d-firstder}\tag{4.2.3} \end{equation}

In particular,

\begin{equation*} \frac{d R(\theta)}{d \theta}\Big|_{\theta = 0} = X. \end{equation*}

In fact, taking repeated derivatives of (4.2.3), we get:

\begin{equation*} \frac{d^n R(\theta)}{d \theta^n} = X \frac{d^{n-1} R(\theta)}{d \theta^{n-1}}. \end{equation*}

Evaluating at \(\theta=0\text{,}\) we get, by induction on \(n\text{,}\)

\begin{equation*} \frac{d^n R(\theta)}{d \theta^n}\Big|_{\theta = 0} = X^n. \end{equation*}

Thus all derivatives of the rotation matrix at the origin are determined by the generator \(X\text{!}\) As is clear from the proof, this follows because of the group structure of rotations.

Plugging this back into (4.2.1), with \(X^0 := I\text{,}\) we get

\begin{equation*} R(\theta) = \sum_{n=0}^\infty \frac{1}{n!} {\theta^n X^n} = e^{\theta X}, \end{equation*}

which concludes the proof.

The cool thing here is that all two-dimensional rotations can be recovered by exponentiating the infinitesimal generator X. So instead of studying the Lie group SO(2) directly, we could instead study the properties of the generator X. This is our first example of a Lie algebra. We say that X is an element of the Lie algebra so(2): we use the weird font to distinguish the Lie algebra from the Lie group. The algebra is rather trivial however here, since it has only one generator. Our next example will be less trivial.

Subsection 4.2.2 Rotations in three dimensions

Let us now consider the Lie group SO(3) consisting of three-dimensional rotations. We think of those as 3×3 special orthgonal matrices A. We do an infinitesimal expansion A≅I+M for an infinitesimal M. Then the orthogonality condition becomes

ATA=(I+MT)(I+M)β‰…I+MT+M=I.

Therefore, as for two-dimensional rotations, we conclude that M is a real antisymmetric 3Γ—3 matrix. In fact, it is customary in physics to introduce a factor of i in our linearization, and define instead Aβ‰…I+iL. Then what we have shown is that (iL)T=βˆ’iL. Since (iL)T is a real matrix, then it is equal to its complex conjugate βˆ’iL†. Thus the condition that (iL)T=βˆ’iL can be rewritten as L†=L, that is, L is a purely imaginary Hermitian 3Γ—3 matrix.

Any 3Γ—3 purely imaginary Hermitian matrix can be written as a linear combination of three matrices:

L1=βˆ’i(0000010βˆ’10),L2=βˆ’i(00βˆ’1000100),L3=βˆ’i(010βˆ’100000).

Those are the infinitesimal generators of the Lie group SO(3). We can write an arbitrary infinitesimal rotation as iΞΈ1L1+iΞΈ2L2+iΞΈ3L3 for three real parameters ΞΈ1,ΞΈ2,ΞΈ3 (we thus see that SO(3) is a three-dimensional Lie group). Following the same argument as for rotations in two dimensions, we can also conclude that we can write an arbitrary finite rotation in three dimensions (i.e. an element of the Lie group SO(3)) by exponentiation:

R(ΞΈ)=eiβˆ‘3i=1ΞΈiLi.

The algebra generated by the generators L1,L2,L3 is the Lie algebra so(3). Now that we have more than one generator, this is more interesting. How do we define the abstract properties of this algebra? (Instead of writing down an explicit representation in terms of three-dimensional matrices.) For this, we need one more element: the notion of a binary operation on the algebra, which, in matrix form, will be given by the commutator. Let us see how this comes about.

Subsection 4.2.3 Commutation and the commutator

What we have seen so far is that for rotations in two and three dimensions, we can reconstruct the group elements by exponentiating the infinitesimal generators. This is in fact a general statement for all Lie groups, as we will see in the next section. In the context of three-dimensional rotations, we found an explicit representation for the generators in terms of 3Γ—3 matrices. But just as when we defined abstract groups, we would like to obtain an abstract definition of the algebra of generators of a Lie group. For this, there is one element missing. In general, rotations do not commute. How can we see that from the point of view of the infinitesimal generators?

Let R≅I+M and R′≅I+M′ be infinitesimal rotations. Then

RRβ€²Rβˆ’1β‰…(I+M)(I+Mβ€²)(Iβˆ’M)=I+Mβ€²+(MMβ€²βˆ’Mβ€²M),

where we neglected terms of higher order. If we define the commutator [M,Mβ€²] as

[M,Mβ€²]:=MMβ€²βˆ’Mβ€²M,

then non-commutativity of R and Rβ€² is encapsulated in the statement of whether the commutator of their infinitesimal generators vanishes or not.

Thus, to encode the group structure of a Lie group in terms of the abstract notion of the algebra of its infinitesimal generators, we need to specify the commutation relations between the generators. This will give an abstract definition of a Lie algebra, from which a Lie group can be obtained by exponentiation.

For three-dimensional rotations, looking at the representation of the generators L1,L2,L3 in terms of 3Γ—3 matrices that we found, it is easy to compute that

[L1,L2]=iL3,[L2,L3]=iL1,[L3,L1]=iL2.

Check that the generators of three-dimensional rotations \(L_1,L_2,L_3\) satisfy these commutation relations.

This can be encoded neatly using the Levi-Civita symbol Ο΅ijk, which is defined as:

Ο΅ijk={1if (i,j,k) is an even permutation of (1,2,3);βˆ’1if (i,j,k) is an odd permutation of (1,2,3);0if two indices are repeated.

Then:

[Li,Lj]=i3βˆ‘k=1Ο΅ijkLk.

As we can see, the commutator closes, since the right-hand-side is a linear combination of the generators. Thus it provides a bilinear operation L×L→L on the vector space L spanned by L1,L2,L3. This is generally true, as we will see. For any Lie algebra, we are given a bilinear operation, which we write as a commutator, such that

[Li,Lj]=βˆ‘kcijkLk.

We call the cijk the structure constants of the Lie algebra.

Abstractly, we can define the Lie algebra so(3) as being the three-dimensional vector space V of real linear combinations of the generators L1,L2,L3, with a bilinear operation [⋅,⋅]:V×V→V specified by (4.2.5).

Subsection 4.2.4 Differential representation

So far we have worked exclusively with matrix representations of the rotation groups. In fact, we defined the rotation groups in terms of their fundamental, or defining, representations, as subgroups of GL(n,R).

But there are other types of representations that are very useful. Let us focus on three-dimensional rotations as an example. We can represent the infinitesimal generators of rotations as differential operators acting on functions f(x,y,z) on R3. To do that, what we need to do is find differential operators that satisfy the commutation relations (4.2.5). It is not too difficult to check that the following differential representation works:

L1=βˆ’i(yβˆ‚βˆ‚zβˆ’zβˆ‚βˆ‚y),L2=βˆ’i(zβˆ‚βˆ‚xβˆ’xβˆ‚βˆ‚z),L3=βˆ’i(xβˆ‚βˆ‚yβˆ’yβˆ‚βˆ‚x).

Check that the differential operators \(L_1,L_2,L_3\) satisfy the commutation relations (4.2.5).

You may recognize those operators as the angular momentum operators in quantum mechanics (up to a factor of ℏ). This is not a coincidence! The angular momentum operators in quantum mechanics are a representation of the infinitesimal generators of the group of rotations in three dimensions. In fact, the possibility of going back and forth between differential representations and matrix representations of the Lie algebra is the essence of the duality between the Schrodinger and Heisenberg pictures of quantum mechanics.