Section 36.6 Theory
In this section.
Subsection 36.6.1 Properties of inner products
Here we list algebraic properties of inner products, including those prescribed by the inner product axioms.
Proposition 36.6.1. Algebra rules of real inner products.
Suppose \(\inprod{\blank}{\blank}\) is a real inner product. Then the following are true for all vectors \(\uvec{u}\text{,}\) \(\uvec{v}\text{,}\) and \(\uvec{w}\) in the associated inner product space, and all scalars \(k\text{.}\)
- \(\uvecinprod{v}{u} = \uvecinprod{u}{v}\text{.}\)
- \(\inprod{\uvec{u}}{\uvec{v} + \uvec{w}} = \uvecinprod{u}{v} + \uvecinprod{u}{w}\text{.}\)
- \(\inprod{\uvec{u}}{\uvec{v} - \uvec{w}} = \uvecinprod{u}{v} - \uvecinprod{u}{w}\text{.}\)
- \(\inprod{\uvec{u} + \uvec{v}}{\uvec{w}} = \uvecinprod{u}{w} + \uvecinprod{v}{w}\text{.}\)
- \(\inprod{\uvec{u} - \uvec{v}}{\uvec{w}} = \uvecinprod{u}{w} - \uvecinprod{v}{w}\text{.}\)
- \(\inprod{k \uvec{u}}{\uvec{v}} = k \uvecinprod{u}{v}\text{.}\)
- \(\inprod{\uvec{u}}{k \uvec{v}} = k \uvecinprod{u}{v}\text{.}\)
- \(\uvecinprod{v}{v} = \unorm{v}^2\text{.}\)
- Both \(\inprod{\zerovec}{\uvec{v}} = 0\) and \(\inprod{\uvec{v}}{\zerovec} = 0\text{.}\)
Proof.
We will prove only Rule 9 and leave the rest up to you, the reader. (Though some of these are just restatements of the inner product axioms.)
Following the advice of Discovery 36.4, consider
which holds for every scalar \(k\) by Rule 6 (or, really, by Axiom RIP 3). If we take scalar \(k = 0\text{,}\) then we get
as desired.
The symmetric formula \(\inprod{\uvec{v}}{\zerovec} = 0\) then follows by Rule 1 (or, really, by Axiom RIP 1).
Most of the rules above also hold for complex inner products, but there are some wrinkles due to the complex conjugate.
Proposition 36.6.2. Algebra rules of complex inner products.
Suppose \(\inprod{\blank}{\blank}\) is a complex inner product. Then the following are true for all vectors \(\uvec{u}\text{,}\) \(\uvec{v}\text{,}\) and \(\uvec{w}\) in the associated inner product space, and all scalars \(a\text{.}\)
- \(\uvecinprod{v}{u} = \lcconj{\uvecinprod{u}{v}}\text{.}\)
- \(\inprod{\uvec{u}}{\uvec{v} + \uvec{w}} = \uvecinprod{u}{v} + \uvecinprod{u}{w}\text{.}\)
- \(\inprod{\uvec{u}}{\uvec{v} - \uvec{w}} = \uvecinprod{u}{v} - \uvecinprod{u}{w}\text{.}\)
- \(\inprod{\uvec{u} + \uvec{v}}{\uvec{w}} = \uvecinprod{u}{w} + \uvecinprod{v}{w}\text{.}\)
- \(\inprod{\uvec{u} - \uvec{v}}{\uvec{w}} = \uvecinprod{u}{w} - \uvecinprod{v}{w}\text{.}\)
- \(\inprod{a \uvec{u}}{\uvec{v}} = a \uvecinprod{u}{v}\text{.}\)
- \(\inprod{\uvec{u}}{a \uvec{v}} = \cconj{a} \uvecinprod{u}{v}\text{.}\)
- \(\uvecinprod{v}{v} = \unorm{v}^2\text{.}\)
- Both \(\inprod{\zerovec}{\uvec{v}} = 0\) and \(\inprod{\uvec{v}}{\zerovec} = 0\text{.}\)
Proof.
We will prove only Rule 7 and leave the rest up to you, the reader. (Though some of these are just restatements of the inner product axioms.)
Calculate
with justifications
- Rule 1 (or, really, Axiom CIP 1);
- Rule 6 (or, really, Axiom CIP 3);
- Rule 3 of Proposition A.2.11; and
- Rule 1 (or, really, Axiom CIP 1).
The addition and scalar multiple rules for inner products can be combined to demonstrate that a real inner product is bilinear (i.e. linear in both terms), while a complex inner product is sesquilinear (i.e. linear in the first term but conjugate-linear in the second).
Proposition 36.6.3. Linearity of inner products.
- A real inner product is bilinear. That is, for all vectors \(\uvec{u}\text{,}\) \(\uvec{v}\text{,}\) \(\uvec{w}\) in a real inner product space, and all scalars \(a\) and \(b\text{,}\) we have\begin{align*} \inprod{a \uvec{u} + b \uvec{v}}{\uvec{w}} \amp = a \uvecinprod{u}{w} + b \uvecinprod{v}{w},\\ \inprod{\uvec{u}}{a \uvec{v} + b \uvec{w}} \amp = a \uvecinprod{u}{v} + b \uvecinprod{u}{w}. \end{align*}
- A complex inner product is sesquilinear. That is, for all vectors \(\uvec{u}\text{,}\) \(\uvec{v}\text{,}\) \(\uvec{w}\) in a complex inner product space, and all scalars \(a\) and \(b\text{,}\) we have\begin{align*} \inprod{a \uvec{u} + b \uvec{v}}{\uvec{w}} \amp = a \uvecinprod{u}{w} + b \uvecinprod{v}{w},\\ \inprod{\uvec{u}}{a \uvec{v} + b \uvec{w}} \amp = \cconj{a} \uvecinprod{u}{v} + \cconj{b} \uvecinprod{u}{w}. \end{align*}
The properties of norm that we encountered using the dot product in \(\R^n\) remain true in every inner product space.
Proposition 36.6.4. Properties of the norm.
The following are true for all vectors \(\uvec{u}\) and \(\uvec{v}\) in an inner product space (real or complex) and all scalars \(k\text{.}\)
- \(\unorm{v} \ge 0\text{,}\) and \(\unorm{v} = 0\) only for \(\uvec{v} = 0\text{.}\)
- \(\norm{-\uvec{v}} = \unorm{v}\text{.}\)
- \(\norm{k\uvec{v}} = \abs{k}\unorm{v}\text{,}\) where \(\abs{k}\) indicates the ordinary absolute value of \(k\) in the context of a real inner product space, but indicates the complex modulus of \(k\) in the context of a complex inner product space.
- \(\norm{\uvec{v}-\uvec{u}} = \norm{\uvec{u}-\uvec{v}}\text{.}\)
Both The Cauchy-Schwarz inequality and the Triangle inequality remain true in every inner product space, real or complex.
Theorem 36.6.5. The Cauchy-Schwarz inequality.
For every pair of vectors \(\uvec{u}\) and \(\uvec{v}\) in an inner product space, we have \(\abs{\udotprod{u}{v}} \le \unorm{u}\unorm{v}\text{,}\) where \(\abs{\udotprod{u}{v}}\) indicates the ordinary absolute value in the real context and the complex modulus in the complex context.
Theorem 36.6.6. Triangle inequality.
For every pair of vectors \(\uvec{u}\) and \(\uvec{v}\) in an inner product space, we have \(\norm{\uvec{u}+\uvec{v}} \le \unorm{u} + \unorm{v}\text{.}\)
A vector can always be normalized to a unit vector.
Proposition 36.6.7. Normalization.
For nonzero vector \(\uvec{v}\) in an inner product space, the unit vectors in \(\Span \{\uvec{v}\}\) are
Proof idea.
Just calculate
using Rule 3 of Proposition 36.6.4, and simplify to get \(1\text{.}\)
Remark 36.6.8.
As usual, in the above proposition \(\abs{k}\) indicates the ordinary absolute value in the real context and the complex modulus in the complex context. The only two real scalars that satisfy \(\abs{k} = 1\) are \(k = \pm 1\text{.}\) But in the complex context, there are an infinite number of scalars that satisfy this condition: the entire unit circle in the complex plane.
Subsection 36.6.2 Inner products of \(\R^n\) and \(\C^n\)
Finally we'll address what we learned about alternative inner products in Discovery 36.7 and Subsection 36.4.7.
Proposition 36.6.9. Modified dot products are inner products.
- If \(A\) is a positive definite real matrix, then the pairing\begin{equation*} \uvecinprod{u}{v} = \utrans{\uvec{v}} A \uvec{u} \end{equation*}defines an inner product on \(\R^n\text{.}\)
- If \(A\) is a positive definite complex matrix, then the pairing\begin{equation*} \uvecinprod{u}{v} = \adjoint{\uvec{v}} A \uvec{u} \end{equation*}defines an inner product on \(\C^n\text{.}\)
Proof outline.
In the real case, assuming \(A\) is positive definite means, by definition, that \(A\) is symmetric and that the pairing defined in the statement satisfies Axiom RIP 4. The symmetry of \(A\) addresses Axiom RIP 1, which leaves only Axiom RIP 2 and Axiom RIP 3, which are easily confirmed using the properties of matrix algebra (Proposition 4.5.1).
The complex case is similar, with self-adjointness taking the place of symmetry.
The converse of the above proposition is true as well.
Theorem 36.6.10. Inner products are modified dot products.
- If \(\inprod{\blank}{\blank}\) is an inner product on \(\R^n\text{,}\) then there exists a positive definite real matrix \(A\) such that\begin{equation*} \uvecinprod{u}{v} = \utrans{\uvec{v}} A \uvec{u} \end{equation*}for all \(\uvec{u},\uvec{v}\) in \(\R^n\text{.}\)
- If \(\inprod{\blank}{\blank}\) is an inner product on \(\C^n\text{,}\) then there exists a positive definite complex matrix \(A\) such that\begin{equation*} \uvecinprod{w}{z} = \adjoint{\uvec{z}} A \uvec{w} \end{equation*}for all \(\uvec{w},\uvec{z}\) in \(\C^n\text{.}\)
Proof.
We will prove only the real case; the complex case is similar.
First, notice that for every matrix \(A\text{,}\)
where \(a_{ij}\) is the \((i,j)\) entry of \(A\) and \(\uvec{e}_i,\uvec{e}_j\) are standard basis vectors, as usual. So if we set \(A\) to be the matrix with entries
then we have
which is a start. And, as usually is the case, what is true for basis vectors will be true for all vectors.
Let's check: given \(\uvec{u},\uvec{v}\) in \(\R^n\text{,}\) we have
as desired.
It only remains to verify that \(A\) is positive definite. Using Axiom RIP 1, we have
so \(A\) is symmetric. And the fact that
that we have verified above, guarantees that
for all nonzero \(\uvec{v}\text{,}\) by Axiom RIP 4.
Finally, we verify the method of constructing positive definite matrices discussed in Subsection 36.4.7.
Proposition 36.6.11. Constructing positive definite matrices.
- If \(P\) is an invertible real matrix, then \(A = \utrans{P} P\) is a positive definite real matrix.
- If \(P\) is an invertible complex matrix, then \(A = \adjoint{P} P\) is positive definite complex matrix.
Proof.
Again, we will provide the proof only in the real context; the complex context is similar.
It is straightfoward to verify that \(A = \utrans{P} P\) is symmetric by showing \(\utrans{A} = A\text{.}\) So we will focus on the positive part of the definition of positive definite real matrix. Suppose \(\uvec{x}\) is a nonzero column vector in \(\R^n\text{.}\) Then,
We assumed both \(P\) invertible and \(\uvec{x} \neq \zerovec\text{,}\) so \(P \uvec{x} \neq \zerovec\) must be true as well (Theorem 6.5.2). Which also means \(\norm{P \uvec{x}} \neq 0\) (Rule 1 of Proposition 36.6.4). And therefore
as desired.