6.1. Transformation matrices
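For convenience we tend to use matrices to represent linear transformations. Let \(T: V \to W\) be a linear transformation between the vector spaces \(V\) and \(W\), where \(V, W \subseteq \mathbb{R}^n\). If \(\{\vec{v}_1, \vec{v}_2, \ldots, \vec{v}_n\}\) is a basis for \(V\) then a vector \(\vec{u} \in V\) can be written as a linear combination of the basis vectors
\[ \vec{u} = u_1 \vec{v}_1 + u_2 \vec{v}_2 + \cdots + u_n \vec{v}_n, \]
where \(u_1, u_2, \ldots, u_n\) are the coordinates of \(\vec{u}\) with respect to this basis, and by the definition of a linear transformation, applying \(T\) to \(\vec{u}\) gives
\[ T(\vec{u}) = u_1 T(\vec{v}_1) + u_2 T(\vec{v}_2) + \cdots + u_n T(\vec{v}_n), \]
so \(T(\vec{u})\) depends on the vectors \(T(\vec{v}_1), T(\vec{v}_2), \ldots, T(\vec{v}_n)\). We can write this as the matrix equation
\[ T(\vec{u}) = \begin{pmatrix} T(\vec{v}_1) & T(\vec{v}_2) & \cdots & T(\vec{v}_n) \end{pmatrix} \begin{pmatrix} u_1 \\ u_2 \\ \vdots \\ u_n \end{pmatrix} = A \vec{u}. \]
In other words we can apply a linear transformation simply by multiplying \(\vec{u}\) by a matrix \(A\) whose columns are the images of the basis vectors.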
(Transformation matrix)
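Let \(T : V \to W\) be a linear transformation, \(\{\vec{v}_1, \vec{v}_2, \ldots, \vec{v}_n\}\) be a basis for \(V\) and \(A\) be a matrix such that
\[ A = \begin{pmatrix} T(\vec{v}_1) & T(\vec{v}_2) & \cdots & T(\vec{v}_n) \end{pmatrix}, \]
then
\[ T(\vec{u}) = A \vec{u}. \]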
\(A\) is said to be the matrix representation of the linear transformation \(T\) (also known as the transformation matrix).
A linear transformation \(T:\mathbb{R}^2 \to \mathbb{R}^2\) is defined by \(T: (x, y)^\mathsf{T} \mapsto (3x + y, x + 2y)^\mathsf{T}\). Calculate the transformation matrix and use it to calculate \(T(1,1)^\mathsf{T}\).
Solution
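Since we are mapping from \(\mathbb{R}^2\) the transformation matrix is
\[ A = \begin{pmatrix} T(\vec{e}_1) & T(\vec{e}_2) \end{pmatrix}. \]
Applying the transformation to the standard basis vectors
\[ T(\vec{e}_1) = T\begin{pmatrix} 1 \\ 0 \end{pmatrix} = \begin{pmatrix} 3(1) + 0 \\ 1 + 2(0) \end{pmatrix} = \begin{pmatrix} 3 \\ 1 \end{pmatrix}, \qquad T(\vec{e}_2) = T\begin{pmatrix} 0 \\ 1 \end{pmatrix} = \begin{pmatrix} 3(0) + 1 \\ 0 + 2(1) \end{pmatrix} = \begin{pmatrix} 1 \\ 2 \end{pmatrix}, \]
so the transformation matrix is
\[ A = \begin{pmatrix} 3 & 1 \\ 1 & 2 \end{pmatrix}. \]
Applying the transformation matrix to \((1, 1)^\mathsf{T}\)
\[ T\begin{pmatrix} 1 \\ 1 \end{pmatrix} = \begin{pmatrix} 3 & 1 \\ 1 & 2 \end{pmatrix} \begin{pmatrix} 1 \\ 1 \end{pmatrix} = \begin{pmatrix} 4 \\ 3 \end{pmatrix}. \]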
The effects of the linear transformation from Example 6.1 are illustrated in Fig. 6.1. Note that the transformation \(T\) can be thought of as changing the basis of the vector space. The unit square with respect to the basis \(\{\vec{e}_1, \vec{e}_2\}\) has been transformed into a unit parallelogram with respect to the basis \(\{ T(\vec{e}_1), T(\vec{e}_2)\}\).
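The construction used in the example can be sketched in a few lines of Python. This is only an illustrative sketch using the standard library: the helper names `transformation_matrix` and `mat_vec` are hypothetical and chosen here for illustration, and the matrix is stored as a list of rows.

```python
def T(v):
    """The linear transformation from the example: (x, y) -> (3x + y, x + 2y)."""
    x, y = v
    return (3 * x + y, x + 2 * y)


def transformation_matrix(transform, n):
    """Build the matrix whose columns are the images of the standard basis of R^n."""
    columns = []
    for j in range(n):
        # e_j has a 1 in position j and 0 elsewhere
        e_j = tuple(1 if i == j else 0 for i in range(n))
        columns.append(transform(e_j))
    m = len(columns[0])
    # columns[j][i] is the entry in row i, column j; store the result as a list of rows
    return [[columns[j][i] for j in range(n)] for i in range(m)]


def mat_vec(A, v):
    """Multiply the matrix A (list of rows) by the vector v."""
    return tuple(sum(a * x for a, x in zip(row, v)) for row in A)


A = transformation_matrix(T, 2)
print(A)                    # [[3, 1], [1, 2]]
print(mat_vec(A, (1, 1)))   # (4, 3)
```

Multiplying by `A` and calling `T` directly should agree for any input vector, which is a quick way to check that the matrix was assembled correctly.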
6.1.1. Finding the transformation matrix from a set of images
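The calculation of the transformation matrix in Example 6.2 was straightforward as we knew what the transformation was. This will not always be the case: we may know the output of the transformation (known as the image) for certain input vectors but not the transformation itself. Consider a linear transformation \(T: \mathbb{R}^n \to \mathbb{R}^m\) applied to vectors \(\vec{u}_1, \vec{u}_2, \ldots, \vec{u}_n \in \mathbb{R}^n\). If \(A\) is the transformation matrix for \(T\) then
\[ A \begin{pmatrix} \vec{u}_1 & \vec{u}_2 & \cdots & \vec{u}_n \end{pmatrix} = \begin{pmatrix} T(\vec{u}_1) & T(\vec{u}_2) & \cdots & T(\vec{u}_n) \end{pmatrix}, \]
therefore, provided the matrix of input vectors is invertible,
\[ A = \begin{pmatrix} T(\vec{u}_1) & T(\vec{u}_2) & \cdots & T(\vec{u}_n) \end{pmatrix} \begin{pmatrix} \vec{u}_1 & \vec{u}_2 & \cdots & \vec{u}_n \end{pmatrix}^{-1}. \]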
(Determining the linear transformation given the inputs and image vectors)
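Given a linear transformation \(T: \mathbb{R}^n \to \mathbb{R}^m\) applied to a set of \(n\) linearly independent vectors \(\vec{u}_1, \vec{u}_2, \ldots, \vec{u}_n\) with known image vectors \(T(\vec{u}_1), T(\vec{u}_2), \ldots, T(\vec{u}_n)\) then the transformation matrix \(A\) for \(T\) is
\[ A = \begin{pmatrix} T(\vec{u}_1) & T(\vec{u}_2) & \cdots & T(\vec{u}_n) \end{pmatrix} \begin{pmatrix} \vec{u}_1 & \vec{u}_2 & \cdots & \vec{u}_n \end{pmatrix}^{-1}. \]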
Determine the transformation matrix \(A\) for the linear transformation \(T\) such that
Solution
The inverse of \((\vec{u}_1, \vec{u}_2)\) is
Right multiplying the image matrix
This is the transformation matrix from Example 6.2.
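As an illustration of the same procedure, the sketch below right multiplies a matrix of image vectors by the inverse of the matrix of input vectors. The input vectors \((1, 1)^\mathsf{T}\) and \((1, 2)^\mathsf{T}\) and their images \((4, 3)^\mathsf{T}\) and \((5, 5)^\mathsf{T}\) are assumptions chosen for this sketch (they are consistent with the matrix from Example 6.2 but are not necessarily the values used in the example above). The helper names are hypothetical, and `fractions.Fraction` is used to keep the arithmetic exact.

```python
from fractions import Fraction


def inverse_2x2(M):
    """Invert a 2x2 matrix (list of rows) using the adjugate formula."""
    (a, b), (c, d) = M
    det = a * d - b * c
    if det == 0:
        raise ValueError("matrix is singular: the input vectors are linearly dependent")
    return [[Fraction(d, det), Fraction(-b, det)],
            [Fraction(-c, det), Fraction(a, det)]]


def mat_mul(P, Q):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(P[i][k] * Q[k][j] for k in range(len(Q)))
             for j in range(len(Q[0]))]
            for i in range(len(P))]


# Assumed input vectors as columns: u1 = (1, 1), u2 = (1, 2)
U = [[1, 1],
     [1, 2]]

# Assumed image vectors as columns: T(u1) = (4, 3), T(u2) = (5, 5)
images = [[4, 5],
          [3, 5]]

# A = (T(u1) T(u2)) (u1 u2)^(-1)
A = mat_mul(images, inverse_2x2(U))

for row in A:
    print(*row)
```

With these assumed values the rows printed are `3 1` and `1 2`. If the input vectors were linearly dependent, `inverse_2x2` would raise an error, reflecting the fact that the images would then not determine the transformation uniquely.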
6.1.2. Inverse linear transformation
(Inverse linear transformation)
Let \(T: V \to W\) be a linear transformation with the transformation matrix \(A\). If \(A\) is invertible then \(T\) has an inverse transformation denoted by \(T^{-1}: W \to V\) which reverses the effects of \(T\). If \(\vec{u} \in V\) and \(\vec{v} \in W\) with \(\vec{v} = T(\vec{u})\) then
\[ T^{-1}(\vec{v}) = A^{-1} \vec{v} = \vec{u}, \]
where \(A^{-1}\) is the transformation matrix for \(T^{-1}\).
Determine the inverse of the transformation \(T: \mathbb{R}^2 \to \mathbb{R}^2\) defined by \(T: (x, y)^\mathsf{T} \mapsto (3 x + y, x + 2 y)^\mathsf{T}\) and calculate \(T^{-1}(4,3)^\mathsf{T}\).
Solution
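We saw in Example 6.2 that the transformation matrix for \(T\) is
\[ A = \begin{pmatrix} 3 & 1 \\ 1 & 2 \end{pmatrix}, \]
which has determinant \(\det(A) = 3(2) - 1(1) = 5\) and therefore the inverse
\[ A^{-1} = \frac{1}{5} \begin{pmatrix} 2 & -1 \\ -1 & 3 \end{pmatrix}. \]
Determining the inverse transformation
\[ T^{-1}\begin{pmatrix} x \\ y \end{pmatrix} = A^{-1} \begin{pmatrix} x \\ y \end{pmatrix} = \frac{1}{5} \begin{pmatrix} 2 & -1 \\ -1 & 3 \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix} = \frac{1}{5} \begin{pmatrix} 2x - y \\ -x + 3y \end{pmatrix}. \]
Calculating \(T^{-1}\begin{pmatrix} 4 \\ 3 \end{pmatrix}\)
\[ T^{-1}\begin{pmatrix} 4 \\ 3 \end{pmatrix} = \frac{1}{5} \begin{pmatrix} 2(4) - 3 \\ -4 + 3(3) \end{pmatrix} = \frac{1}{5} \begin{pmatrix} 5 \\ 5 \end{pmatrix} = \begin{pmatrix} 1 \\ 1 \end{pmatrix}. \]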
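The inverse transformation can also be checked numerically. The following is a minimal sketch using only the Python standard library. The helper names `inverse_2x2` and `mat_vec` are hypothetical, and the adjugate formula for a \(2 \times 2\) inverse is used here as one convenient way to compute \(A^{-1}\).

```python
from fractions import Fraction


def inverse_2x2(M):
    """Invert a 2x2 matrix (list of rows) using the adjugate formula."""
    (a, b), (c, d) = M
    det = a * d - b * c
    if det == 0:
        raise ValueError("matrix is singular, so the transformation has no inverse")
    return [[Fraction(d, det), Fraction(-b, det)],
            [Fraction(-c, det), Fraction(a, det)]]


def mat_vec(M, v):
    """Multiply the matrix M (list of rows) by the vector v."""
    return tuple(sum(m * x for m, x in zip(row, v)) for row in M)


# Transformation matrix for T: (x, y) -> (3x + y, x + 2y)
A = [[3, 1],
     [1, 2]]

A_inv = inverse_2x2(A)        # (1/5) * [[2, -1], [-1, 3]]
u = mat_vec(A_inv, (4, 3))    # T^{-1}(4, 3)

print(*u)                     # 1 1
print(*mat_vec(A, u))         # 4 3, applying T recovers the original vector
```

Applying \(A\) to the result recovers \((4, 3)^\mathsf{T}\), which confirms that \(T^{-1}\) reverses \(T\) for this vector.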