Vector spaces

5. Vector spaces

A vector space is a set of objects called vectors that satisfy axioms of vector addition and scalar multiplication. As the name suggests, the vectors in Euclidean space that we met in the chapter on vectors form a vector space, but so do many other types of mathematical objects. In the context of vector spaces, the term vector can therefore refer to matrices, polynomial functions, numbers, etc.

Vector spaces are abstract and their purpose may not be apparent upon first meeting them. Geoffrey Scott gives a good explanation of why we need vector spaces, which is paraphrased here. Consider the following three problems:

Problem 1: Find \(x_1, x_2, x_3 \in \mathbb{R}\) that satisfy

\[\begin{split} x_1 \begin{pmatrix} 2 \\ -1 \\ 1 \end{pmatrix} + x_2 \begin{pmatrix} 1 \\ 2 \\ 0 \end{pmatrix} + x_3 \begin{pmatrix} 3 \\ 1 \\ -1 \end{pmatrix} = \begin{pmatrix} 6 \\ 2 \\ 0 \end{pmatrix}. \end{split}\]

Problem 2: Find \(x_1, x_2, x_3 \in \mathbb{R}\) such that

\[ x_1 (2t^2 - t + 1) + x_2 (t^2 + 2t -2) + x_3 (0t^2 + t) = 2t^2 + 5t + 1.\]

Problem 3: Find \(x_1, x_2, \ldots \in \mathbb{C}\) such that

\[ \sum_{n=1}^\infty x_n \cos(n \pi t) = e^{5t}. \]

Problems 1, 2 and 3 are described using three different mathematical objects: Problem 1 uses vectors, Problem 2 uses polynomials and Problem 3 uses trigonometric functions. On closer inspection the three problems share the same structure: each asks for scalar coefficients that combine given objects into a target object, which amounts to solving a system of linear equations. We know how to solve Problem 1 using the methods from the chapter on systems of linear equations, so can we use the same methods to solve Problems 2 and 3, and similar problems posed with other mathematical objects?
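To make the similarity concrete, here is a minimal sketch that solves Problems 1 and 2 with the same routine. It assumes each polynomial in Problem 2 is represented by its coefficients of \(t^2\), \(t\) and \(1\), so that matching coefficients gives a \(3 \times 3\) linear system. The helper name `solve_linear_system` is hypothetical, and the routine assumes a square system with a unique solution.

```python
from fractions import Fraction

def solve_linear_system(columns, rhs):
    """Solve x_1*columns[0] + ... + x_n*columns[n-1] = rhs.

    Uses Gauss-Jordan elimination with exact rational arithmetic.
    Assumes the system is square and has a unique solution.
    """
    n = len(rhs)
    # Row i of the augmented matrix holds component i of every column, then rhs[i].
    aug = [[Fraction(col[i]) for col in columns] + [Fraction(rhs[i])]
           for i in range(n)]
    for k in range(n):
        # Find a row at or below k with a non-zero entry in column k and swap it up.
        pivot = next(r for r in range(k, n) if aug[r][k] != 0)
        aug[k], aug[pivot] = aug[pivot], aug[k]
        # Scale the pivot row so the pivot entry becomes 1.
        p = aug[k][k]
        aug[k] = [value / p for value in aug[k]]
        # Eliminate column k from every other row.
        for r in range(n):
            if r != k and aug[r][k] != 0:
                factor = aug[r][k]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[k])]
    return [row[-1] for row in aug]

# Problem 1: the objects are column vectors in R^3.
problem_1 = solve_linear_system([(2, -1, 1), (1, 2, 0), (3, 1, -1)], (6, 2, 0))

# Problem 2: the objects are polynomials, stored as (t^2, t, 1) coefficients.
problem_2 = solve_linear_system([(2, -1, 1), (1, 2, -2), (0, 1, 0)], (2, 5, 1))

print(problem_1)  # x_1 = 1, x_2 = 1, x_3 = 1
print(problem_2)  # x_1 = 1, x_2 = 0, x_3 = 6
```

The routine never needs to know whether the tuples came from vectors or polynomials, which is exactly the observation that motivates the abstract definition.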

To answer this we could state theorems for each type of mathematical object, e.g.,

“Suppose you have a collection of column vectors or polynomials or functions or any other type of mathematical object that can be multiplied by numbers and added together \(\ldots\)”

So every time we encounter a similar problem to the ones above we would need to state a theorem and prove it. Mathematicians like to be concise so the term vector space was invented to mean any type of mathematical object that can be multiplied by numbers and added together. So instead of the rambling phrase above we could use

“Let \(V\) be a vector space \(\ldots\)”

5.1. Definitions

Definition 5.1 (Binary operation)

A binary operation on a set \(S\) is a mapping that takes an ordered pair of elements of \(S\) and returns an element of \(S\).

\[ f: S \times S \to S\]

The symbol \(\times\) is used to denote the Cartesian product and \(\to\) denotes a mapping from one set to another.

We have already met examples of binary operations in the addition, multiplication and subtraction of real numbers. For \(a,b \in \mathbb{R}\),

\(f(a,b) = a + b\) is a real number.
\(f(a,b) = a \cdot b\) (\(a\) multiplied by \(b\)) is a real number [1].
\(f(a,b) = a - b\) is a real number.
\(f(a,b) = a \div b\) is a real number provided \(b \neq 0\), so division is a binary operation on the non-zero real numbers \(\mathbb{R} \setminus \{0\}\) rather than on all of \(\mathbb{R}\).

Another example is for the set of all \(2 \times 2\) matrices \(M_{2 \times 2}(\mathbb{R})\), \(f(A, B) = AB\) is a binary operation since \(AB\) is also a \(2 \times 2\) matrix.
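The closure requirement in the definition of a binary operation can be illustrated with a short sketch. The helper names `apply_binary` and `matmul_2x2` are hypothetical, and Python floats stand in for real numbers. Note that division is only defined when the second argument is non-zero, so it fails to be a mapping on every pair of real numbers.

```python
def apply_binary(op, a, b):
    """Return op(a, b), or None when op is undefined for this pair."""
    try:
        return op(a, b)
    except ZeroDivisionError:
        return None

def matmul_2x2(A, B):
    """Product of two 2x2 matrices stored as nested tuples of rows."""
    return tuple(
        tuple(sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

# Addition is defined for every pair of real numbers.
add_result = apply_binary(lambda a, b: a + b, 3.0, 0.0)

# Division is undefined for the pair (3, 0).
div_result = apply_binary(lambda a, b: a / b, 3.0, 0.0)

# The product of two 2x2 matrices is again a 2x2 matrix.
product = matmul_2x2(((1, 2), (3, 4)), ((0, 1), (1, 0)))

print(add_result)  # 3.0
print(div_result)  # None
print(product)     # ((2, 1), (4, 3))
```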

Definition 5.2 (Field)

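A field is a set \(F\) together with two binary operations on elements of \(F\), called addition and multiplication, that satisfy the field axioms: both operations are associative and commutative, there are distinct identity elements \(0\) for addition and \(1\) for multiplication, every element has an additive inverse, every non-zero element has a multiplicative inverse, and multiplication distributes over addition. For \(a,b \in F\) addition is written as \(a + b\) and multiplication is written as \(a \cdot b\).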

The classic examples of fields are sets of numbers such as the rational numbers \(\mathbb{Q}\), the real numbers \(\mathbb{R}\) and the complex numbers \(\mathbb{C}\), which will suffice for what we will be doing here. However, a field can be any set of objects for which the above definition is satisfied, and you may meet other examples of fields in your future studies.

Definition 5.3 (Vector space)

A vector space over a field \(F\) is a non-empty set \(V\) of objects called vectors on which the following two operations are defined:

addition \((+)\): \(V \times V \to V\). For example, given two elements \(u, v \in V\) we can ‘add’ them together to obtain another element \(u + v \in V\).
scalar multiplication \((\cdot )\): \(F \times V \to V\). For example, given \(\alpha \in F\) and \(u \in V\) we can ‘multiply’ \(u\) by \(\alpha\) to obtain another element \(\alpha \cdot u \in V\).

For example, the field could be the set of all real numbers \(\mathbb{R}\) and the vector space the set of all vectors in \(\mathbb{R}^3\), since we can add two vectors and multiply a vector by a scalar. Not every set equipped with these two operations is a vector space; to qualify, \(V\) must also satisfy the following axioms.

Axiom 5.1 (Axioms of a vector space)

Let \(u, v, w \in V\) and \(\alpha, \beta \in F\). Then \(V\) is a vector space if all of the following axioms are satisfied:

A1: Associativity of vector addition: \(u + (v + w) = (u + v) + w\);
A2: Commutativity of vector addition: \(u + v = v + u\);
A3: Identity element of vector addition: there exists an element \(0 \in V\) such that \(u + 0 = u\) for all \(u \in V\);
A4: Additive inverse: for every \(u \in V\) there exists an element \(-u \in V\), called the additive inverse, such that \(u + (-u) = 0\);
M1: Associativity of scalar multiplication: \(\alpha(\beta u) = (\alpha \beta) u\);
M2: Identity element of scalar multiplication: the multiplicative identity \(1 \in F\) satisfies \(1 u = u\);
M3: Distributivity of scalar multiplication with respect to vector addition: \(\alpha(u + v) = \alpha u + \alpha v\);
M4: Distributivity of scalar multiplication with respect to field addition: \((\alpha + \beta)u = \alpha u + \beta u\).

The first group of four axioms is concerned with addition and the second group of four axioms is concerned with scalar multiplication. When the scalars are real numbers, i.e., \(F = \mathbb{R}\), we call \(V\) a ‘real vector space’, which is short for a ‘vector space over the field of real numbers’. When the scalars are complex numbers, i.e., \(F=\mathbb{C}\), we talk of a ‘complex vector space’.
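The eight axioms can be spot-checked numerically for a familiar case. The sketch below assumes vectors in \(\mathbb{R}^3\) stored as tuples with componentwise operations, and uses exact rational numbers as stand-ins for real scalars. The function name `failed_axioms` is hypothetical. Passing on a handful of samples is only a sanity check, not a proof that the axioms hold for all vectors and scalars.

```python
from fractions import Fraction
from itertools import product

def add(u, v):
    """Componentwise vector addition."""
    return tuple(a + b for a, b in zip(u, v))

def scale(alpha, u):
    """Scalar multiplication: multiply every component by alpha."""
    return tuple(alpha * a for a in u)

ZERO = (Fraction(0), Fraction(0), Fraction(0))

def failed_axioms(vectors, scalars):
    """Return the sorted labels of axioms that fail on the given samples."""
    failed = set()
    one = Fraction(1)
    for u, v, w in product(vectors, repeat=3):
        if add(u, add(v, w)) != add(add(u, v), w):
            failed.add("A1")
    for u, v in product(vectors, repeat=2):
        if add(u, v) != add(v, u):
            failed.add("A2")
        for alpha, beta in product(scalars, repeat=2):
            if scale(alpha, scale(beta, u)) != scale(alpha * beta, u):
                failed.add("M1")
            if scale(alpha, add(u, v)) != add(scale(alpha, u), scale(alpha, v)):
                failed.add("M3")
            if scale(alpha + beta, u) != add(scale(alpha, u), scale(beta, u)):
                failed.add("M4")
    for u in vectors:
        if add(u, ZERO) != u:
            failed.add("A3")
        # In this representation the additive inverse of u is (-1) * u.
        if add(u, scale(-one, u)) != ZERO:
            failed.add("A4")
        if scale(one, u) != u:
            failed.add("M2")
    return sorted(failed)

vectors = [
    (Fraction(1), Fraction(2), Fraction(3)),
    (Fraction(0), Fraction(-1), Fraction(1, 2)),
    ZERO,
]
scalars = [Fraction(2), Fraction(-1, 3), Fraction(0)]

print(failed_axioms(vectors, scalars))  # [] means no axiom failed on these samples
```

Exact fractions are used instead of floats so that rounding error cannot produce spurious failures of associativity or distributivity.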

We now list some basic properties for multiplication by scalars in a vector space.

Theorem 5.1 (Properties of vector spaces)

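Let \(V\) be a vector space over \(F\) and let \(u \in V\), \(\alpha\in F\). Writing \(0\) for both the zero scalar in \(F\) and the zero vector in \(V\), as the context dictates, the following hold: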

\(\alpha \cdot 0 = 0\);
\(0 \cdot u = 0\);
if \(\alpha \cdot u = 0\) then either \(\alpha = 0\) or \(u = 0\);
\(-(\alpha \cdot u) = (-\alpha)\cdot u = \alpha\cdot(-u)\).
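As an illustration of how such properties follow from the axioms, here is a sketch of one possible argument for the second property. The bold symbol \(\mathbf{0}\) for the zero vector is notation introduced only for this sketch, to distinguish it from the zero scalar \(0 \in F\). Since \(0 = 0 + 0\) in \(F\), axiom M4 gives

\[ 0 \cdot u = (0 + 0) \cdot u = 0 \cdot u + 0 \cdot u. \]

By A4 the vector \(0 \cdot u\) has an additive inverse \(-(0 \cdot u)\). Substituting the identity above and then applying A1, A4 and A3 in turn gives

\[ \mathbf{0} = 0 \cdot u + (-(0 \cdot u)) = (0 \cdot u + 0 \cdot u) + (-(0 \cdot u)) = 0 \cdot u + (0 \cdot u + (-(0 \cdot u))) = 0 \cdot u + \mathbf{0} = 0 \cdot u. \]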

The name ‘vector space’ indicates a connection with the study of vectors. Indeed the Euclidean spaces \(\mathbb{R}^n\), as introduced in the chapter on vectors, are vector spaces over \(F=\mathbb{R}\), with addition and scalar multiplication given by the usual vector addition and scalar multiplication in \(\mathbb{R}^n\). Euclidean vector spaces are often the first examples of vector spaces that a student meets, but they do not fully demonstrate the power behind this construction. The real motivation for the study of vector spaces comes from the fact that many more abstract sets, such as differentiable functions or matrices, can be viewed as vector spaces. Everything that we learn about vector spaces can then be applied to any set which satisfies the definition of a vector space. This means that the set of \(m\times n\) matrices with matrix addition and scalar multiplication, or the set of all polynomials of degree at most \(n\), behave in a very similar fashion to Euclidean space.

Example 5.1

Show that the following are vector spaces:

(i) The set of real numbers over itself;

(ii) The set of all polynomials of degree at most \(n\) with real coefficients \(P_n(\mathbb{R})\)

\[ \begin{align*} p = p(x) := a_0x^0 + a_1x^1 + a_2x^2 + \cdots + a_nx^n = \sum_{i=0}^n a_ix^i, \end{align*} \]

where \(a_i \in \mathbb{R}\) and \(x\) is some variable.
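The two operations on \(P_n(\mathbb{R})\) can be sketched by storing a polynomial as its list of coefficients \([a_0, a_1, \ldots, a_n]\). This representation and the helper names `poly_add`, `poly_scale` and `poly_eval` are assumptions made for illustration, and the example takes \(n = 2\) with integer coefficients.

```python
def poly_add(p, q):
    """Add two polynomials by adding matching coefficients."""
    return [a + b for a, b in zip(p, q)]

def poly_scale(alpha, p):
    """Multiply a polynomial by the scalar alpha."""
    return [alpha * a for a in p]

def poly_eval(p, x):
    """Evaluate a_0 + a_1*x + ... + a_n*x^n at the point x."""
    return sum(a * x**i for i, a in enumerate(p))

p = [1, 0, 2]    # 1 + 2x^2
q = [0, 3, -2]   # 3x - 2x^2

total = poly_add(p, q)      # 1 + 3x, stored as [1, 3, 0]
tripled = poly_scale(3, p)  # 3 + 6x^2, stored as [3, 0, 6]

print(total)    # [1, 3, 0]
print(tripled)  # [3, 0, 6]
```

The sum here has degree 1 even though both summands have degree 2, which is why the set is defined with degree at most \(n\): polynomials of degree exactly \(n\) would not be closed under addition.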

5.2. Examples of non-vector spaces

Of course not all sets are vector spaces. For example, consider the set of integers \(\mathbb{Z}\) over the field \(F=\mathbb{R}\). It is easy to show that axioms A1 to A4 are satisfied for the set of integers. The problem comes when one tries to define scalar multiplication. By definition scalar multiplication is a mapping \(\mathbb{R} \times \mathbb{Z} \to \mathbb{Z}\), so for every \(u \in \mathbb{Z}\) and \(\alpha \in \mathbb{R}\) we would need \(\alpha \cdot u\in \mathbb{Z}\). However, this is not always the case: for example, when \(u=1\) and \(\alpha = \frac{1}{2}\) we have \(\frac{1}{2} \cdot 1 = \frac{1}{2} \notin \mathbb{Z}\). This is an example of a proof by counterexample, where we only need to show one instance in which the requirements are not satisfied to prove that we do not have a vector space.
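The counterexample can be reproduced in a few lines. This sketch assumes rational scalars as stand-ins for real numbers, and the function name `scalar_multiple_is_integer` is hypothetical.

```python
from fractions import Fraction

def scalar_multiple_is_integer(alpha, u):
    """Check whether alpha * u lands back in the integers."""
    result = Fraction(alpha) * u
    return result.denominator == 1

# alpha = 1/2 and u = 1 give 1/2, which is not an integer.
half_case = scalar_multiple_is_integer(Fraction(1, 2), 1)

# Some scalars do keep us inside the integers, e.g. alpha = 3 and u = 4.
whole_case = scalar_multiple_is_integer(Fraction(3), 4)

print(half_case)   # False
print(whole_case)  # True
```

A single failing pair is enough to rule out a vector space, even though many other pairs pass.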

Example 5.2

Show that the following are not vector spaces:

(i) The set of all positive real numbers, \(\mathbb{R}_+\) over itself;

(ii) \(V\) is defined to be the parabola \(y=x^2\) in \(\mathbb{R}^2\), i.e., the set of points \(V = \{(x, x^2) : x \in \mathbb{R}\}\).
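For part (ii), one possible counterexample can be checked numerically. The sketch assumes the usual componentwise addition in \(\mathbb{R}^2\), and the helper name `on_parabola` is hypothetical.

```python
def on_parabola(point):
    """Check whether the point (x, y) satisfies y = x^2."""
    x, y = point
    return y == x**2

u = (1, 1)  # on the parabola, since 1 = 1^2
v = (2, 4)  # on the parabola, since 4 = 2^2

# Componentwise sum of the two points.
total = (u[0] + v[0], u[1] + v[1])

print(on_parabola(u), on_parabola(v))  # True True
print(total)                           # (3, 5)
print(on_parabola(total))              # False, because 5 != 3^2
```

Two points of \(V\) add to a point outside \(V\), so addition is not a mapping \(V \times V \to V\).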
