Gaussian elimination

2.4. Gaussian elimination#

Gaussian elimination (GE) named after German mathematician Carl Friedrich Gauss is an algorithm for solving systems of linear equations. It is the most common method used in practice since it can by easily programmed into a computer and applied to larger systems. Consider the following method for solving a linear system of three equations in three unknowns:

\[\begin{split} \begin{align*} 3x_1 + x_2 - 2x_3 &= 1, \\ x_1 - x_2 + 2x_3 &= 3, \\ 2x_1 - 3x_2 + 7x_3 &= 4. \end{align*} \end{split}\]

Swap the first two equations around

\[\begin{split} \begin{align*} x_1 - x_2 + 2x_3 &= 3, \\ 3x_1 + x_2 - 2x_3 &= 1, \\ 2x_1 - 3x_2 + 7x_3 &= 4. \end{align*} \end{split}\]

Subtract 3 times the first equation from the second equation and 2 times the first equation from the third equation

\[\begin{split} \begin{align*} x_1 - x_2 + 2x_3 &= 3, \\ 4x_2 - 8x_3 &= -8, \\ - x_2 + 3x_3 &= -2. \end{align*} \end{split}\]

Multiply the second equation by \(\dfrac{1}{4}\)

\[\begin{split} \begin{align*} x_1 - x_2 + 2x_3 &= 3, \\ x_2 - 2 x_3 &= -2, \\ - x_2 + 3x_3 &= -2. \end{align*} \end{split}\]

Add the second equation to the third

\[\begin{split} \begin{align*} x_1 - x_2 + 2x_3 &= 3, \\ x_2 - 2 x_3 &= -2, \\ x_3 &= -4. \end{align*} \end{split}\]

Here the third equation gives the solution to \(x_3=-4\). We can substitute the value of \(x_3\) into the other two to find the solutions of \(x_2\) and \(x_1\)

\[\begin{split} \begin{align*} x_2 - 2(-4) &= -2 & \implies x_2 &= -10, \\ x_1 - (-10) + 2(-4) &= 3 & \implies x_1 &= 1. \end{align*} \end{split}\]

In this method, we used three types of operations on the equations in the system. These operations are known as elementary row operations.

Definition 2.2 (Elementary Row Operations (EROs))

The three elementary row operations that can be applied to a linear system of equations without changing the solution are

Type I: swap any two rows of the system;
Type II: multiply one row by a non-zero scalar;
Type III: replace a single row by itself plus a multiple of another row.

In the solution to the linear system of equations shown above we used a type I row operation in step 1, a type II row operation in step 3 and type III row operations in steps 2 and 4.

We can represent the linear system using matrices for convenience. We begin by expressing the linear system using an augmented matrix consisting of the concatenation of \(A\) and \(\vec{x}\) so any EROs that are applied to the augmented matrix are applied to the coefficients and the constant terms at the same time.

Definition 2.3 (Augmented matrix)

The augmented matrix is a representation of a system of linear equations \(A\vec{x}=\vec{b}\) such that the \(m\times n\) coefficient matrix \(A\) and right-hand side constant vector \(\vec{b}\) are combined into a single \(m\times (n+1)\) matrix \((A \mid \vec{b})\).

When writing the augmented matrix we often draw a partition separating \(A\) and \(\vec{b}\) (although this is not strictly necessary), i.e.,

\[\begin{split} \left( \begin{array}{cccc|c} a_{11} & a_{12} & \cdots & a_{1n} & b_1 \\ a_{21} & a_{22} & \cdots & a_{2n} & b_2 \\ \vdots & \vdots & \ddots & \vdots & \vdots \\ a_{m1} & a_{m2} & \cdots & a_{mn} & b_n \end{array} \right). \end{split}\]

Elementary row operations are applied to the augmented matrix so that we reduce it to what is known as row echelon form where the solution of the system can be easily calculated.

Definition 2.4 (Row Echelon Form (REF))

A matrix is said to be in Row Echelon Form (REF) if the following conditions are satisfied:

any non-zero rows are above any all-zero rows;
in each non-zero row, with the exception of the first row, the pivot element (the first non-zero element in the row) is to the right of the pivot element in the row above;

For example, the following matrices are in row echelon form and the red numbers are the pivot elements

\[\begin{split} \begin{align*} &\begin{pmatrix} \color{red}{1} & 2 \\ 0 & \color{red}{3} \end{pmatrix}, & &\begin{pmatrix} \color{red}{1} & 2 & 3 \\ 0 & 0 & \color{red}{4} \end{pmatrix}, & &\begin{pmatrix} 0 & \color{red}{1} & 3\\ 0 & 0 & \color{red}{4} \\ 0 & 0 & 0\end{pmatrix}. \end{align*} \end{split}\]

Note that the elements below the pivot elements are all zero.

2.4.1. Row reduction#

The process of transforming a matrix into row echelon form using elementary row operations is known as row reduction. For example, we will use Gaussian elimination to solve the following system of linear equations

\[\begin{split} \begin{align*} 3x_1 + x_2 - 2x_3 &= 1, \\ x_1 - x_2 + 2x_3 &= 3, \\ 2x_1 - 3x_2 + 7x_3 &= 4. \end{align*} \end{split}\]

Expressing this using an augmented matrix, we have

\[\begin{split} \left( \begin{array}{ccc|c} \color{red}{3} & 1 & -2 & 1 \\ \color{blue}{1} & -1 & 2 & 3 \\ \color{blue}{2} & -3 & 7 & 4 \end{array} \right). \end{split}\]

The first pivot element is in row 1 column 1 which has a value of 3. We need to apply row operations so that the elements in the column beneath the pivot element are zero. To do so we add a multiple of the pivot row to each of the rows beneath (a type III row operation). Since the pivot element is 3 and the first non-zero element in row 2 is 1, to reduce this to zero we subtract row 1 multiplied by \(\frac{1}{3}\) from row 2.

\[\begin{split} \begin{align*} \left( \begin{array}{ccc|c} \color{red}{ 3} & 1 & -2 & 1 \\ \color{blue}{ 1 - \frac{1}{3}(3)} & -1 - \frac{1}{3}(1) & 2 - \frac{1}{3}(-2) & 3 - \frac{1}{3}(1) \\ \color{blue}{ 2} & -3 & 7 & 4 \end{array} \right) & \longrightarrow \left( \begin{array}{ccc|c} \color{red}{ 3} & 1 & -2 & 1 \\ \color{blue}{ 0} & -4/3 & 8/3 & 8/3 \\ \color{blue}{ 2} & -3 & 7 & 4 \end{array} \right). \end{align*} \end{split}\]

We also need to do the same to row 3. Since the element row 3 column 1 has a value of 2, we need to subtract row 1 multiplied by \(\frac{2}{3}\) from row 3.

\[\begin{split} \left( \begin{array}{ccc|c} \color{red}{3} & 1 & -2 & 1 \\ \color{blue}{0} & -4/3 & 8/3 & 8/3 \\ \color{blue}{2 - \frac{2}{3}(3)} & -3 - \frac{2}{3}(1) & 7 - \frac{2}{3}(-2) & 4 - \frac{2}{3}(1) \end{array} \right) \longrightarrow \left( \begin{array}{ccc|c} \color{red}{3} & 1 & -2 & 1 \\ \color{blue}{0} & -4/3 & 8/3 & 8/3 \\ \color{blue}{0} & -11/3 & 25/3 & 10/3 \end{array} \right). \end{split}\]

Note that these two row operations could have been done simultaneously since changing the values in row 2 does not affect row 3 and vice-versa. Column 1 is now in row echelon form so we move to the next pivot element in row 2 which is \(-\frac{4}{3}\).

\[\begin{split} \left( \begin{array}{ccc|c} 3 & 1 & -2 & 1 \\ 0 & \color{red}{-4/3} & 8/3 & 8/3 \\ 0 & \color{blue}{-11/3} & 25/3 & 10/3 \end{array} \right). \end{split}\]

The element in row 3 column 2 has a value of \(-\frac{11}{3}\) and the pivot element has a value of \(-\frac{4}{3}\) so we need to subtract row 2 multiplied by \((-\frac{11}{3}) \div (-\frac{4}{3}) = \frac{11}{4}\) from row 3.

\[\begin{split} \left( \begin{array}{ccc|c} 3 & 1 & -2 & 1 \\ 0 & \color{red}{-4/3} & 8/3 & 8/3 \\ 0 & \color{blue}{-\frac{11}{3} - \frac{11}{4}(-\frac{4}{3})} & \frac{25}{3} - \frac{11}{4}(\frac{8}{3}) & \frac{10}{3} - \frac{11}{4}(\frac{8}{3}) \end{array} \right) \longrightarrow \left( \begin{array}{ccc|c} 3 & 1 & -2 & 1 \\ 0 & \color{red}{-4/3} & 8/3 & 8/3 \\ 0 & \color{blue}{0} & 1 & -4 \end{array} \right). \end{split}\]

Now the augmented matrix is in row echelon form. We can convert back to matrix form and express the linear system as three separate equations.

\[\begin{split} \begin{align*} \begin{pmatrix} 3 & 1 & -2 \\ 0 & -4/3 & 8/3 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} x_1 \\ x_2 \\ x_3 \end{pmatrix} = \begin{pmatrix} 1 \\ 8/3 \\ -4 \end{pmatrix} && \implies & \begin{array}{rcl} 3 x_1 + x_2 - 2 x_2 &=& 1, \\ -\frac{4}{3} x_2 - \frac{8}{3} x_3 &=& \frac{8}{3}, \\ x_3 &=& -4. \end{array} \end{align*} \end{split}\]

Since we have reduced the coefficient matrix to row echelon form we have a solution for the final variable. We can then substitute known values of the variables into the preceding equation to solve for the preceding variable. We continue in this way until we have solutions for all of the variables in the system. This step is known as back substitution. So for our system the final equation gives \(x_3=-4\) so substitution into the second equation gives

\[\begin{split} \begin{align*} -\frac{4}{3} x_2 + \frac{8}{3} (-4) &= \frac{8}{3} \\ -\frac{4}{3} x_2 &= \frac{40}{3} \\ \therefore x_2 &= -10, \end{align*} \end{split}\]

and substituting \(x_2\) and \(x_3\) into the first equation gives

\[\begin{split} \begin{align*} 3x_1 + (-10) - 2(-4) &= 1 \\ 3x_1 &= 3 \\ \therefore x_1 &= 1. \end{align*} \end{split}\]

In the interest of brevity, the following notation is used to denote the three types of EROs

Type I: swap row \(i\) and row \(j\): \(R_i \leftrightarrow R_j\);
Type II: multiply row \(i\) by the non-zero scalar \(k\): \(kR_i\);
Type III: add \(k\) times row \(j\) to row \(i\): \(R_i + kR_j\).

Since the EROs do not change the solution to the system of equations, it does not matter which EROs are applied to row reduce the augmented matrix. A common approach is to ensure the pivot elements have a value of 1 which can decrease the number of fractional values thus simplifying the calculations. For example, consider the following row reduction of the same augmented matrix as before.

\[\begin{split} \begin{align*} & \left( \begin{array}{ccc|c} 3 & 1 & -2 & 1 \\ 1 & -1 & 2 & 3 \\ 2 & -3 & 7 & 4 \end{array} \right) \begin{array}{l} R_1 \leftrightarrow R_2 \\ \phantom{x} \\ \phantom{x} \end{array} & \longrightarrow & \left( \begin{array}{ccc|c} 1 & -1 & 2 & 3 \\ 3 & 1 & -2 & 1 \\ 2 & -3 & 7 & 4 \end{array} \right) \begin{array}{l} \\ R_2 - 3 R_1 \\ R_3 - 2 R_1 \end{array} \\ \\ \longrightarrow & \left( \begin{array}{ccc|c} 1 & -1 & 2 & 3 \\ 0 & 4 & -8 & -8 \\ 0 & -1 & 3 & -2 \end{array} \right) \begin{array}{l} \\ \dfrac{1}{4}R_2 \\ \phantom{x} \end{array} & \longrightarrow & \left( \begin{array}{ccc|c} 1 & -1 & 2 & 3 \\ 0 & 1 & -2 & -2 \\ 0 & -1 & 3 & -2 \end{array} \right) \begin{array}{l} \\ \\ R_3 + R_2 \end{array} \\ \\ \longrightarrow & \left( \begin{array}{ccc|c} 1 & -1 & 2 & 3 \\ 0 & 1 & -2 & -2 \\ 0 & 0 & 1 & -4 \end{array} \right) \end{align*} \end{split}\]

Solving using back substitution gives \(x_1 = 1\), \(x_2 = -10\) and \(x_3 = -4\) which was the same solution as we saw before.

The steps used in Gaussian elimination is written in pseudocode in Algorithm 2.1.

Algorithm 2.1 (Gaussian elimination)

Inputs: An \(m \times n\) matrix \(A\).

Outputs: The matrix \(A\) in row echelon form

Start with the pivot element as the element in the first row and column of \(A\).
For each column of \(A\)
- If the pivot element is zero, perform a row swap with a row beneath the pivot row that has a non-zero element in the pivot column. If no such row exists then move to the next column.
- For each row beneath the pivot row
  - Subtract the pivot row multiplied by the value in the pivot column of the row divided by the pivot element from the current row.
Return \(A\)

Example 2.3

Use Gaussian-elimination to solve the following systems of linear equations:

(i) \( \begin{align*} x_1 + 2x_2 &= 7, \\ 3x_1 - 4x_2 &= 1. \end{align*} \)

(ii) \(\begin{align*} x_1 + x_3 &= 3, \\ -2x_1 + x_2 + 3x_3 &= 3, \\ -x_1 + 2x_2 + 4x_3 &= 5. \end{align*}\)

Gaussian elimination

Contents

2.4. Gaussian elimination#

2.4.1. Row reduction#