Artificial Intelligence 🤖
Matrices
Eigenvalues and Eigenvectors

Eigenvalues and Eigenvectors

This section is relevant as prerequisite material for systems of ODEs discussed in Chapter 14.

Definition: Eigenvalues and Eigenvectors

For a square matrix AA we have a nonzero vector x\boldsymbol{x} and a scalar λ\lambda such that

Ax=λx,A \boldsymbol{x}=\lambda \boldsymbol{x},

then λ\lambda is an eigenvalue of AA and x\boldsymbol{x} is the corresponding eigenvector.

We find eigenvalues and eigenvectors firstly by rewriting (10.88) as,

(AλI)x=0.(A-\lambda I) \boldsymbol{x}=\mathbf{0} .

This is a homogeneous equation and if AλIA-\lambda I is non-singular then the only solution is the trivial one given by x=0\boldsymbol{x}=\mathbf{0}. We therefore require that

det(AλI)=0\operatorname{det}(A-\lambda I)=0 \text {. }

The Characteristic Equation

The characteristic equation of a matrix AA is given by,

χA(λ)=det(AλI).\chi_{A}(\lambda)=\operatorname{det}(A-\lambda I) .

Consider the 2×22 \times 2 matrix,

A=(2103)A=\left(\begin{array}{ll} 2 & 1 \\ 0 & 3 \end{array}\right)

To find the eigenvalues of AA, we solve the characteristic equation (10.90). This is given by

2λ103λ=0\left|\begin{array}{cc} 2-\lambda & 1 \\ 0 & 3-\lambda \end{array}\right|=0

which yields a quadratic equation for λ\lambda,

(2λ)(3λ)=0(2-\lambda)(3-\lambda)=0

The solutions to (10.93) are λ1=2\lambda_{1}=2 and λ2=3\lambda_{2}=3. For each eigenvalue, we look for a corresponding eigenvector by solving for x\boldsymbol{x} in Eq. (10.89). We start with λ1=2\lambda_{1}=2 and define x=(x1,x2)\boldsymbol{x}=\left(x_{1}, x_{2}\right)^{\top} in Eq. (10.89) which gives us the matrix equation,

(0101)(x1x2)=(00).\left(\begin{array}{ll} 0 & 1 \\ 0 & 1 \end{array}\right)\left(\begin{array}{l} x_{1} \\ x_{2} \end{array}\right)=\left(\begin{array}{l} 0 \\ 0 \end{array}\right) .

Note that since we demanded that AλIA-\lambda I is singular, then we expect Eq. (10.94) to result in an under-determined system, i.e. 2 unknowns but only 1 independent equation. Equations (10.94) admit the solution x2=0x_{2}=0 and we are free to set x1x_{1} since the system is satisfied with any value for x1x_{1}. We choose to set x1=cx_{1}=c where cc is some arbitrary constant. This gives the eigenvector as

x=c(10)\boldsymbol{x}=c\left(\begin{array}{l} 1 \\ 0 \end{array}\right)

which represents an infinite set of solutions as cRc \in \mathbb{R}. Repeating the steps for the second eigenvalue, λ2=3\lambda_{2}=3, we have

(1100)(x1x2)=(00);\left(\begin{array}{cc} -1 & 1 \\ 0 & 0 \end{array}\right)\left(\begin{array}{l} x_{1} \\ x_{2} \end{array}\right)=\left(\begin{array}{l} 0 \\ 0 \end{array}\right) ;

this gives us the solution x1=x2x_{1}=x_{2} and again we are free to set x1=dx_{1}=d and hence x2=dx_{2}=d, where dRd \in \mathbb{R}. The eigenvector corresponding to λ2=3\lambda_{2}=3 therefore is

x=d(11).\boldsymbol{x}=d\left(\begin{array}{l} 1 \\ 1 \end{array}\right) .

Note that we can also write the eigenvectors simply as

(10),(11)\left(\begin{array}{l} 1 \\ 0 \end{array}\right), \quad\left(\begin{array}{l} 1 \\ 1 \end{array}\right)

and the understanding is that any constant multiple of the above eigenvectors is also an eigenvector.

Characteristic equation of a 2×22 \times 2 matrix

Recall that the trace of a square matrix is given by the sum of its diagonal entries [see Eq. (10.11)]. We may calculate the characteristic equation of a 2×22 \times 2 matrix using the trace and the determinant of the coefficient matrix AA,

det(AλI)=a11λa12a21a22λ=λ2(a11+a22)trace Aλ+a11a22a12a21detA=λ2τλ+Δ;\begin{aligned} \operatorname{det}(A-\lambda I) & =\left|\begin{array}{cc} a_{11}-\lambda & a_{12} \\ a_{21} & a_{22}-\lambda \end{array}\right|=\lambda^{2}-\underbrace{\left(a_{11}+a_{22}\right)}_{\text {trace } A} \lambda+\underbrace{a_{11} a_{22}-a_{12} a_{21}}_{\operatorname{det} A} \\ & =\lambda^{2}-\tau \lambda+\Delta ; \end{aligned}

where τ\tau and Δ\Delta are often used to denote the trace and determinant of a square matrix, respectively. The eigenvalues are then obtained from,

λ±=τ±τ24Δ2\lambda_{ \pm}=\frac{\tau \pm \sqrt{\tau^{2}-4 \Delta}}{2}

if τ24Δ<0\tau^{2}-4 \Delta<0 we have complex eigenvalues, if τ24Δ>0\tau^{2}-4 \Delta>0 we have real eigenvalues, and if τ24Δ=0\tau^{2}-4 \Delta=0 we have repeated eigenvalues. Note that in this course we focus on real coefficient matrices; it is still possible to find complex eigenvalues for real coefficient matrices but we need to take care when finding corresponding eigenvectors. Since the system we start with is real then the solutions to the real system must also be real. This case is discussed in more detail in Chapter 14 in the context of differential equations. Example 10.10 Find the eigenvalues and corresponding eigenvectors of the matrix,

(310121013)\left(\begin{array}{ccc} 3 & -1 & 0 \\ -1 & 2 & -1 \\ 0 & -1 & 3 \end{array}\right)

Solution We start with computing the characteristic equation from

3λ1012λ1013λ=(3λ)(2λ)(3λ)2(3λ)=(λ1)(λ3)(λ4)\begin{aligned} \left|\begin{array}{ccc} 3-\lambda & -1 & 0 \\ -1 & 2-\lambda & -1 \\ 0 & -1 & 3-\lambda \end{array}\right| & =(3-\lambda)(2-\lambda)(3-\lambda)-2(3-\lambda) \\ & =(\lambda-1)(\lambda-3)(\lambda-4) \end{aligned}

The eigenvalues are therefore 1,3 , and 4 . We now find the corresponding eigenvectors. For λ=4\lambda=4,

(A4I)=(110121011)(x1x2x3)=(000).(A-4 I)=\left(\begin{array}{ccc} -1 & -1 & 0 \\ -1 & -2 & -1 \\ 0 & -1 & -1 \end{array}\right)\left(\begin{array}{l} x_{1} \\ x_{2} \\ x_{3} \end{array}\right)=\left(\begin{array}{l} 0 \\ 0 \\ 0 \end{array}\right) .

This system gives the following equations

x1x2=0x12x2x3=0x2x3=0.\begin{aligned} -x_{1}-x_{2} & =0 \\ -x_{1}-2 x_{2}-x_{3} & =0 \\ -x_{2}-x_{3} & =0 . \end{aligned}

Again we note that the equations are not linearly independent; we can add the first and the third one to obtain the second one. We have

x1=x2,x3=x12x2=x1x_{1}=-x_{2}, \quad x_{3}=-x_{1}-2 x_{2}=x_{1}

by setting x1=cx_{1}=c, where cc is an arbitrary constant, we have the following eigenvector,

x=c(111)\boldsymbol{x}=c\left(\begin{array}{c} 1 \\ -1 \\ 1 \end{array}\right)

Following the same steps, the eigenvectors corresponding to the eigenvalues 1 and 3 are

x=d(121), and x=e(101)\boldsymbol{x}=d\left(\begin{array}{l} 1 \\ 2 \\ 1 \end{array}\right), \text { and } \boldsymbol{x}=e\left(\begin{array}{c} -1 \\ 0 \\ 1 \end{array}\right)

where dd and ee are constants.

Exercises

Find the eigenvalues and eigenvectors of the following matrices:

  1. (1/32/34/31/3)\left(\begin{array}{cc}-1 / 3 & 2 / 3 \\ 4 / 3 & 1 / 3\end{array}\right)
  2. (371/2107)\left(\begin{array}{cc}-3 & -71 / 2 \\ -10 & 7\end{array}\right)
  3. (3/21/23/21/23/21/2003)\left(\begin{array}{ccc}3 / 2 & 1 / 2 & 3 / 2 \\ 1 / 2 & 3 / 2 & -1 / 2 \\ 0 & 0 & 3\end{array}\right)
  4. (142257/2002)\left(\begin{array}{ccc}1 & -4 & -2 \\ 2 & -5 & -7 / 2 \\ 0 & 0 & 2\end{array}\right)
  5. (8/302/314/322/37/307/3)\left(\begin{array}{ccc}8 / 3 & 0 & -2 / 3 \\ 14 / 3 & -2 & -2 / 3 \\ -7 / 3 & 0 & 7 / 3\end{array}\right)