11.3. Absolute stability¶

In Section 11.2 we applied several different time stepping methods to a linear, constant coefficient problem in the form

\[ \mathbf{u}'(t)=\mathbf{A}\mathbf{u}(t). \]

All of these methods are zero-stable in the sense of Section 6.8, in the limit as the time step size \(\tau \to 0\).1 Yet for some experiments with fixed \(\tau\), as in Demo 11.2.3, we have observed exponential growth in the different limit \(n\to \infty\).

Observe that if \(\mathbf{A}\) has the eigenvalue decomposition \(\mathbf{A}=\mathbf{V}\mathbf{D}\mathbf{V}^{-1}\), then

\[\begin{align*} \mathbf{u}'&=(\mathbf{V}\mathbf{D}\mathbf{V}^{-1})\mathbf{u}\\ (\mathbf{V}^{-1} \mathbf{u}') &= \mathbf{D} (\mathbf{V}^{-1} \mathbf{u}) \\ \mathbf{y}' &= \mathbf{D} \mathbf{y}, \end{align*}\]

where \(\mathbf{y}(t)=\mathbf{V}^{-1}\mathbf{u}(t)\). Because \(\mathbf{D}\) is diagonal, the dynamics of the components of \(\mathbf{y}\) are completely decoupled: each row is a self-contained equation of the form \(y_j'=\lambda_j y_j\), where \(\lambda_j\) is an eigenvalue of \(\mathbf{A}\).

The diagonalization argument suggests that we can look at the scalar problems

(11.3.1)¶\[y' = \lambda y, \quad y(0)=1,\]

arising from the eigenvalues. These eigenvalues may not be real numbers, so in this section, \(i\) stands for the imaginary unit, not an integer index. If we write \(\lambda\) in real and imaginary parts as \(\lambda=\alpha + i\beta\), then by Euler’s identity, the exact solution of (11.3.1) has magnitude

\[ \bigl |e^{(\alpha+i\beta)t} \bigr| = \bigl |e^{\alpha t} \bigr| \cdot \bigl |e^{i \beta t} \bigr| = e^{\alpha t}. \]

Observation 11.3.1

Solutions of (11.3.1) are bounded as \(t\to\infty\) if and only if \(\alpha = \operatorname{Re} \lambda \le 0\).

We now consider the counterpart of this observation for the solution produced by a numerical IVP solver.

Definition 11.3.2 : Absolute stability

Let \(\lambda\) be a complex number, and let \(y_0,y_1,y_2,\ldots,y_n\) be the numerical solution at times \(0,\tau,2\tau,\ldots,n\tau\) of (11.3.1) using a Runge–Kutta or multistep method with fixed stepsize \(\tau\). Then the method is said to be absolutely stable at \(\zeta = \tau\lambda\) if \(|y_n|\) is bounded above as \(n\to\infty\).

The fact that absolute stability depends only on the product \(\zeta = \tau\lambda\), and not independently on the individual factors, is a result of how the IVP solvers are defined, as we will see below. Since \(\lambda\) has units of inverse time according to (11.3.1), \(\zeta\) is dimensionless.

Stability regions¶

Each numerical IVP solver has its own collection of \(\zeta\) values for which it is absolutely stable.

Definition 11.3.3 : Stability region

The stability region of an IVP solver is the collection of all \(\zeta\in\complex\) for which the method is absolutely stable.

Example 11.3.4

Consider an Euler discretization of \(y'=\lambda y\):

\[ y_{k+1} = y_k + \tau( \lambda y_k) = (1+ \zeta ) y_k. \]

Given that \(y_0=1\) by (11.3.1), we easily deduce that \(y_k = (1+\zeta)^k\) for all \(k\), and therefore

\[ |y_k| = |1+\zeta|^k. \]

Hence \(|y_k|\) remains bounded above as \(k\to \infty\) if and only if \(|1+\zeta| \le 1\). Because \(\zeta\) is a complex number, it’s easiest to interpret this condition geometrically:

\[ |\zeta + 1 | = |\zeta - (-1) | \le 1. \]

That is, the distance in the plane from \(\zeta\) to the point \(-1\) is less than or equal to one. This description defines a closed disk of radius 1 centered at \((-1,0)\).

Example 11.3.5

The backward Euler method discretizes (11.3.1) as

\[ y_{k+1} = y_k + \tau( \lambda y_{k+1}) \quad \Rightarrow \quad y_{k+1} = \frac{1}{1-\zeta} y_k. \]

Therefore, \(y_k=(1-\zeta)^{-k}\) for all \(k\), and absolute stability requires \(|1-\zeta|^{-1} \le 1\), or

\[ |\zeta-1|\ge 1. \]

This inequality describes the region outside of the open disk of radius 1 centered at \(1\) on the real axis of the complex plane.

Example 11.3.6

The improved Euler method IE2 defined in (6.4.4) discretizes (11.3.1) as

\[{y}_{i+1} = y_i + \zeta \left( y_i + \tfrac{1}{2}\zeta y_i \right) = (1 + \zeta + \tfrac{1}{2}\zeta^2) y_i.\]

The stability region consists of all \(\zeta\) such that \(| 1 + \zeta + \tfrac{1}{2}\zeta^2 | \le 1\). Although it is not elementary to describe this region geometrically, its boundary points satisfy

\[ 1 - e{i\theta} + \zeta + \tfrac{1}{2}\zeta^2 = 0 \]

for some real \(\theta\), and thus we can use the quadratic formula to find all the boundary points.

Stability regions for the most common IVP integrators are given in Fig. 11.3.1 and Fig. 11.3.2. Note that the implicit Adams-Moulton methods are larger than those for the explicit Adams-Bashforth methods of the same order. For the implicit backward differentiation methods, the exteriors of the curves provide large regions of stability, but significant portions of the imaginary axis may be excluded. Finally, while the single-step Runge-Kutta methods have smaller regions of stability, those of orders 3 and 4 do include significant portions of the imaginary axis.

Fig. 11.3.1 Stability regions for Adams–Bashforth methods of order 1–4 (left) and Adams–Moulton methods of order 2–5 (right). The plots are in the complex \(\zeta\)-plane.¶

Fig. 11.3.2 Stability regions for backward differentiation methods of order 1–4 (left, exteriors of curves) and Runge–Kutta methods of order 1–4 (right). The plots are in the complex \(\zeta\)-plane.¶

For any particular method and value of \(\lambda\) in (11.3.1), we can use the stability region to deduce which, if any, values of the time step \(\tau\) will give bounded solutions. Both the magnitude and the argument (angle) of \(\lambda\) play a role in determining such constraints.

Example 11.3.7

Suppose \(\lambda=-4\) and Euler’s method is applied. Since the time step is always positive, \(\zeta=-4\tau\) is always on the negative real axis. The only part of that line that lies within the stability region of Euler as derived in Example 11.3.4 is the real interval \([-2,0]\). Hence we require \(\zeta\ge -2\), or \(\tau \le 1/2\). By contrast, the stability region of backward Euler includes the entire negative real axis, so absolute stability is unconditional, i.e., assured regardless of \(\tau\).

Now suppose instead that \(\lambda=i\), so that \(\zeta=i\tau\). Clearly \(\zeta\) is always on the positive imaginary axis. But no part of this axis, aside from the origin, lies in the stability region of Euler’s method, so it is unconditionally unstable in this circumstance. The conclusion for backward Euler is the opposite; any value of \(\tau\) will do, because the entire imaginary axis is within the stability region.

Example 11.3.7 does not contradict our earlier statements about the zero stability and convergence of Euler’s method in general, even for the case \(\lambda=i\). But those statements are based on the limit \(\tau\to 0\) for \(t\) in a finite interval \([a,b]\). Both this limit and the limit \(t\to \infty\) imply the number of steps \(n\) goes to infinity, but the limits behave differently.

The fact that implicit methods have larger stability regions than their explicit counterparts is the primary justification for using them. While they have larger work requirements per step, they sometimes can take steps that are orders of magnitude larger than explicit methods and still remain stable.

When adaptive time stepping methods are used, as in most software for IVPs, the automatically determined time step is chosen to satisfy absolute stability requirements (otherwise errors grow exponentially). This phenomenon was manifested in Demo 11.2.6: in the explicit IVP method rk23, error control forced tiny step sizes compared to those used by Rodas4P, which is based on implicit methods.

Heat equation¶

Now we return to the semidiscretization (11.2.3) of the heat equation, which was solved by Euler in Demo 11.2.3 and backward Euler in Demo 11.2.5.

Demo 11.3.8

Both time stepping methods solved \(\mathbf{u}'=\mathbf{D}_{xx}\mathbf{u}\).

m = 40;  _,_,Dₓₓ = FNC.diffper(m,[0,1]);

The eigenvalues of this matrix are real and negative:

λ = eigvals(Dₓₓ)
scatter(real(λ),imag(λ),title="Eigenvalues",frame=:zerolines,
    xaxis=("Re λ"),yaxis=("Im λ",(-1000,1000)),aspect_ratio=1)

The Euler method is absolutely stable in the region \(|\zeta+1| \le 1\) in the complex plane:

phi = 2π*(0:360)/360
z = @. exp(1im*phi) - 1;   # unit circle shifted to the left by 1

plot(Shape(real(z),imag(z)),color=RGB(.8,.8,1),
    xaxis=("Re ζ"),yaxis=("Im ζ"),aspect_ratio=1,
    title="Stability region",frame=:zerolines) 

In order to get inside this region, we have to find \(\tau\) such that \(\lambda \tau > -2\) for all eigenvalues \(\lambda\). This is an upper bound on \(\tau\).

λ_min = minimum(λ)
@show max_τ = -2 / λ_min;

max_τ = -2 / λ_min = 0.0003125

Here we plot the resulting values of \(\zeta=\lambda \tau\).

ζ = λ*max_τ
scatter!(real(ζ),imag(ζ),title="Stability region and ζ values")

In backward Euler, the region is \(|\zeta-1|\ge 1\). Because they are all on the negative real axis, all of the \(\zeta\) values will fit no matter what \(\tau\) is chosen.

plot(Shape([-6,6,6,-6],[-6,-6,6,6]),color=RGB(.8,.8,1))

z = @. exp(1im*phi) + 1;   # unit circle shifted right by 1
plot!(Shape(real(z),imag(z)),color=:white)

scatter!(real(ζ),imag(ζ),
    xaxis=([-4,2],"Re ζ"),yaxis=([-3,3],"Im ζ"),aspect_ratio=1,
    title="Stability region and ζ values",frame=:zerolines)

The matrix \(\mathbf{D}_{xx}\) occurring in (11.2.3) for semidiscretization of the periodic heat equation has eigenvalues that can be found explicitly. Assuming that \(x\in[0,1)\) (with periodic boundary conditions), for which \(h=1/m\), then the eigenvalues are

(11.3.2)¶\[\lambda_j = -4m^2 \sin^2 \frac{j\pi}{m}, \qquad j = 0,\ldots,m-1. \]

This result agrees with the observation in Demo 11.3.8 that the eigenvalues are real and negative. Furthermore, they lie within the interval \([-4m^2,0]\). In Euler time integration, this implies that \(-4\tau m^2\ge -2\), or \(\tau\ge 1/(2m^2)=O(m^{-2})\). For backward Euler, there is no time step restriction, and we say that backward Euler is unconditionally stable for this problem.

In summary, three things happen as \(h\to 0\):

The spatial discretization becomes more accurate like \(O(h^2)\).
The size of the matrix increases like \(O(h^{-1})\).
If we use an explicit time stepping method, then absolute stability requires \(O(h^{-2})\) steps.

The last restriction becomes rather burdensome as \(h\to 0\), i.e., as we improve the spatial discretization, which is why implicit methods are preferred for diffusion. While any convergent IVP solver will get the right solution as \(\tau\to 0\), the results are exponentially large nonsense until \(\tau\) is small enough to satisfy absolute stability.

Exercises¶

✍ Use an eigenvalue decomposition to write the system

\[\begin{split} \mathbf{u}'(t) = \begin{bmatrix} 0 & 4 \\ -4 & 0 \end{bmatrix} \mathbf{u}(t) \end{split}\]

as an equivalent diagonal system.
✍ For each system, state whether its solutions are bounded as \(t\to \infty\).

(a) \(\mathbf{u}'(t) = \displaystyle \begin{bmatrix} 1 & 3 \\ 3 & 1 \end{bmatrix} \mathbf{u}(t)\)

(b) \(\mathbf{u}'(t) = \displaystyle \begin{bmatrix} -1 & 3 \\ -3 & -1 \end{bmatrix} \mathbf{u}(t)\)

(c) \(\mathbf{u}'(t) = \displaystyle \begin{bmatrix} 0 & 4 \\ -4 & 0 \end{bmatrix} \mathbf{u}(t)\)
✍ Using Fig. 11.3.1 and Fig. 11.3.2, estimate the time step restriction (if any) for the system

\[\begin{split} \mathbf{u}'(t) = \begin{bmatrix} -4 & 0 & 0 \\ 0 & -2 & 0 \\ 0 & 0 & -0.5 \end{bmatrix} \mathbf{u}(t) \end{split}\]

for the following IVP methods:

(a) RK4 \(\qquad\) (b) AM4 \(\qquad\) (c) AB2
✍ Using Fig. 11.3.1 and Fig. 11.3.2, find the time step restriction (if any) for the system

\[\begin{split} \mathbf{u}'(t) = \begin{bmatrix} -1 & 0 & 0 \\ 0 & 0 & 4 \\ 0 & -4 & 0 \end{bmatrix} \mathbf{u}(t) \end{split}\]

for the following IVP methods:

(a) RK4 \(\qquad\) (b) AM4 \(\qquad\) (c) AB3
✍ Of the following methods, which would be unsuitable for a problem having eigenvalues on the imaginary axis? Justify your answer(s).

(a) AM2 \(\qquad\) (b) AB2 \(\qquad\) (c) RK2 \(\qquad\) (d) RK3
✍ Of the following methods, which would have a time step restriction for a problem with real, negative eigenvalues? Justify your answer(s).

(a) AM2 \(\qquad\) (b) AM4 \(\qquad\) (c) BD4 \(\qquad\) (d) RK4
✍ Let \(\mathbf{D}_{xx}\) be \(m\times m\) and given by (11.2.2). For any integer \(k \in \{0,\ldots,m-1\}\), define \(\omega = \exp(2i k\pi/m)\) and \(\mathbf{v} = \bigl[ 1,\; \omega,\; \omega^2,\; \ldots,\; \omega^{m-1} \bigr].\) Show that \(\mathbf{v}\) is an eigenvector of \(\mathbf{D}_{xx}\), with eigenvalue

\[ \lambda = -4m^2 \sin^2 \left( \frac{k\pi}{m} \right). \]

(This establishes that the eigenvalues all lie within the real interval \([-4m^2,0]\).)
✍ (a) Derive an algebraic inequality equivalent to absolute stability for the AM2 (trapezoid) formula.

✍ (b) Argue that the inequality in part (a) is equivalent to the restriction \(\operatorname{Re}(\zeta)\le 0\). (Hint: Complex magnitude is equivalent to distance in the plane.)

1: In Chapter 6 we used \(h\) rather than \(\tau\) to denote the time step size, but now we reserve \(h\) for spacing in the \(x\) direction.

Fundamentals of Numerical Computation

Absolute stability

Contents

11.3. Absolute stability¶

Stability regions¶

Heat equation¶

Exercises¶