The barycentric formula
Contents
9.2. The barycentric formula#
The Lagrange formula (9.1.3) is useful theoretically but not ideal for computation. For each new value of \(x\), all of the cardinal functions \(\ell_k\) must be evaluated at \(x\), which requires a product of \(n\) terms. Thus the total work is \(O(n^2)\) for every value of \(x\). Moreover, the formula is numerically unstable. An alternative version of the formula improves on both issues.
Derivation#
We again will use the function \(\Phi\) from Definition 9.1.6,
We also define the barycentric weights
The following formula is the key to efficient and stable evaluation of a polynomial interpolant.
Given points \((t_k,y_k)\) for \(k=0,\ldots,n\) with all the \(t_k\) distinct, the unique polynomial of degree \(n\) or less that interpolates the points is
The Lagrange cardinal polynomial (9.1.2) can be written as
and thus the interpolating polynomial in (9.1.3) is
Obviously, the constant function \(p(x)\equiv 1\) is its own polynomial interpolant on any set of nodes. The uniqueness of the interpolating polynomial, as proved in Theorem 9.1.1, allows us to plug \(y_k=1\) for all \(k\) into (9.2.4) to obtain
This is solved for \(\Phi(x)\) and put back into (9.2.4) to get (9.2.2).
Equation (9.2.2) is certainly an odd-looking way to write a polynomial! Indeed, it is technically undefined when \(x\) equals one of the nodes, but in fact, \(\lim_{x\to t_k} p(x) = y_k\), so a continuous extension to the nodes is justified. (See Exercise 3.)
Let us write out the barycentric formula for the interpolating polynomial for the quadratic case (\(n=2\)) for Example 9.1.5. The weights are computed from (9.2.1):
and similarly, \(w_1 = -36/\pi^2\) and \(w_2=18/\pi^2\).
Note that in (9.2.2), any common factor in the weights cancels out without affecting the results. Hence it’s a lot easier to use \(w_0=w_2=1\) and \(w_1=-2\). Then
Further algebraic manipulation could return this expression to the classical Lagrange form derived in Example 9.1.5.
Implementation#
For certain important node distributions, simple formulas for the weights \(w_k\) are known. Otherwise, the first task of an implementation is to compute the weights \(w_k\), or more conveniently, \(w_k^{-1}\).
We begin with the singleton node set \(\{t_0\}\), for which one gets the single weight \(w_0=1\). The idea is to grow this singleton into the set of all nodes through a recursive formula. Define \(\omega_{k,m-1}\) (for \(k< m\)) as the inverse of the weight for node \(k\) using the set \(\{t_0,\ldots,t_{m-1}\}\). Then
A direct application of (9.2.1) can be used to find \(\omega_{m,m}\). This process is iterated over \(m=1,\ldots,n\) to find \(w_k=\omega_{k,n}^{-1}\).
In Function 9.2.3 we give an implementation of the barycentric formula for polynomial interpolation.
Polynomial interpolation by the barycentric formula
1"""
2 polyinterp(t,y)
3
4Construct a callable polynomial interpolant through the points in
5vectors `t`,`y` using the barycentric interpolation formula.
6"""
7function polyinterp(t,y)
8 n = length(t)-1
9 C = (t[n+1]-t[1]) / 4 # scaling factor to ensure stability
10 tc = t/C
11
12 # Adding one node at a time, compute inverses of the weights.
13 ω = ones(n+1)
14 for m in 0:n-1
15 d = tc[1:m+1] .- tc[m+2] # vector of node differences
16 @. ω[1:m+1] *= d # update previous
17 ω[m+2] = prod( -d ) # compute the new one
18 end
19 w = 1 ./ ω # go from inverses to weights
20
21 # This function evaluates the interpolant at given x.
22 p = function (x)
23 terms = @. w / (x - t)
24 if any(isinf.(terms)) # there was division by zero
25 # return the node's data value
26 idx = findfirst(x.==t)
27 f = y[idx]
28 else
29 f = sum(y.*terms) / sum(terms)
30 end
31 end
32 return p
33end
About the code
As noted in Example 9.2.2, a common scaling factor in the weights does not affect the barycentric formula (9.2.2). In lines 9–10 this fact is used to rescale the nodes in order to avoid eventual tiny or enormous numbers that could go outside the bounds of double precision.
The return value is a function that evaluates the polynomial interpolant. Within this function, isinf
is used to detect either Inf
or -Inf
, which occurs when \(x\) exactly equals one of the nodes. In this event, the corresponding data value is returned.
Computing all \(n+1\) weights in Function 9.2.3 takes \(O(n^2)\) operations. Fortunately, the weights depend only on the nodes, not the data, and once they are known, computing \(p(x)\) at a particular value of \(x\) takes just \(O(n)\) operations.
We show the barycentric formula for values from the function \(\sin(e^{2x})\) at equally spaced nodes in \([0,1]\) with \(n=3\) and \(n=6\).
f = x -> sin(exp(2*x))
plot(f,0,1,label="function",legend=:bottomleft)
t = (0:3)/3
y = f.(t)
scatter!(t,y,color=:black,label="nodes")
p = FNC.polyinterp(t,y)
plot!(p,0,1,label="interpolant",title="Interpolation on 4 nodes")
The curves must intersect at the interpolation nodes. For \(n=6\) the interpolant is noticeably better.
plot(f,0,1,label="function",legend=:bottomleft)
t = (0:6)/6
y = f.(t)
p = FNC.polyinterp(t,y)
scatter!(t,y,color=:black,label="nodes")
plot!(p,0,1,label="interpolant",title="Interpolation on 7 nodes")
Stability#
You might suspect that as the evaluation point \(x\) approaches a node \(t_k\), subtractive cancellation error will creep into the barycentric formula because of the term \(1/(x-t_k)\). While such errors do occur, they turn out not to cause trouble, because the same cancellation happens in the numerator and denominator. In fact, the stability of the barycentric formula has been proved, though we do not give the details.
Exercises#
✍ (a) Find the barycentric weights for the nodes \(t_0=0\), \(t_1=1\), \(t_2=3\).
(b) Compute the interpolant at \(x=2\) for the nodes in part (a) and the data \(y_0=-2\), \(y_1=2\), \(y_2=1\).
✍ For each case of Exercise 9.1.1, write out the barycentric form of the interpolating polynomial.
✍ Show using L’Hôpital’s rule on (9.2.2) that \(p(t_i)=y_i\) for all \(i=0,\ldots,n\).
⌨ In each case, use Function 9.2.3 to interpolate the given function using \(n+1\) evenly spaced nodes in the given interval. Plot each interpolant together with the exact function.
(a) \(f(x) = \ln (x), \quad n = 2,3,4, \quad x\in [1,10]\)
(b) \(f(x) = \tanh (x), \quad n = 2,3,4, \quad x \in [0-3,2]\)
(c) \(f(x) = \cosh (x), \quad n = 2,3,4, \quad x \in [-1,3]\)
(d) \(f(x) = |x|, \quad n = 3,5,7, \quad x \in [-2,1]\)
⌨ Using code from Function 9.2.3, compute the barycentric weights numerically using \(n+1\) equally spaced nodes in \([-1,1]\) for \(n=30\), \(n=60\), and \(n=90\). On a single graph, plot \(|w_i|\) as a function of \(t_i\) on a log-linear scale. (The resulting graphs are an indication of the trouble with equally spaced nodes that is explored in Section 9.3.)
✍ Derive this fact stated implicitly in (9.2.1):
\[\begin{split} \Phi'(x_k) = \prod_{\substack{j=0\\j\neq k}}^n (t_k - t_j). \end{split}\]✍ Use (9.2.3) to show that if \(j\neq k\),
\[ \ell_k'(x_j) = \frac{w_k}{w_j(x_j-x_k)}. \]