2022-02-04 Stability

2022-02-04 Stability¶

Last time¶

Breaking Newton’s method
Exploration
Multiple roots
Conditioning of the rootfinding problem

Today¶

Forward and backward stability
Beyond IEEE double precision
Discuss portfolios

using Plots
default(linewidth=4, legendfontsize=12)

Which is better to model inputs to a rootfinder?¶

A: coefficients \(a_k\) in
\[p(x) = \prod_k (x - a_k)\]

B: coefficients \(b_k\) in
\[p(x) = \sum_k b_k x^k\]

Figure from Trefethen and Bau (1999)¶

Ill conditioning of roots of Wilkinson's polynomial

Forward vs backward error and stability ¶

Stability¶

“nearly the right answer to nearly the right question”

\[ \frac{\lvert \tilde f(x) - f(\tilde x) \rvert}{| f(\tilde x) |} \in O(\epsilon_{\text{machine}}) \]

for some \(\tilde x\) that is close to \(x\)

Backward Stability¶

“exactly the right answer to nearly the right question”

\[ \tilde f(x) = f(\tilde x) \]

for some \(\tilde x\) that is close to \(x\)

Every backward stable algorithm is stable.
Not every stable algorithm is backward stable.

Map angle to the unit circle¶

theta = 1 .+ LinRange(0, 3e-15, 100)
scatter(cos.(theta), sin.(theta), legend=:none, aspect_ratio=:equal)

../_images/2022-02-04-stability_10_0.svg

GKS: Possible loss of precision in routine SET_WINDOW

theta = LinRange(1., 1+2e-5, 100)
mysin(t) = cos(t - (1e10+1)*pi/2)
plot(cos.(theta), sin.(theta), aspect_ratio=:equal)
scatter!(cos.(theta), mysin.(theta))

../_images/2022-02-04-stability_11_0.svg

Are we observing A=stability, B=backward stability, or C=neither?¶

The numbers \((\widetilde\cos \theta, \widetilde\sin \theta = (\operatorname{fl}(\cos \theta), \operatorname{fl}(\sin\theta))\) do not lie exactly on the unit circles.
- There does not exist a \(\tilde\theta\) such that \((\widetilde\cos \theta, \widetilde \sin\theta) = (\cos\tilde\theta, \sin\tilde\theta)\)

Accuracy of backward stable algorithms (Theorem)¶

A backward stable algorithm for computing \(f(x)\) has relative accuracy

\[ \left\lvert \frac{\tilde f(x) - f(x)}{f(x)} \right\rvert \lesssim \kappa(f) \epsilon_{\text{machine}} . \]

Backward stability is generally the best we can hope for.

In practice, it is rarely possible for a function to be backward stable when the output space is higher dimensional than the input space.

Beyond IEEE-754 double precision¶

Lower precision IEEE-754 and relatives¶

The default floating point type in Python is double precision, which requires 8 bytes (64 bits) to store and offers \(\epsilon_{\text{machine}} \approx 10^{-16}\).

IEEE-754 single precision (float in C and related languages) is half the size (4 bytes = 32 bits) and has long been popular when adequate.
IEEE-754 half precision reduces both range (exponent bits) and precision (mantissa) bits.
bfloat16 is a relatively new non-IEEE type that is popular in machine learning because it’s easy to convert to/from single precision (just truncate/round the mantissa).

Type	\(\epsilon_{\text{machine}}\)	exponent bits	mantissa bits
double	1.11e-16	11	52
single	5.96e-8	8	23
half	4.88e-4	5	10
bfloat16	3.91e-3	8	7

Posits¶

Mixed-precision algorithms¶

Sometimes reducing the precision (from double to single, or from single to half) compromises the quality of results (due to ill-conditioning or poor stability), but one can recover accurate results by using higher precision in a small fraction of the computation. These are mixed-precision algorithms, and can be a useful optimization technique.

Such techniques can give up to 2x improvement if memory bandwidth (for floating point data) or pure vectorizable flops are the bottleneck. In case of single to half precision, the benefit can be greater than 2x when using special hardware, such as GPUs with “tensor cores”.

Warning: Premature use of mixed-precision techniques can often obscure better algorithms that can provide greater speedups and/or reliability improvements.

2022-01-31 Formulation

2022-02-07 Linear Algebra

Numerical Computation

2022-02-04 Stability

Contents

2022-02-04 Stability¶

Last time¶

Today¶

Which is better to model inputs to a rootfinder?¶

Figure from Trefethen and Bau (1999)¶

Forward vs backward error and stability ¶

Stability¶

Backward Stability¶

Map angle to the unit circle¶

Are we observing A=stability, B=backward stability, or C=neither?¶

Accuracy of backward stable algorithms (Theorem)¶

Beyond IEEE-754 double precision¶

Lower precision IEEE-754 and relatives¶

Posits¶

Mixed-precision algorithms¶

Numerical Computation

2022-02-04 Stability

Contents

2022-02-04 Stability¶

Last time¶

Today¶

Which is better to model inputs to a rootfinder?¶

Figure from Trefethen and Bau (1999)¶

Forward vs backward error and stability¶

Stability¶

Backward Stability¶

Map angle to the unit circle¶

Are we observing A=stability, B=backward stability, or C=neither?¶

Accuracy of backward stable algorithms (Theorem)¶

Beyond IEEE-754 double precision¶

Lower precision IEEE-754 and relatives¶

Posits¶

Mixed-precision algorithms¶

Forward vs backward error and stability ¶