Propagation of uncertainty

For the propagation of uncertainty through time, see Chaos theory § Sensitivity to initial conditions.

In statistics, propagation of uncertainty (or propagation of error) is the effect of variables' uncertainties (or errors, more specifically random errors) on the uncertainty of a function based on them. When the variables are the values of experimental measurements they have uncertainties due to measurement limitations (e.g., instrument precision) which propagate to the combination of variables in the function.

The uncertainty u can be expressed in a number of ways. It may be defined by the absolute error $Δ x$ . Uncertainties can also be defined by the relative error $(Δ x)/ x$ , which is usually written as a percentage. Most commonly, the uncertainty on a quantity is quantified in terms of the standard deviation, $σ$ , the positive square root of variance, $σ 2$ . The value of a quantity and its error are then expressed as an interval $x \pm u$ . If the statistical probability distribution of the variable is known or can be assumed, it is possible to derive confidence limits to describe the region within which the true value of the variable may be found. For example, the 68% confidence limits for a one-dimensional variable belonging to a normal distribution are approximately ± one standard deviation $σ$ from the central value $x$ , which means that the region $x \pm σ$ will cover the true value in roughly 68% of cases.

If the uncertainties are correlated then covariance must be taken into account. Correlation can arise from two different sources. First, the measurement errors may be correlated. Second, when the underlying values are correlated across a population, the uncertainties in the group averages will be correlated.^[1]

Linear combinations

Let $\{f_{k}(x_{1},x_{2},\dots ,x_{n})\}$ be a set of m functions which are linear combinations of $n$ variables $x_{1},x_{2},\dots ,x_{n}$ with combination coefficients $A_{{k1}},A_{{k2}},\dots ,A_{{kn}},(k=1\dots m)$ .

f_{k}=\sum _{i=1}^{n}A_{ki}x_{i}{\text{ or }}\mathrm {f} =\mathrm {Ax} \,

and let the variance-covariance matrix on x be denoted by ${\mathrm {\Sigma ^{x}}}\,$ .

{\mathrm {\Sigma ^{x}}}={\begin{pmatrix}\sigma _{1}^{2}&\sigma _{{12}}&\sigma _{{13}}&\cdots \\\sigma _{{12}}&\sigma _{2}^{2}&\sigma _{{23}}&\cdots \\\sigma _{{13}}&\sigma _{{23}}&\sigma _{3}^{2}&\cdots \\\vdots &\vdots &\vdots &\ddots \\\end{pmatrix}}={\begin{pmatrix}{\mathit {\Sigma }}_{1}^{x}&{\mathit {\Sigma }}_{{12}}^{x}&{\mathit {\Sigma }}_{{13}}^{x}&\cdots \\{\mathit {\Sigma }}_{{12}}^{x}&{\mathit {\Sigma }}_{2}^{x}&{\mathit {\Sigma }}_{{23}}^{x}&\cdots \\{\mathit {\Sigma }}_{{13}}^{x}&{\mathit {\Sigma }}_{{23}}^{x}&{\mathit {\Sigma }}_{3}^{x}&\cdots \\\vdots &\vdots &\vdots &\ddots \\\end{pmatrix}}

Then, the variance-covariance matrix ${\mathrm {\Sigma ^{f}}}\,$ of f is given by

\mathit{\Sigma}^f_{ij}= \sum_k^n \sum_\ell^n A_{ik} \mathit{\Sigma}^x_{k\ell} A_{j\ell}

or, in matrix notation:

\mathrm{\Sigma^f= A \Sigma^x A^\top}

This is the most general expression for the propagation of error from one set of variables onto another. When the errors on x are uncorrelated the general expression simplifies to

{\mathit {\Sigma }}_{{ij}}^{f}=\sum _{k}^{n}A_{{ik}}{\mathit {\Sigma }}_{k}^{x}A_{{jk}}.

where ${\mathit {\Sigma }}_{k}^{x}=\sigma _{{x_{k}}}^{2}$ is the variance of k-th element of the x vector. Note that even though the errors on x may be uncorrelated, the errors on f are in general correlated; in other words, even if ${\mathrm {\Sigma ^{x}}}$ is a diagonal matrix, ${\mathrm {\Sigma ^{f}}}$ is in general a full matrix.

The general expressions for a scalar-valued function, f, are a little simpler.

f=\sum _{i}^{n}a_{i}x_{i}:f={\mathrm {ax}}\,

\sigma _{f}^{2}=\sum _{i}^{n}\sum _{j}^{n}a_{i}{\mathit {\Sigma }}_{{ij}}^{x}a_{j}={\mathrm {a\Sigma ^{x}a^{\top }}}

(where a is a row-vector).

Each covariance term, $\sigma _{ij}$ can be expressed in terms of the correlation coefficient $\rho _{{ij}}\,$ by $\sigma _{{ij}}=\rho _{{ij}}\sigma _{i}\sigma _{j}\,$ , so that an alternative expression for the variance of f is

\sigma _{f}^{2}=\sum _{i}^{n}a_{i}^{2}\sigma _{i}^{2}+\sum _{i}^{n}\sum _{{j(j\neq i)}}^{n}a_{i}a_{j}\rho _{{ij}}\sigma _{i}\sigma _{j}.

In the case that the variables in x are uncorrelated this simplifies further to

\sigma _{{f}}^{{2}}=\sum _{i}^{n}a_{{i}}^{{2}}\sigma _{{i}}^{{2}}.

In the simplest case of identical coefficients and variances, we find

\sigma _{{f}}={\sqrt {n}}a\sigma .

Non-linear combinations

When f is a set of non-linear combination of the variables x, an interval propagation could be performed in order to compute intervals which contain all consistent values for the variables. In a probabilistic approach, the function f must usually be linearized by approximation to a first-order Taylor series expansion, though in some cases, exact formulas can be derived that do not depend on the expansion as is the case for the exact variance of products.^[2] The Taylor expansion would be:

f_{k}\approx f_{k}^{0}+\sum _{i}^{n}{\frac {\partial f_{k}}{\partial {x_{i}}}}x_{i}

where $\partial f_{k}/\partial x_{i}$ denotes the partial derivative of f_k with respect to the i-th variable, evaluated at the mean value of all components of vector x. Or in matrix notation,

{\mathrm {f}}\approx {\mathrm {f}}^{0}+{\mathrm {J}}{\mathrm {x}}\,

where J is the Jacobian matrix. Since f⁰ is a constant it does not contribute to the error on f. Therefore, the propagation of error follows the linear case, above, but replacing the linear coefficients, A_ik and A_jk by the partial derivatives, ${\frac {\partial f_{k}}{\partial x_{i}}}$ and ${\frac {\partial f_{k}}{\partial x_{j}}}$ . In matrix notation, ^[3]

\mathrm {\Sigma } ^{\mathrm {f} }=\mathrm {J} \mathrm {\Sigma } ^{\mathrm {x} }\mathrm {J} ^{\top }.

That is, the Jacobian of the function is used to transform the rows and columns of the variance-covariance matrix of the argument. Note this is equivalent to the matrix expression for the linear case with $\mathrm{J = A}$ .

Simplification

Neglecting correlations or assuming independent variables yields a common formula among engineers and experimental scientists to calculate error propagation, the variance formula:^[4]

$s_{f}={\sqrt {\left({\frac {\partial f}{\partial x}}\right)^{2}s_{x}^{2}+\left({\frac {\partial f}{\partial y}}\right)^{2}s_{y}^{2}+\left({\frac {\partial f}{\partial z}}\right)^{2}s_{z}^{2}+\cdots }}$

where $s_{f}$ represents the standard deviation of the function $f$ , $s_{x}$ represents the standard deviation of $x$ , $s_{y}$ represents the standard deviation of $y$ , and so forth.

It is important to note that this formula is based on the linear characteristics of the gradient of $f$ and therefore it is a good estimation for the standard deviation of $f$ as long as $s_{x},s_{y},s_{z},\ldots$ are small compared to the partial derivatives.^[5]

Example

Any non-linear differentiable function, f(a,b), of two variables, a and b, can be expanded as

f\approx f^{0}+{\frac {\partial f}{\partial a}}a+{\frac {\partial f}{\partial b}}b

hence:

\sigma _{f}^{2}\approx \left|{\frac {\partial f}{\partial a}}\right|^{2}\sigma _{a}^{2}+\left|{\frac {\partial f}{\partial b}}\right|^{2}\sigma _{b}^{2}+2{\frac {\partial f}{\partial a}}{\frac {\partial f}{\partial b}}\sigma _{ab}.

In the particular case that $f=ab\!$ , ${\frac {\partial f}{\partial a}}=b,{\frac {\partial f}{\partial b}}=a$ . Then

\sigma _{f}^{2}\approx b^{2}\sigma _{a}^{2}+a^{2}\sigma _{b}^{2}+2ab\,\sigma _{{ab}}

\left({\frac {\sigma _{f}}{f}}\right)^{2}\approx \left({\frac {\sigma _{a}}{a}}\right)^{2}+\left({\frac {\sigma _{b}}{b}}\right)^{2}+2\left({\frac {\sigma _{a}}{a}}\right)\left({\frac {\sigma _{b}}{b}}\right)\rho _{ab}.

Caveats and warnings

Error estimates for non-linear functions are biased on account of using a truncated series expansion. The extent of this bias depends on the nature of the function. For example, the bias on the error calculated for log x increases as x increases, since the expansion to 1 + x is a good approximation only when x is small.

For highly non-linear functions, there exist five categories of probabilistic approaches for uncertainty propagation;^[6] see Uncertainty Quantification#Methodologies for forward uncertainty propagation for details.

Reciprocal

In the special case of the inverse or reciprocal $1/B$ , where $B=N(0,1)$ , the distribution is a reciprocal normal distribution, and there is no definable variance. For such inverse distributions and for ratio distributions, there can be defined probabilities for intervals, which can be computed either by Monte Carlo simulation or, in some cases, by using the Geary–Hinkley transformation.^[7]

Shifted reciprocal

The statistics, mean and variance, of the shifted reciprocal function ${\frac {1}{p-B}}$ for $B=N(\mu ,\sigma )$ , however, exist in a principal value sense if the difference between the shift or pole $p$ and the mean $\mu$ is real. The mean of this transformed random variable is then indeed the scaled Dawson's function ${\frac {{\sqrt {2}}}{\sigma }}F\left({\frac {p-\mu }{{\sqrt {2}}\sigma }}\right)$ .^[8] In contrast, if the shift $p-\mu$ is purely complex, the mean exists and is a scaled Faddeeva function, whose exact expression depends on the sign of the imaginary part, $\operatorname {Im}(p-\mu )$ . In both cases, the variance is a simple function of the mean.^[9] Therefore, the variance has to be considered in a principal value sense if $p-\mu$ is real, while it exists if the imaginary part of $p-\mu$ is non-zero. Note that these means and variances are exact, as they do not recur to linearisation of the ratio. The exact covariance of two ratios with a pair of different poles $p_{1}$ and $p_{2}$ is similarly available.^[10] The case of the inverse of a complex normal variable $B$ , shifted or not, exhibits different characteristics.^[8]

Example formulas

This table shows the variances of simple functions of the real variables $A,B\!$ , with standard deviations $\sigma _{A},\sigma _{B},\,$ covariance $\sigma _{{AB}}$ and exactly known real-valued constants $a,b\,$ (i.e., $\sigma _{a}=\sigma _{b}=0$ ).

Function	Variance	Standard Deviation
$f=aA\,$	$\sigma _{f}^{2}=a^{2}\sigma _{A}^{2}$	$\sigma_f = \|a\|\sigma_A$
$f=aA+bB\,$	$\sigma _{f}^{2}=a^{2}\sigma _{A}^{2}+b^{2}\sigma _{B}^{2}+2ab\,\sigma _{{AB}}$	$\sigma _{f}={\sqrt {a^{2}\sigma _{A}^{2}+b^{2}\sigma _{B}^{2}+2ab\,\sigma _{{AB}}}}$
$f=aA-bB\,$	$\sigma _{f}^{2}=a^{2}\sigma _{A}^{2}+b^{2}\sigma _{B}^{2}-2ab\,\sigma _{{AB}}$	$\sigma _{f}={\sqrt {a^{2}\sigma _{A}^{2}+b^{2}\sigma _{B}^{2}-2ab\,\sigma _{{AB}}}}$
$f=AB\,$	$\sigma _{f}^{2}\approx f^{2}\left[\left({\frac {\sigma _{A}}{A}}\right)^{2}+\left({\frac {\sigma _{B}}{B}}\right)^{2}+2{\frac {\sigma _{AB}}{AB}}\right]$ ^[11]^[12]	$\sigma _{f}\approx \left\|f\right\|{\sqrt {\left({\frac {\sigma _{A}}{A}}\right)^{2}+\left({\frac {\sigma _{B}}{B}}\right)^{2}+2{\frac {\sigma _{AB}}{AB}}}}$
$f={\frac {A}{B}}\,$	$\sigma _{f}^{2}\approx f^{2}\left[\left({\frac {\sigma _{A}}{A}}\right)^{2}+\left({\frac {\sigma _{B}}{B}}\right)^{2}-2{\frac {\sigma _{{AB}}}{AB}}\right]$ ^[13]	$\sigma _{f}\approx \left\|f\right\|{\sqrt {\left({\frac {\sigma _{A}}{A}}\right)^{2}+\left({\frac {\sigma _{B}}{B}}\right)^{2}-2{\frac {\sigma _{{AB}}}{AB}}}}$
$f=aA^{{b}}\,$	$\sigma _{f}^{2}\approx \left({a}{b}{A}^{{b-1}}{\sigma _{A}}\right)^{2}=\left({\frac {{f}{b}{\sigma _{A}}}{A}}\right)^{2}$	$\sigma _{f}\approx \left\|{a}{b}{A}^{{b-1}}{\sigma _{A}}\right\|=\left\|{\frac {{f}{b}{\sigma _{A}}}{A}}\right\|$
$f=a\ln(bA)\,$	$\sigma _{f}^{2}\approx \left(a{\frac {\sigma _{A}}{A}}\right)^{2}$ ^[14]	$\sigma _{f}\approx \left\|a{\frac {\sigma _{A}}{A}}\right\|$
$f=a\log _{{10}}(A)\,$	$\sigma _{f}^{2}\approx \left(a{\frac {\sigma _{A}}{A\ln(10)}}\right)^{2}$ ^[14]	$\sigma _{f}\approx \left\|a{\frac {\sigma _{A}}{A\ln(10)}}\right\|$
$f=ae^{{bA}}\,$	$\sigma _{f}^{2}\approx f^{2}\left(b\sigma _{A}\right)^{2}$ ^[15]	$\sigma _{f}\approx \left\|f\left(b\sigma _{A}\right)\right\|$
$f=a^{{bA}}\,$	$\sigma _{f}^{2}\approx f^{2}(b\ln(a)\sigma _{A})^{2}$	$\sigma _{f}\approx \left\|f(b\ln(a)\sigma _{A})\right\|$
$f=a\sin(bA)\,$	$\sigma _{f}^{2}\approx \left[ab\cos(bA)\sigma _{A}\right]^{2}$	$\sigma _{f}\approx \left\|ab\cos(bA)\sigma _{A}\right\|$
$f=a\cos \left(bA\right)\,$	$\sigma _{f}^{2}\approx \left[ab\sin(bA)\sigma _{A}\right]^{2}$	$\sigma _{f}\approx \left\|ab\sin(bA)\sigma _{A}\right\|$
$f=A^{B}\,$	$\sigma _{f}^{2}\approx f^{2}\left[\left({\frac {B}{A}}\sigma _{A}\right)^{2}+\left(\ln(A)\sigma _{B}\right)^{2}+2{\frac {B\ln(A)}{A}}\sigma _{{AB}}\right]$	$\sigma _{f}\approx \left\|f\right\|{\sqrt {\left({\frac {B}{A}}\sigma _{A}\right)^{2}+\left(\ln(A)\sigma _{B}\right)^{2}+2{\frac {B\ln(A)}{A}}\sigma _{{AB}}}}$

For uncorrelated variables ( $\rho _{{AB}}=0$ ) the covariance terms are also zero, as $\sigma _{{AB}}=\rho _{{AB}}\sigma _{A}\sigma _{B}\,$ .

In this case, expressions for more complicated functions can be derived by combining simpler functions. For example, repeated multiplication, assuming no correlation gives,

f=ABC;\left({\frac {\sigma _{f}}{f}}\right)^{2}\approx \left({\frac {\sigma _{A}}{A}}\right)^{2}+\left({\frac {\sigma _{B}}{B}}\right)^{2}+\left({\frac {\sigma _{C}}{C}}\right)^{2}.

For the case $f=AB$ we also have Goodman's expression^[2] for the exact variance: for the uncorrelated case it is

V(XY)=E(X)^{2}V(Y)+E(Y)^{2}V(X)+E((X-E(X))^{2}(Y-E(Y))^{2})

and therefore we have:

\sigma _{f}^{2}=A^{2}\sigma _{B}^{2}+B^{2}\sigma _{A}^{2}+\sigma _{A}^{2}\sigma _{B}^{2}

Example calculations

Inverse tangent function

We can calculate the uncertainty propagation for the inverse tangent function as an example of using partial derivatives to propagate error.

Define

f(x)=\arctan(x),

where $σ x$ is the absolute uncertainty on our measurement of $x$ . The derivative of $f (x)$ with respect to $x$ is

{\frac {df}{dx}}={\frac {1}{1+x^{2}}}.

Therefore, our propagated uncertainty is

\sigma _{{f}}\approx {\frac {\sigma _{x}}{1+x^{2}}},

where $σ f$ is the absolute propagated uncertainty.

Resistance measurement

A practical application is an experiment in which one measures current, $I$ , and voltage, $V$ , on a resistor in order to determine the resistance, $R$ , using Ohm's law, $R = V / I$ .

Given the measured variables with uncertainties, $I \pm σ I$ and $V \pm σ V$ , and neglecting their possible correlation, the uncertainty in the computed quantity, $σ R$ is

\sigma_R \approx \sqrt{ \sigma_V^2 \left(\frac{1}{I}\right)^2 + \sigma_I^2 \left(\frac{-V}{I^2}\right)^2 } = R\sqrt{ \left(\frac{\sigma_V}{V}\right)^2 + \left(\frac{\sigma_I}{I}\right)^2 }.

References

↑ Kirchner, James. "Data Analysis Toolkit #5: Uncertainty Analysis and Error Propagation" (PDF). Berkeley Seismology Laboratory. University of California. Retrieved 22 April 2016.
1 2 Goodman, Leo (1960). "On the Exact Variance of Products". Journal of the American Statistical Association. 55 (292): 708–713. doi:10.2307/2281592. JSTOR 2281592.
↑ Ochoa1,Benjamin; Belongie, Serge "Covariance Propagation for Guided Matching"
↑ Ku, H. H. (October 1966). "Notes on the use of propagation of error formulas". Journal of Research of the National Bureau of Standards. National Bureau of Standards. 70C (4): 262. doi:10.6028/jres.070c.025. ISSN 0022-4316. Retrieved 3 October 2012.
↑ Clifford, A. A. (1973). Multivariate error analysis: a handbook of error propagation and calculation in many-parameter systems. John Wiley & Sons. ISBN 0470160551.
↑ Lee, S. H.; Chen, W. (2009). "A comparative study of uncertainty propagation methods for black-box-type problems". Structural and Multidisciplinary Optimization. 37 (3): 239–253. doi:10.1007/s00158-008-0234-7.
↑ Hayya, Jack; Armstrong, Donald; Gressis, Nicolas (July 1975). "A Note on the Ratio of Two Normally Distributed Variables". Management Science. 21 (11): 1338–1341. doi:10.1287/mnsc.21.11.1338. JSTOR 2629897.
1 2 Lecomte, Christophe (May 2013). "Exact statistics of systems with uncertainties: an analytical theory of rank-one stochastic dynamic systems". Journal of Sound and Vibrations. 332 (11): 2750–2776. doi:10.1016/j.jsv.2012.12.009.
↑ Lecomte, Christophe (May 2013). "Exact statistics of systems with uncertainties: an analytical theory of rank-one stochastic dynamic systems". Journal of Sound and Vibrations. 332 (11). Section (4.1.1). doi:10.1016/j.jsv.2012.12.009.
↑ Lecomte, Christophe (May 2013). "Exact statistics of systems with uncertainties: an analytical theory of rank-one stochastic dynamic systems". Journal of Sound and Vibrations. 332 (11). Eq.(39)-(40). doi:10.1016/j.jsv.2012.12.009.
↑ "A Summary of Error Propagation" (PDF). p. 2. Retrieved 2016-04-04.
↑ "Propagation of Uncertainty through Mathematical Operations" (PDF). p. 5. Retrieved 2016-04-04.
↑ "Strategies for Variance Estimation" (PDF). p. 37. Retrieved 2013-01-18.
1 2 Harris, Daniel C. (2003), Quantitative chemical analysis (6th ed.), Macmillan, p. 56, ISBN 0-7167-4464-3
↑ "Error Propagation tutorial" (PDF). Foothill College. October 9, 2009. Retrieved 2012-03-01.

External links

A detailed discussion of measurements and the propagation of uncertainty explaining the benefits of using error propagation formulas and Monte Carlo simulations instead of simple significance arithmetic
GUM, Guide to the Expression of Uncertainty in Measurement
EPFL An Introduction to Error Propagation, Derivation, Meaning and Examples of Cy = Fx Cx Fx'
uncertainties package, a program/library for transparently performing calculations with uncertainties (and error correlations).
soerp package, a python program/library for transparently performing *second-order* calculations with uncertainties (and error correlations).
Joint Committee for Guides in Metrology (2011). JCGM 102: Evaluation of Measurement Data - Supplement 2 to the "Guide to the Expression of Uncertainty in Measurement" - Extension to Any Number of Output Quantities (PDF) (Technical report). JCGM. Retrieved 13 February 2013.

Authority control	GND: 4479158-6

This article is issued from Wikipedia - version of the 11/11/2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.