26 June, 2017

Uncertainty Estimation (II): Combined & Expanded Uncertainties. Cover Factor kp

5. Estimation of Combined & Expanded Uncertainty. Cover factor kp

A physical measurement, however simple, has a model associated with the actual process. The physical model is represented by a mathematical model that in many cases implies approximations.


5.1. Uncertainties Propagation Law (UPL), or "Ley de Propagación de Incertidumbres (LPI)”. Combined Uncertainty
In most cases, the measurand Y is not measured directly, but is determined from other N quantities X1, X2, ..., XN , by a functional relationship f:.
The function f does not express both a physical law and the measurement process and must contain all the magnitudes that contribute to the final result, including corrections, even if these are of null value, to be able to consider the uncertainties of such corrections..
In principle, by means of type A evaluation or type B evaluation, we would be able to know the distribution functions of each of the input magnitudes, and we could derive from this the distribution function of the indirect magnitude.
Taking Taylor's first-order series around the expected value into account, we can obtain the uncertainty propagation law, which facilitates the estimation of variance.
The terms ci , cj are the coefficients of sensitivity and indicate the weight of each of the different input quantities in the output variable, represented by the measurement function. The second term is the covariance term in which the influence of input quantities on others appears, in case they are correlated. If the input magnitudes are independent, the equation can be simplified, the second term disappearing, leaving only the first.

Example 1:
UPL or “LPI” for models with shape:
In this case we obtain:
A tipical example for this shape model can be the Young’s modulus for metals elasticity.
E = σ/ε
When it is not possible to write the measurement model in an explicit way, the sensitivity coefficients can not be calculated analytically, but numerically, by introducing small changes xi + Δxi into the input quantities and observing the changes that occur in the output quantities.

5.2. Uncertainty Propagation Law “LPI” limitations.

Uncertainty Propagation Law can be applied when:
  1. Only an output magnitude appears in the mathematical model.
  2. The mathematical model is an explicit model, ie, Y = f (Xi).
  3. The mathematical expectation, the typical uncertainties and the mutual uncertainties of the input magnitudes could be calculated.
  4. The model is a good approximation to a linear development around the best estimator of the input magnitudes.
When it comes to nonlinear models, we can perform the second order approximation of the Taylor series, or even obtain the values of mathematical expectation and variance without approximations, directly, much more complex solutions mathematically than the law of propagation of uncertainties.

After the elaboration of the GUM guide, we have worked on additional guides to this one, for the evaluation of uncertainties by other methods. One of them has influenced uncertainty calculation using the Montecarlo method.

The basic idea of this method, useful for both linear and nonlinear models, is that assuming a model Y = f (Xi), where all input quantities can be described by their distribution functions, a mathematical algorithm programmed to generate a sequence of values τi = (x1, .......xN) where each xi is randomly generated from its distribution function (extractions from its distribution function). The value of y1 obtained from each sequence τi is calculated using the measurement model, the process being repeated a large number of times, on the order of 10E5 or 10E6. This high number of repetitions allows to obtain a distribution function for the magnitude y, and he calculation of their mathematical expectation and their standard deviation in mathematical form, which will lead us to the results of the best combined estimator and associated uncertainty.

In this process the results of the measurement of the input parameters are not really used to give a measurement result, but to establish with them the distribution function of the input magnitudes and to be able to randomly generate values of the function magnitude of those input quantities. As it is an automated generation, many more values can be generated than if the measurement were actually performed and with all of them finding the distribution function of the magnitude that depends on those input magnitudes. Mathematically known, the results and associated uncertainties are then obtained.

5.3. Expanded Uncertainty

The purpose of the combined standard uncertainty is to characterize the quality of the measurements. In practice what is needed is to know the interval within which it is reasonable to suppose, with a high probability of not being wrong, that the infinite values that can be "reasonably" attributed to the measurand are found. We might ask if we could use the combined standard uncertainty to define that interval (y-u , y+u). In this case, the probability that the true value of the measurand is inside the range (y-u, y+u) is low since, assuming that the distribution function of the measurand “y” is a normal function, we are talking about 68.3% of probability.

To increase this probability to more useful values for later decision making, we can multiply the combined uncertainty by a number called the “cover factorkp and to use the interval (y-uc(y) kp , y+ uc(y) kp).

The product kp uc(y) = Up is called expanded uncertainty, where kp is the coverage factor for a confidence level p.

Mathematically this means that:
 Hence the area of the density function associated with Y within this interval is:
The interval we want to know is ( y - Up , y + Up ).
The relation between p y kp depends, of course, on the density function f(Y) that is obtained from the information accumulated during the measurement process.

 

5.4. Statistical Distributions

5.4.1. Rectangular distribution
For a confidence level p we calculate the integral of the distribution function:
Standard deviation is then:
In order to calculate the coverage factor:
Operating shows:
Example:
Given a certain magnitude t, it is known to be described by a symmetrical rectangular distribution whose limits are 96 and 104. Determine the coverage factor and the expanded uncertainty for a 99% confidence level.

In this case we have a = 4, μ = 100, y σ = 2,31, and that for a confidence level of 99% the coverage factor is 1.71. Then, the expanded uncertainty is U99 = kp u = 1,71×2,31=3,949

5.4.2.- Triangular distribution
For a confidence level p, the integral of the density function is calculated:
As the standard deviation is σ= a/√6 Then kp=√6(1-√(1-p) :
Operating gives:


5.4.3. Normal distribution
In most measurement processes the distribution that best describes what is observed is the normal distribution.
Normal distribution with μ = 0 y σ = 1 is termed standard or standard distribution.
The integration required for the determination of confidence intervals is more complicated. However, in the case of the standard normal distribution there are tables that allow the calculation to be carried out in a simple way.

The way to obtain the confidence intervals is by typing. If we have a variable or magnitude Y that is distributed according to an N(µ,σ), we define a normal variable typified as Z=(Y-µ)/σ.
The integration of the standard distribution N (0,1) is in the tables. From these the confidence intervals are determined.

Example:
If, for a confidence level p the confidence interval defined for a distribution N (0,1) is (-kp, kp); As Z is a distribution N (0,1):
In a normal distribution the cover factor kp, for a confidence level p is:
The expanded uncertainty is calculated by Up (y) = kp · uc (y).

5.4.4.- Student's T-distribution

Student's T-distribution is used to make hypothesis tests when the sample size is small.
Let Z be a random variable of expectation µz and standard deviation σz of which n observations are made, estimating a mean z value and an experimental standard deviation s(z).

It is possible to define the following variable whose distribution is the T-Student with ν freedom degrees
The number of degrees of freedom is ν = n -1 for an amount estimated by the arithmetic mean of n independent observations.

If n independent observations are used to make a least square fit of m parameters, the number of degrees of freedom is ν = n -m

For a T-Student variable with ν freedom degrees, el the interval for the confidence level p is (-tp (ν) , tp (v)).

Factor tp (ν) is found inside T-Student tables.

Example:
Suppose that the measurand Y is simply a magnitude that is estimated by the arithmetic mean X of n independent observations, where s(X) is the experimental standard deviation:
The best estimate of Y is y = X with associated uncertainty uc (y) = s(X)
The variable is distributed according to the t-Student. Thus:
Cover Factor is tp(ν) and Expanded Uncertainty is Up (ν) = tp (ν) uc (y).

 5.5. Expanded Uncertainty determination after measurement.

The problem that arises is to determine the expanded uncertainty after a measurement has been made.
If it is a direct measure, we can act in different ways:
- Uncertainty determined according to type A assessment of uncertainty
The cover factor for a given confidence level is obtained from the Student's T-distribution with n-1 degrees of freedom, where n is the number of measurements, or the normal distribution, if the number of repetitions is sufficiently large.
- Uncertainty determined if the distribution Y is rectangular, centered on the estimator y

The cover factor is obtained from the rectangular distribution.

If it is a matter of calculating the expanded uncertainty for an indirect measure, it is somewhat complicated to calculate the distribution function, which must be performed by special analytical or numerical methods. But we can think of simplifying:

A first approximation would be to calculate the probability distribution function by convolution of the probability distributions of the input quantities.

A second approximation would be to assume the indirect measure as a linear function of the input quantities
 5.5.1. Central limit theorem

The central limit theorem in its different versions ensures that the sum of independent and equidistributed random variables converges to a normal one. On paper convergence is commonly very fast, but actual experiments make one despair before seeing the Gaussian bell. There is no contradiction in this, for example we can understand the probability 1/2 to obtain face as a limit when we throw infinite times a coin and we can not demand that after 20 or 30 runs we have a precise approximation counting the percentage of hits.
The following graphs show the histograms of the sum of the 10 dice scores compared to the corresponding normal when the experiment is repeated one hundred, one thousand and ten thousand times. Naturally they come from a computer simulation.
Sum of scores of ten dice thrown a hundred times.
Sum of scores of ten dice thrown a thousand times.
Sum of scores of ten dice thrown ten thousand times.
Suppose an indirect measure is a linear function of the input quantities according to equation 33:
• The central limit theorem says:
“The distribution associated with Y will approximate a normal distribution of expectation
and variance
with E(Xi) the expectetation of Xi and σ2(Xi) the variance of Xi. This happens if Xi are independents and σ2(Y) is much larger than any other individual component ci σ2(Y)”
Conditions of the theorem are fulfilled when the combined uncertainty uc(y) is not dominated by typical uncertainty components obtained by type A evaluations based on few observations or type B evaluations based on rectangular distributions.

Convergence towards the normal distribution will be the faster the greater the number N of variables involved, the more normal these are, and when there is no dominant.
Consequently, a first approximation to determine the expanded uncertainty defining a confidence level interval p will be to use the cover factor proper of a normal distribution, kp.
If the number of random readings is small, then the value of uA derived may be inaccurate, and the distribution of the random component is best represented by the t-Student distribution. We would overestimate the uncertainty, especially if q was small and uA y uB were comparable in size.

5.5.2. Effective degrees of freedom. Welch-Satterthwaite approach

The problem can be solved by using the Welch-Satterthwaite approximation formula, which calculates the number of effective degrees of freedom of the combination of the t-Student distribution with a Gaussian distribution. The resulting distribution will be treated as a t-Student distribution with a calculated number of freedom degrees.
or:
If relative uncertainties are used, the number of effective degrees of freedom is:
In evaluations type B, to calculate the effective degrees of freedom, we will use the approximation given by the following formula:
This formula is obtained by considering the "uncertainty" of uncertainty. The greater the number of degrees of freedom, the greater the confidence in uncertainty. In practice, in type B evaluations vi→∞

In Type A evaluations the calculation of the number of degrees of freedom will depend on the statistic used to evaluate the most probable value.

References:
Next: Uncertainty Estimation (III): Examples of uncertainty calculation: resolution, drift, influence of temperature on measurements, steps to follow to estimate the result of a measurement & Conclusions

No comments:

Post a Comment