Environmental Data Analysis BC ENV 3017

Statistics 2b

Confidence Intervals

z confidence interval

t-distribution and confidence intervals

So far we have assumed we know the standard deviation σ and that we are dealing with large n (n>25). When we substitute σ by the observed standard deviation SD then we cannot use the normal distribution anymore, but need to use the Student t-distribution. The t-distribution has one more parameter, the degrees of freedom (sample size n -1).
The factor of '2' we have so far used has to be replaced by a 't-value' derived from the t-distribution
we can then calculate the t confidence interval using the t-distribution:

confidence interval: ± t-value * SE

The t-value can be calculated using the Excel function TINV (alpha,n-1). For a 95% confidence interval and n=5 observations in a sample, for example, it would be TINV(0.05,4) = 2.776. Again, we would then state that with 95% confidence we think that our procedure captures the true mean in the confidence interval.

Experimental errors

All experiments are characterized by an experimental error.

There are two kinds of errors:

In summary:

individual measurement = exact value + bias + chance error

Each measurement result should be given with its error. However, it is often very difficult to quantify the systematic error, and in most cases the given error is the statistical error only. This error only states how precise an experiment was and not how accurate it was.

Error reporting

Errors are reported as absolute or relative errors, for example:

ozone concentration at West Point, 8/3/1993, 14:00:

(125 ± 5) ppm or (125 ± 4%) ppm

The error can be a standard deviation, a SE (or 2*SE) or, e.g. a 95% confidence interval. You'll need to state what your error bars reflect!

Error Propagation

In many cases you need to calculate a value based on your measurement results using a formula. What is the SE of the derived number? For formulas that only include only simple mathematical operations the propagation of errors is relatively simple. The following rule approximates the error of the derived number:

addition, subtractions => absolute error = Square root (sum of  (absolute errors)2)

multiplication, division => relative error = Square root (sum of  (relative errors)2)

You can determine the error (D f)  of a more complex function f (x, y, z, ....) by using the partial derivatives of the function and the errors of the individual variables  (D x, D y, D z,....):

error propagation


Resources: