Environmental Data Analysis BC
ENV 3017
Statistics 3
Significance tests
Assume you want to find out if the a
list of data is statistically different from a certain value. In this
case
you need to perform a significance test.
There are 2 ways how to determine if samples are different from each
other of different from a certain value, one uses confidence intervals,
one uses hypothesis testing as main concept.
Confidence intervals
Let us assume, you want to
calibrate an ozone detector. Your standard has an ozone conc. of 70ppb.
Your
measurements
were: 78, 83, 68, 72, 88; average: 77.80, SD = 8.07. Is this
significantly
different from 70?
One way to address this would be to calculate the confidence interval
and see if the expected value falls within the confidence interval. It
turns out to be 77.80±10.02 and includes the 70ppb.
Therefore we cannot say that the instrument has drifted. But of
course we could perform more measurements (increase n) and look at the
question more carefully.
Another example is to use our use a sample of PM counts obtained at
the one place but counted by different students. The samples were collected in the following way.
| student1 |
student2 |
difference |
| 52 |
51 |
1 |
| 53 |
52 |
1 |
| 54 |
53 |
1 |
| 55 |
54 |
1 |
| 53 |
50 |
3 |
So, here we want to know if the there is a significant difference
between the two students looking at the same sample . We
- calculate the difference between the count rates of the two
students for each square (because the 2 count rates in each row are not
independent, they are for the same square
- determine the mean difference and the SE for the difference
- determine the confidence interval (degrees of freedom = n-1 = 4):
- mean ± TINV(0.05,4)*SEdiff
- see if 0 falls within that confidence interval
We can also look at the same results by assuming that the samples
were measured by the same student but reflect different samples:
| sample1 |
sample2 |
|
| 52 |
51 |
|
| 53 |
52 |
|
| 54 |
53 |
|
| 55 |
54 |
|
| 53 |
50 |
|
Now it does not make sense to calculate a difference for each pair (row), because the counts are unrelated.
So in this case we would
- calculate the mean count rates and their SE
- determine the differnce between the two means
- determine the SE of the difference
- determine the confidence interval (degrees of freedom = 8):
- mean ± TINV(0.05,8)*SEdiff
- see if 0 falls within that confidence interval
Hypothesis testing
Another approach is to use the
concept of hypothesis testing
to address the questions raised above and perform a formal t-test.
Let us assume, you want to
calibrate
the ozone detector. Your standard has an ozone conc. of 70ppb. Your
measurements
were: 78, 83, 68, 72, 88; average: 77.80, SD = 8.07. Is this
significantly
different from 70? In order to perform this test, we need to formulate
the problem as a hypothesis:
- Null-hypothesis (Ho):
There is
no difference between the expected value and the observed value.
Observed
value = expected value.
- Alternative hypothesis
(Ha):
The expected and observed value are different. Observed value < or
>
expected value.
The significance test investigates
if
the null-hypothesis can be rejected. It yields the probability (P-value)
for the null-hypothesis being true. In other words, we are testing the
following question: What is the probability that we obtained our
observed
average (which is different from the expected average) by chance?

The t-statistic is used to
measure
the difference between the data and the expected values in standard
units.
We are 2.2 SE units off the expected value! What is the probability
that
we are that far off (or more) from the expected value? We can use the
normal
curve to estimate this probability in n is large (fig).
Because we deal here with small
numbers
of measurements, we need to use Student’s t curve, which looks
very
similar to the normal curve (fig).
(Use of the normal distribution is only justified if we have a very
large
sample size or if we know the true SD (s). In case we can use the normal
distribution,
the t-statistic mentioned above would be called z-statistics.)
When you use the student curve,
you
will be asked to give the degrees of freedom:
degrees of freedom = number of
measurements
-1
In our example, we need to use
the
two tailed t-test (TDIST). We get a P-value of 0.097 or 9.7%. The
cut-off
value is typically chosen at 5%. If the probability is below 5%, we
need
to reject the null-hypothesis. In our case, we cannot reject the null
hypothesis.
that means there is no significant difference between our
observed
and expected value. By performing more measurements, we can improve the
SE and more precisely check, if our instrument is systematically off.
This test can also be used to
investigate
if there are systematic differences between experiments, for example
between
samples obtained at different sites. Depending on certain constraints,
you need to use one of the following t-tests that EXCEL offers or use
the equivalent Statplus functions:
- t-Test: Paired two sample
for
Means (or Statplus: 1-sample t-test, paired)
- t-Test: Two-sample assuming
unequal
standard deviations or variances (=SD2) (or Statplus: 2-sample t-test)
- (t-Test: Two-sample assuming
equal variances)
Usually you do not know if the
variances
are equal or not. Just use the "unequal" test then. It will give you
the
right answer also in the case the variances are equal. The best way to
understand these cases is to go through examples.
Resources:
- Berk & Carey, chapter 6
- Freedman, D., Pisani, R.,
Purves,
R., and Adhikari, A. (1991) Statistics. WW Norton & Company, New
York,
2nd ed. 514pp.
- Fisher, F.E. (1973)
Fundamental
Statistics
Concepts. Canfield Press, San Francisco, 371 pp.
- Berenson, M.L., Levine, D.M.,
and
Rindskopf, D. (1988) Applied statistics - A first course. Prentice
Hall,
Englewood Cliffs, NJ, 557pp.