Environmental Data Analysis BC ENV 3017
Histogram and normal distribution
-
Open a new MS Excel workbook and create a column of 100 random numbers
between 0 and 1 using the MS Excel rand() function.
-
In a second column create 100 numbers that are the average of 10 random
numbers (type: '=(rand()+rand()+rand() ....)/10').
-
Paste these two columns into the adjacent column using the 'Paste Special,
values' command (Otherwise EXCEL will update your random numbers after
every additional entry).
-
Determine max, min, average, and standard deviation of these lists.
-
Make histograms of both columns and overlay the normal distribution using
the statplus histogram function.
-
How do the two histograms/distributions compare?
-
Are your data normally distributed?
-
Calculate averages of your lists using 2, 5, 10, 20, 40, 75, and 100 samples
(entries) on your list and plot the average versus the number of samples
you have used as a basis for calculation of the average. What should be
the average if you had an indefinite number of samples? How do you explain
the pattern that you see?
-
(A variation of this would be to make up the (normalized)
histograms step by step and overlay the normal distribution function.)