Plotting and Manipulating Your Data

A key feature of good scientific communication is making good figures and plots.  Many more readers will see your figures and plots than will ever read the text of the paper. PLease note: we can touch upon only a very small subset of data analysis tools and types of graphs in this brief lecture. You may need to use a specific tool for your project - please discuss this with your advisor and research mentor!

Types of Plots (This is not an exhaustive list).

  1. Y versus X- scatter plot, simple line or symbol plot.
  2. time series- data points are plotted versus time
  3. linear regression plot- scatter plot plus best fitting trend line
  4. moving average- data are averaged in blocks around a central point, (for example 10 points on either side of a given point).   Make the most sense for time series data with large amounts of variability.  Problems: data points at the very end and start of the time series cannot be included.
  5. anomaly plot (the average trend is subtracted out- only the differences between the average and the data are plotted)
  6. pie chart- best for displaying data that should add up to 100%
  7. maps - X-Y-Z plot
  8. log-log plot or semi-log plot - used for displaying data with large ranges in the numbers (for example, data points range from 1 to 1000).  Problem: can obscure serious errors in the data.

Characteristics of Good Plots, Figures, and Tables

Technical issues

Avoid becoming a graphical sinner!

Resources

Jaffe, A.J., and Spirer, H.F. (1987) Misused statistics. Marcel Dekker, Inc., New York, 237pp. (HA29.J29 1987)