MATH 315 Exam 01
Due 2025-10-01 by 12:51pm
Please submit Exam 01 just like you would a homework assignment
-
Use the function
read.csv
to read in the dataset found at the following URLhttps://raw.githubusercontent.com/roualdes/data/refs/heads/master/possum.csv
The dataset is about possums and has a help file.
-
What type of study did these data likely come from? Why?
-
Load the libraries
ggplot2
anddplyr
. -
What are all the variable types?
-
Identify reasonable explanatory and response variables (one of each). Be specific about which is which and why.
-
Use
ggplot
to make a histogram of a variable of your choice. -
Calculate the mean and median for the same variable from question 6. above. Explain why the shape of the data in the histogram above makes sense relative to the values of the mean and the median.
-
Use
ggplot
to make a boxplot (multiple boxes). Write two complete English sentences describing the plot, using keywords from this class. You should use at least quantile/quartile, interquartile range, and mean. -
Calculate the summary statistics
mean
,std
, and number of datan
, of some numerical variable grouped by a categorical variable of your choice. Write one or two complete English sentences describing these summary statistics in context of the data. -
Use
ggplot
to make a scatter plot (of points) from two variables of your choice. Write one complete English sentence describing the plot, using keywords from this class. -
For each part, compare the datasets 1. and 2. below, based on means and standard deviations. Which, if any, has the larger mean? Which, if any, has the larger standard deviation? Explain your answers.
a.
- 0, 2, 4, 6, 8, 10
- 20, 22, 24, 26, 28, 30
b.
- 100, 200, 300, 400, 500
- 0, 50, 300, 550, 600