MATH 315 Exam 01 Practice
Due 2025-09-29 by 12:51pm
Please submit Exam 01 Practice just like you would a homework assignment
-
Use the function
read.csv
to read in the dataset found at the following URLhttps://raw.githubusercontent.com/roualdes/data/refs/heads/master/horse_prices.csv
The dataset is about horses and their prices and has a help file.
-
What type of study did these data likely come from? Why?
-
Load the libraries
ggplot2
anddplyr
. -
What are all the variable types?
-
Identify reasonable explanatory and response variables (one of each). Be specific about which is which and why.
-
Make a histogram of a variable of your choice.
-
Calculate the mean and median for the same variable from question 6. above. Explain why the shape of the data in the histogram above makes sense relative to the values of the mean and the median.
-
Use
ggplot
to make a boxplot (multiple boxes). Write one or two complete English sentences describing the plot, using keywords from this class. You should use at least quantile/quartile, interquartile range, and mean. -
Calculate the summary statistics
mean
,std
, and number of datan
, of some numerical variable grouped by a categorical variable of your choice. Write two complete English sentences describing these summary statistics in context of the data. -
For each part, compare the datasets 1. and 2. below, based on means and standard deviations. Which, if any, has the larger mean? Which, if any, has the standard deviation? Explain your answers.
a.
- 3, 5, 5, 5, 8, 11, 11, 11, 13
- 3, 5, 5, 5, 8, 11, 11, 11, 20
b.
- -20, 0, 0, 0, 15, 25, 30, 30
- -40, 0, 0, 0, 15, 25, 30, 30