MATH 456 Homework 05
Due 2026-03-10 by 11:59pm
-
Download the following dataset into your Homework 04 repository: penguins. Here's the metadata. Please
pushthis dataset along with yourqmdfile for Homework 05. I don't need the output file nor any of the output file's dependencies.In an attempt to predict a penguins body mass,
body_mass_g, use the explanatory variablesbill_length_mm,flipper_length_mm,island,sex, andspecies. -
Use
dplyr's functionsselectandmutateto select only the variables of interest and throw away any rows containingNAs. -
Use
ggplot2to make an appropriate scatter plot. Color the points using a categorical/qualitative variable. -
Fit a model with unique intercepts by
sexand a shared slope acrossbill_length_mm. -
Interpret the slope of the model above in context of the data.
-
Interpret adjusted
in context of the data. -
Fit a model with unique intercepts by
speciesand unique slopes acrossbill_length_mmbyspecies. -
Set up and conclude a hypothesis test for the term
speciesChinstrap:bill_length_mmusing a level of signifiance of. -
Interpret the conclusion of your hypothesis test in context of the data.
-
Interpret the coefficient for the term
speciesChinstrap:bill_length_mmin context of the data. -
Interpret the coefficient for the term
speciesGentooin context of the data. -
Interpret adjusted
in context of the data. -
Looking at the (many) p-values for all the coefficients and adjusted
values for both models above, what can we say about the relationship between adjusted and p-values in general? -
Using only the variables mentioned above, fit a model with the highest adjusted
you can.