1. Pick a dataset (CSV file) from my GitHub repository named data. Most files have READMEs as .txt files1. Perform a short analysis on two numerical variable of your choice using simple linear regression. A short analysis should include at least:

    1. A sentence or two, in your own words (ie not directly copied from the README), explaining what the dataset is all about and what variable you will investigate in your analysis. Be sure to be explicit about what the observations are. Decide and justify which numerical variable should be the explanatory and which the response variable.

    2. A well labeled, units and all, plot of your variable. Put axis labels on your plot by using bp.labels(...).

    3. Point estimates of the intercept \(\beta_0\) and slope \(\beta_1\). Use Scipy’s function minimize(...) along with the simplified log-likelihood.

    4. Write one complete English sentence explaining each value you just found, in context of the data. Does each estimate make sense? Explain.

    5. Use the bootstrap method to produce confidence intervals for each value you just found, for a percent confidence of your choice.

    6. Write one complete English sentence describing each confidence interval you just found, in context of the data.

    7. Predict the value of the response variable when the explanatory variable is equal to its mean.

    8. Write one complete English sentence describing the value you just found, in context of the data.

    9. Use your bootstrap resampled statistics, without redoing the bootstrap, to produce a confidence interval for the value of the response variable when the explanatory variable is equal to its mean.

    10. Write one complete English sentence describing the confidence interval you just found, in context of the data.

    11. Extraploate, that is make a prediction outside of the range of your data. Does this prediction make sense, why or why not?

    12. Add to or make a separate well labeled plot that includes a visualization of your analysis.


  1. If there isn’t an associated README consider helping me out by writing one and filing a PR.