Homework 1#
Deadline
Due Sunday, Sept 8, 2024 at midnight on Crowdmark.
Instructions#
This homework covers two classes. Problems listed below are from the textbook.
Mon 8/26, we covered Ch 1 and Reviewed Python (Sec 2.4).
Be sure to post a gif in the slack channel!
2.4.9: Use the auto data set from class.
Weds 8/28, we covered Sec 2.2
2.4.1Remembered I don’t like this question, removing it from the homework.2.4.2. Explain your reasoning. Note that parts a and b were likely discussed in class.
2.4.4 a,b. Describe one example for each of part a, and b.
Weds 9/4, we covered Ch 2.2.1 - 2.2.2
2.4.3 (Note we have only done irreducible error, not Bayes error since that has to do with classification)
2.4.8
Note that there is a typo in 2.4.8f, at least with respect to the version of the data set we have. The information in the
Top10perc
column is a percentage 0-100 rather than a value between 0 and 1. So use the following replacement code instead.
college['Elite'] = pd.cut(college['Top10perc'], [0,50,100], labels=['No', 'Yes'])
Note: the content from Fri, 9/6, will be included on HW2 due next week.
Important
Standard instructions for submissions and deadlines can be found on the Homework Info Page.