This homework uses the same data as Lab 2, except I’ve taken a subsample of 1000 people. To get the data and store it in
cdc, add this to the start of your script:
(This code is reading a comma delimited file from my website, you’ll need to be connected to the internet for it to work.)
a. Calculate a new column, called
wt_diff, for the difference between the subjects weight (
weight) and their desired weight (
wtdesire) using the following code.
b. Report summary statistics (average, standard deviation and sample size) of weight and desired weight for males and females separately.
c. Construct a histogram of
wt_diff for females only, and describe it in context of the data.
d. Construct a plot with separate histograms of weight for males and females. Don’t forget to play with binwidth, but only include one plot your report.
Conduct a paired t-test comparing the weight (
weight) and the desired weight (
wtdesire) for females only. Report your results in a statistical summary, the summary should include a sentence describing the result of the test, a sentence interpreting the point estimate, and a sentence interpreting the confidence interval.