Nathaniel Z. answered 06/09/24
Ph.D. Candidate in Economics with 4+ Years of Teaching Experience
I would have to see the specific data, but here are some hints.
a/b. Assuming standard OLS/Linear Regression the formula for calculating the slope coefficient is cov(x,y)/var(x). In your case, cov(log concentration dde, shell thickness)/var(log concentration dde). In R, one way to calculate this is with the lm() command. I recommend something like this:
model <- lm(shell thickness ~ log concentration dde, data = csv)
summary(model)
c. residuals are the vertical distance between your model's prediction and the true value. In R, consider the following steps:
- y_hat <- 2.45 - 0.5*(log concentration dde)
- resid <- shell thickness - y_hat
- rss_new <- sum(resid^2)
- You should have the RSS from the previously fitted model
- Show that the RSS from the previously fitted model is smaller than the RSS from this new model
d. The main Gauss-Markov assumptions (linear model assumptions) are:
- homoskedasticity (constant variance of error terms)
- independent variable (x's) are unrelated to error term
- errors have a mean of zero
- no multicollinearity if you have multiple x's (i.e the covariance between x's is essentially 0)
You can check that these assumptions are satisfied by graphing the residuals. Consider either a histogram of the residuals or a scatter plot of the residuals with log concentration of DDE on the x-axis.
I hope this helps.