Nicholas T. answered 05/12/23
Statistician and Data Analyst with 7+ Years of Teaching Experience
We first need to find the linear correlation coefficient, r, between driver age and driver deaths per 100,000. This can be done using technology, such as a graphing calculator or Excel. I will use Excel, plugging the variables into separate columns, then using the CORREL formula to obtain the correlation coefficient. This results in:
r=-0.390.
We then want to find the test statistic for a hypothesis test for significance of correlation. This is calculated as:
t=r*sqrt[(n-2)/(1-r^2)]
Where r=the correlation coefficient and n=the number of data pairs. Plugging in r=-0.390 and n=7, we get a test statistic of -0.947.
The last thing to do is use the test statistic to find the p-value. Because this is a right-tailed test (Because we have > in the alternative hypothesis), the p-value is the area to the right of the test statistic. The test statistic follows a t distribution, with n-2 degrees of freedom. We can find this p-value in Excel by using the following formula:
=T.DIST.RT(-0.947,5)
Where -0.947 is the test statistic and 5 is the degrees of freedom. This results in a p-value of 0.8064. This is not less than the significance level of 0.05, so we would fail to reject the null. We cannot conclude that the correlation is positive (and since the sample correlation r was negative, we would never conclude that it was positive).
Let me know if you have any questions or if anything is unclear!