Jack N. answered 02/21/22
Statistics Tutor
Let's begin by defining what we know:
Residents in our sample = n = 169
Mean amount of trash produced per person per week = x-bar = M = 35.8
Standard deviation of our sample = s = 6.1
A. We need to think about the guidelines for selecting the Student's t distribution or the standard normal z distribution.
1. It is advisable to select the Student's t distribution if we do not know the population variance.
2. However, as the number of observations in our sample increases, the Student's t distribution more closely approximates the standard normal z distribution.
3. We have a sample greater than 120. Therefore, I would argue that it does not substantively matter if we use a student's t distribution with degrees of freedom (n-1) 168, or the standard normal z distribution.
4. Given the information above, generate an argument for your selection of distribution.
B. Find a 90% confidence interval.
1. It is intuitive to remember that the confidence interval will be symmetrical on either side of our mean.
2. As such, we will take our mean (35.8), add a value to it, and then subtract a value from the mean, to find our interval range.
3. The formula to find the confidence interval for a population mean is as follows:
Lower bound = sample_mean - (confidence level)*s/sqrt(n)
Upper bound = sample_mean + (confidence level)*s/sqrt(n)
Let's find the values to fill out the formula:
Sample_mean = 35.8
Confidence level = if we we want a 90% confidence interval, we need capture .90 of the distribution within the interval, leaving .05 of the distribution in each tail. The z score for .05 in each tail is 1.645. (if we used a student't t distribution, our value would be 1.654)
Sample standard deviation = s = 6.1
Observations in sample = n = 169
Lower bound = 35.8 - (1.654)*(6.1/sqrt(169))
Upper bound = 35.8 + (1.654)*(6.1/sqrt(169))
C. What percent of future samples (n = 169) would contain the true population mean?
Here we want to think about the idea of repeated samples. Let's pull a quote from "Statistics for Business and Economics" - Newbold, Carlson, and Thorne 8th edition.
"If the population is repeatedly sampled and intervals are calculated... then in the long run [90%] of the intervals would contain the true value of the [population mean]." p. 271
I hope this is helpful!