correlation and regression

Question

Run a simple (Pearson) correlation analysis on the data below for the following hypothetical situation.

H0: Gender affects income level.

H1: Gender does not affect income level.

Use bivariate correlation.

Gender Income

1. Male 500

2. Female 350

3. Male 700

4. Female 800

5. Male 650

6. Female 450

7. Male 900

8. Female 500

9. Male 700

10. Female 650

11. Male 550

12. Male 400

13. Male 700

14. Male 900

15. Male 500

16. Male 800

17. Female 600

18. Female 900

19. Female 500

20. Male 600

Please,

Patrick B. · Accepted Answer

You must have the same number of men and women sampled in the data.

Remember, these are supposed to be ordered pairs on the scatter plot.

Since there are 20 values, you have 10 data points (X, Y), where X is a man's income and Y is a woman's income.

Some of the values appear to be labeled MALE instead of FEMALE, since the data as posted list 12 men and 8 women.

So you must figure out which ones are incorrectly labeled. Once you do, you can go to the following website, input the data, and the calculator will find the correlation coefficient r for you, per the Pearson formula.

https://www.socscistatistics.com/tests/pearson/Default2.aspx
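For readers who would rather compute the coefficient than use the calculator, here is a minimal sketch of the Pearson formula in Python. It does not use the paired (man, woman) reading described above, because the correct pairing is not known. Instead it assumes, purely for illustration, that gender is coded Male = 1 and Female = 0 and correlated directly with income (a point-biserial correlation). The function name `pearson_r` and the 0/1 coding are assumptions, not part of the answer.

```python
import math

# Data copied from the question: (gender, income) for the 20 respondents.
data = [
    ("Male", 500), ("Female", 350), ("Male", 700), ("Female", 800),
    ("Male", 650), ("Female", 450), ("Male", 900), ("Female", 500),
    ("Male", 700), ("Female", 650), ("Male", 550), ("Male", 400),
    ("Male", 700), ("Male", 900), ("Male", 500), ("Male", 800),
    ("Female", 600), ("Female", 900), ("Female", 500), ("Male", 600),
]


def pearson_r(xs, ys):
    """Pearson correlation: sum of cross-deviations over the root of
    the product of the two sums of squared deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Assumption (not from the answer): code gender as Male = 1, Female = 0.
gender_code = [1 if gender == "Male" else 0 for gender, _ in data]
income = [value for _, value in data]

r = pearson_r(gender_code, income)
print(f"r = {r:.3f}")  # about 0.194 with this coding
```

Under this coding a positive r only means that the group coded 1 (men) has the higher mean income; swapping the codes flips the sign but not the magnitude.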

Another option here is to run a hypothesis test comparing the MEAN incomes of MEN vs. WOMEN.

The null hypothesis is that there is no difference between the means.

The alternative hypothesis is that there is a difference between the means.

For the men, the mean is 658.33, with a sample variance of var1 = 24924.24 and a standard deviation of 157.8741.

For the women, the mean is 593.75, with a sample variance of var2 = 33883.93 and a standard deviation of 184.0759.

The sample sizes are N1 = 12 (men) and N2 = 8 (women).

The standard error is SE = sqrt(var1/N1 + var2/N2) = 79.45131.

The test statistic is t = ((mean1 - mean2) - D) / SE,

where D is the hypothesized difference, which in this case is D = 0.

So the test statistic is t = (658.33 - 593.75) / 79.45131 = 0.812867.
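The arithmetic above can be checked with a short Python sketch. It splits the incomes by the gender labels as posted in the question (12 men, 8 women) and uses the sample variance (dividing by N - 1), which is what reproduces the figures quoted in the answer. The variable names are illustrative.

```python
import math
import statistics

# Incomes split by the gender labels exactly as posted in the question.
men = [500, 700, 650, 900, 700, 550, 400, 700, 900, 500, 800, 600]
women = [350, 800, 450, 500, 650, 600, 900, 500]

n1, n2 = len(men), len(women)
mean1, mean2 = statistics.mean(men), statistics.mean(women)

# statistics.variance is the sample variance (denominator N - 1).
var1, var2 = statistics.variance(men), statistics.variance(women)

# Standard error of the difference in means, variances not pooled.
se = math.sqrt(var1 / n1 + var2 / n2)

d = 0  # hypothesized difference under the null hypothesis
t_stat = ((mean1 - mean2) - d) / se

print(f"mean1 = {mean1:.2f}, var1 = {var1:.2f}")
print(f"mean2 = {mean2:.2f}, var2 = {var2:.2f}")
print(f"SE = {se:.5f}, t = {t_stat:.6f}")
```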

That test statistic does not lie within the rejection region at any conventional significance level (for example 10%, 5%, or 1%), so we fail to reject the null hypothesis: the data show no significant difference between the means.

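A normal distribution is assumed, despite there being only N = 20 < 30 observations in the sample. A t-table leads to the same conclusion: for any degrees of freedom between min(N1, N2) - 1 = 7 (a conservative choice) and N1 + N2 - 2 = 18 (the pooled-variance value), the two-sided 5% critical value is at least about 2.10, far above 0.81. The formula for the exact degrees of freedom with unequal variances requires a much more tedious calculation.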
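The more tedious degrees-of-freedom calculation is usually taken to be the Welch-Satterthwaite approximation. That is an assumption here, since the answer does not name the formula. A minimal sketch using the summary statistics quoted above:

```python
# Summary statistics as quoted in the answer.
var1, n1 = 24924.24, 12  # men
var2, n2 = 33883.93, 8   # women

# Per-group contributions to the squared standard error.
a = var1 / n1
b = var2 / n2

# Welch-Satterthwaite approximation for the degrees of freedom
# (assumed here to be the formula the answer alludes to).
df_welch = (a + b) ** 2 / (a ** 2 / (n1 - 1) + b ** 2 / (n2 - 1))

# Simple bounds: the conservative choice and the pooled-variance value.
df_lower = min(n1, n2) - 1
df_pooled = n1 + n2 - 2

print(f"Welch df = {df_welch:.1f} (between {df_lower} and {df_pooled})")
```

With these inputs the approximation gives roughly 13.5 degrees of freedom, so the t-table row to consult would be 13 or 14, and the conclusion is unchanged.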

Please repost.
