
Konopelski B. answered 08/14/22
Statistics, Finance, SPSS, SAT Math and physics Tutor
Descriptive statistics
Mean x= 9
variance of x : s2 x 11
Mean of y 7.50
Sample variance of y : s2
y 4.125
Correlation
between x and y 0.82
Linear regression line y = 3.00 + 0.500x
R2= 0.67
The first scatter plot appears to be a simple linear relationship, corresponding to two variables correlated where y could be modelled as gaussian with mean linearly dependent on x.
The second graph while a relationship between the two variables is obvious, it is not linear, and the Pearson correlation coefficient is not relevant. A more general regression and the corresponding coefficient of determination would be more appropriate.
In the third graph the modelled relationship is linear, but should have a different regression line The calculated regression is offset by the one outlier which exerts enough influence to lower the correlation coefficient from 1 to 0.816.
Finally, the fourth graph shows an example when one high-leverage point is enough to produce a high correlation coefficient, even though the other data points do not indicate any relationship between the variables.