Given the following four points, determine whether the correlation coefficient will be high or low for the least squares line with the best goodness of fit, and plot the best fit point for x = 5.

(2, 2)

(3, 5)

(4, 4)

(5, 1)

1 Answer

James Dowd

Updated on December 27th, 2020

The correlation coefficient is a measure of the strength between two different variables and can exist between -1 and 1. -1 indicates a strong negative relationship, 0 indicates no association, and 1 indicates a strong positive relationship. 

All of our needed equations to calculate both the line of least squares as well as the correlation coefficient exist across pages 69 and 70 of our FE Reference Handbook:

Based on these equations, we first want to start by finding the sum of squares of x, the sum of squares of y, and the sum of x-y products:

Sxx=i=1nxi2-(1n)(i=1nxi)2 Sxx=(22+32+42+52)-(14(2+3+4+5)2) Sxx=(4+9+16+25)-(14(196)) Sxx=(54)-(49)=5 Syy=i=1nyi2-(1n)(i=1nyi)2 Syy=(22+52+42+12)-(14(2+5+4+1)2) Syy=(46)-(14(144)) Syy=(46)-(44)=2 Sxy=i=1nxiyi-(1n)(i=1nxi)(i=1nyi) Sxy=((2)(2)+(3)(5)+(4)(4)+(5)(1))-(14(2+3+4+5)(2+5+4+1)) Sxy=(4+15+16+5)-(14(14)(12)) Sxy=(4+15+16+5)-(14(14)(12)) = 40 - 42 = -2

Now that we have these three numbers, let's find our correlation coefficient:

R=SxySxxSyy=-2(5)(2)=-210-.632

Since -.632 > -.5, this coefficient represents a strong (high) negative correlation.

Let's now find our least squares line:

y=a+bx b=SxySxx=-25 a=y¯-bx¯ a=(2+5+4+14-(-25(2+3+4+54))) a=(3-(-2820)) a=(8820) =225  y=225-2x5

Solving for y at x=5:

y=225-2x5 y=225-2(5)5 y=225-105=125

Therefore, our point is at (5, 12/5). Everything is plotted visually below:

Copyright © 2024 Savvy Engineer