Given the following four points, determine whether the correlation coefficient will be high or low for the least squares line with the best goodness of fit, and plot the best fit point for x = 5.
(2, 2)
(3, 5)
(4, 4)
(5, 1)
The correlation coefficient is a measure of the strength between two different variables and can exist between -1 and 1. -1 indicates a strong negative relationship, 0 indicates no association, and 1 indicates a strong positive relationship.
All of our needed equations to calculate both the line of least squares as well as the correlation coefficient exist across pages 69 and 70 of our FE Reference Handbook:
Based on these equations, we first want to start by finding the sum of squares of x, the sum of squares of y, and the sum of x-y products:
Now that we have these three numbers, let's find our correlation coefficient:
Since -.632 > -.5, this coefficient represents a strong (high) negative correlation.
Let's now find our least squares line:
Solving for at x=5:
Therefore, our point is at (5, 12/5). Everything is plotted visually below: