A linear regression equation for a data set has a correlation coefficient of r=0.4. Would you be conﬁdent using your equation to predict what will happen outside of the data set?
- If by "outside of the data set" you mean "out of your data's range", the answer is no.
- To know if your model fits your data, use the
- You cannot use a model for prediction out of the range of the dataset.
Why ? Because you simply have no idea of the shape of the relationship out of your data's range.
For example, let's see the graph below. Let's say that your data are the black dots. The dotted lines represent your data's range. A linear relationship would well fit to your data and you can apply your model inside the range.
However, outside the range, the data do not follow a linear relationship anymore. So, you cannot predict them with the relation you applied to your data.
- Let's say you want to predict values in the range of your dataset.
To know if your model fits your data correctly, you should look at the
In fact, the correlation coefficient measure the strength of the relation between X and Y, and will not say anything about your model.