Linear Regression and Correlation
Find lines of best fit and interpret correlation in real-world contexts.
Try the Interactive Version!
Learn step-by-step with practice exercises built right in.
Linear Regression and Correlation
Scatter Plots Review
A scatter plot shows the relationship between two quantitative variables.
Direction: Positive, negative, or none Form: Linear or nonlinear Strength: Strong, moderate, or weak
Correlation Coefficient ()
Measures the strength and direction of a linear relationship:
- : Perfect positive linear
- : Perfect negative linear
- : No linear relationship
- close to 1: Strong
Line of Best Fit (Regression Line)
The line that best represents the trend in the data:
Making Predictions
Use the regression equation: If and :
Residuals
- Positive residual: actual is above the line
- Negative residual: actual is below the line
Interpreting the Slope
"For every 1-unit increase in , the predicted increases/decreases by units."
Interpreting the Y-Intercept
"When , the predicted is ." (May not always make practical sense.)
Cautions
- Correlation ≠ Causation
- Don't extrapolate beyond the data range
- Outliers can strongly affect the regression line
- only measures linear relationships
Example interpretation: "There is a strong positive linear relationship () between hours studied and test score. For each additional hour of studying, the predicted test score increases by about 5 points."
📚 Practice Problems
No example problems available yet.