Residuals and Residual Plots
Analyze residual plots to assess the fit of a regression model.
Try the Interactive Version!
Learn step-by-step with practice exercises built right in.
Residuals and Residual Plots
What Are Residuals?
A residual measures how far each observation is from the regression line.
Residual Plots
A residual plot graphs residuals (vertical axis) against the explanatory variable or the predicted values (horizontal axis).
Interpreting Residual Plots
Good Fit (Linear Model Appropriate)
- Points scattered randomly around the horizontal line
- No obvious pattern
- Roughly equal spread throughout
Curved Pattern
- Indicates the relationship is not linear
- A curved model (quadratic, exponential, etc.) may be more appropriate
- Consider transforming the data
Fan Shape (Heteroscedasticity)
- Spread of residuals increases (or decreases) as increases
- Indicates non-constant variability
- May need a transformation
Outliers in Residuals
- Points with large residuals (far from 0) are regression outliers
- These may indicate unusual observations worth investigating
Using Residual Plots to Assess Models
| Residual Plot Pattern | Assessment | |----------------------|------------| | Random scatter | Linear model is appropriate ✅ | | Curved pattern | Need a nonlinear model ❌ | | Fan/funnel shape | Non-constant variance ❌ | | Clusters | Possibly missing a variable |
Standard Deviation of Residuals ()
Interpretation: "The actual [y-values] typically differ from the values predicted by the LSRL by about [units]."
We divide by because we estimated two parameters ( and ).
Key Properties of Residuals
- The mean of residuals is always 0:
- The residuals have no linear relationship with
- The sum of squared residuals is minimized by the LSRL
AP Tip: On the AP exam, when asked "Is a linear model appropriate?", always refer to the residual plot (not the scatterplot or ). A residual plot showing random scatter indicates the linear model is appropriate.
📚 Practice Problems
No example problems available yet.