Least-Squares Regression

Find and interpret the least-squares regression line (LSRL) and make predictions.

🎯⭐ INTERACTIVE LESSON

Try the Interactive Version!

Learn step-by-step with practice exercises built right in.

Start Interactive Lesson →

Least-Squares Regression

The Regression Line

The least-squares regression line (LSRL) is the line that minimizes the sum of squared residuals:

y^=a+bx\hat{y} = a + bx

where:

  • y^\hat{y} = predicted value of yy
  • bb = slope
  • aa = y-intercept

Computing the LSRL

Slope: b=rsysxb = r \cdot \frac{s_y}{s_x}

Y-intercept: a=yˉbxˉa = \bar{y} - b\bar{x}

The LSRL always passes through the point (xˉ,yˉ)(\bar{x}, \bar{y}).

Interpreting the Slope

"For each additional [unit of x], the predicted [y variable] changes by [b] [units of y]."

Example: If y^=2.5+0.8x\hat{y} = 2.5 + 0.8x where xx = study hours and yy = exam score: "For each additional hour of studying, the predicted exam score increases by 0.8 points."

Interpreting the Y-Intercept

"When [x variable] = 0, the predicted [y variable] is [a] [units]."

Caution: The y-intercept often doesn't have practical meaning (e.g., 0 hours of studying may not make sense in context).

Making Predictions

Substitute the xx-value into the equation to get y^\hat{y}.

Extrapolation

Extrapolation = predicting yy for xx-values outside the range of the data. This is dangerous because the linear pattern may not continue.

Residuals

residual=yy^=observedpredicted\text{residual} = y - \hat{y} = \text{observed} - \text{predicted}

  • Positive residual: actual > predicted (point above line)
  • Negative residual: actual < predicted (point below line)
  • residuals=0\sum \text{residuals} = 0 for the LSRL

Properties of LSRL

  1. Minimizes (yiy^i)2\sum(y_i - \hat{y}_i)^2
  2. Passes through (xˉ,yˉ)(\bar{x}, \bar{y})
  3. Sum of residuals = 0
  4. rr and bb have the same sign
  5. Regression toward the mean: predicted values are closer to yˉ\bar{y} than observed values

AP Tip: When interpreting slope, always use "predicted" — not "will increase by." The relationship is an estimate, not a guarantee.

📚 Practice Problems

No example problems available yet.