Coefficient of Determination

Interpret $r^2$ as the proportion of variability explained by the regression model.

Coefficient of Determination ($r^2$)

Definition

$$r^2 = (\text{correlation coefficient})^2$$

$r^2$ measures the proportion of variability in the response variable ($y$) that is explained by the linear relationship with the explanatory variable ($x$).

Interpretation

"$r^2 \times 100\%$ of the variability in [y variable] is explained by the linear relationship with [x variable]."

Example: If $r = 0.85$, then $r^2 = 0.7225$. "72.25% of the variability in exam scores is explained by the linear relationship with hours studied."
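The arithmetic in this example can be sketched in a few lines of Python. The function name `interpret_r_squared` and its parameters are hypothetical, chosen only for illustration; the sketch assumes you already know $r$.

```python
def interpret_r_squared(r, y_name, x_name):
    """Square r and fill in the standard interpretation sentence."""
    r_squared = r ** 2
    return (f"{r_squared * 100:.2f}% of the variability in {y_name} "
            f"is explained by the linear relationship with {x_name}.")

# Values taken from the example above: r = 0.85.
sentence = interpret_r_squared(0.85, "exam scores", "hours studied")
print(sentence)
```

Note that the sentence names both variables in context, which is what the interpretation template asks for.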

Understanding $r^2$ Visually

$r^2$ compares two models:

  1. No model: Use $\bar{y}$ to predict every observation (total variability = $\sum (y_i - \bar{y})^2$)
  2. Regression model: Use $\hat{y} = a + bx$ (remaining variability = $\sum (y_i - \hat{y}_i)^2$)

$$r^2 = 1 - \frac{\sum (y_i - \hat{y}_i)^2}{\sum (y_i - \bar{y})^2} = 1 - \frac{SS_{\text{residual}}}{SS_{\text{total}}}$$
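As a rough check on this formula, the sketch below computes $r^2$ two ways: from the sums of squares and by squaring the correlation coefficient. The data and all function names are made up for illustration, and the line is fitted with the usual least-squares formulas, which the lesson does not spell out.

```python
import math

def least_squares_line(xs, ys):
    """Return intercept a and slope b of the least-squares line y-hat = a + b*x."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    b = sxy / sxx
    a = y_bar - b * x_bar
    return a, b

def r_squared(xs, ys):
    """Compute r^2 as 1 - SS_residual / SS_total."""
    a, b = least_squares_line(xs, ys)
    y_bar = sum(ys) / len(ys)
    ss_total = sum((y - y_bar) ** 2 for y in ys)            # "no model" variability
    ss_residual = sum((y - (a + b * x)) ** 2                # variability left over
                      for x, y in zip(xs, ys))
    return 1 - ss_residual / ss_total

def correlation(xs, ys):
    """Compute the correlation coefficient r."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical data, invented for illustration only.
hours = [1, 2, 3, 4, 5, 6]
scores = [52, 60, 61, 70, 74, 85]

r2_from_sums = r_squared(hours, scores)
r2_from_r = correlation(hours, scores) ** 2
print(round(r2_from_sums, 4), round(r2_from_r, 4))
```

For a least-squares line with one explanatory variable, the two values should agree up to floating-point rounding.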

Properties of $r^2$

  1. $0 \leq r^2 \leq 1$
  2. $r^2 = 1$: Perfect linear fit (all points on the line)
  3. $r^2 = 0$: No linear relationship
  4. Higher $r^2$ = better linear model fit
  5. $r^2$ doesn't tell you about the direction (use $r$ for that)
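To illustrate the last property, the sketch below uses two small made-up data sets, one increasing and one decreasing, that share the same $r^2$ but have opposite signs of $r$. The helper name `correlation` and the data are assumptions for illustration.

```python
import math

def correlation(xs, ys):
    """Compute the correlation coefficient r."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical data: ys_down is ys_up with every value negated.
xs = [1, 2, 3, 4, 5]
ys_up = [2, 4, 5, 4, 5]
ys_down = [-y for y in ys_up]

r_up = correlation(xs, ys_up)       # positive association
r_down = correlation(xs, ys_down)   # negative association

print(round(r_up, 3), round(r_down, 3))
print(round(r_up ** 2, 3), round(r_down ** 2, 3))
```

Squaring removes the sign, so only $r$ can tell the two situations apart.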

What $r^2$ Doesn't Tell You

  1. Whether the relationship is truly linear (check residual plot)
  2. Whether there is causation
  3. Whether extrapolation is valid
  4. Whether there are influential points
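The first point can be seen in a small sketch: a line is fitted to clearly curved data ($y = x^2$, a made-up example), and $r^2$ still comes out high. The variable names and the data are assumptions for illustration.

```python
# Hypothetical curved data: y = x^2 for x = 1, ..., 10.
xs = list(range(1, 11))
ys = [x ** 2 for x in xs]

n = len(xs)
x_bar = sum(xs) / n
y_bar = sum(ys) / n

# Least-squares slope b and intercept a for y-hat = a + b*x.
b = (sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
     / sum((x - x_bar) ** 2 for x in xs))
a = y_bar - b * x_bar

residuals = [y - (a + b * x) for x, y in zip(xs, ys)]
ss_residual = sum(e ** 2 for e in residuals)
ss_total = sum((y - y_bar) ** 2 for y in ys)
r2 = 1 - ss_residual / ss_total

print(round(r2, 3))                       # high, roughly 0.95
print([round(e, 1) for e in residuals])   # positive, then negative, then positive
```

The residuals follow a clear U-shaped pattern, which is the signal a residual plot would give that a line is the wrong model even though $r^2$ is large.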

$r^2$ in Context

| $r^2$ | Quality of Linear Fit |
|-------|----------------------|
| 0.90+ | Excellent |
| 0.70–0.90 | Good |
| 0.50–0.70 | Moderate |
| Below 0.50 | Weak |
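The table can be read as a simple lookup, sketched below. The function name `fit_quality` is hypothetical, and treating each lower bound as inclusive (so 0.90 counts as "Excellent" and 0.70 as "Good") is an assumption, since the table does not say which category a boundary value belongs to.

```python
def fit_quality(r_squared):
    """Map r^2 to the rough labels in the table (lower bounds assumed inclusive)."""
    if not 0 <= r_squared <= 1:
        raise ValueError("r^2 must be between 0 and 1")
    if r_squared >= 0.90:
        return "Excellent"
    if r_squared >= 0.70:
        return "Good"
    if r_squared >= 0.50:
        return "Moderate"
    return "Weak"

print(fit_quality(0.7225))
```

These labels are rules of thumb; what counts as a good fit depends on the context of the data.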

Connection to Regression Output

In computer regression output, $r^2$ is often labeled:

  • "R-sq" or "R-squared"
  • "Coefficient of determination"

Taking the square root gives $|r|$; check the sign of the slope to determine the sign of $r$.
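Recovering $r$ from regression output can be sketched as follows. The function name `r_from_output` and the slope value are hypothetical; the sketch assumes the output reports R-sq as a proportion (0.7225), not as a percentage (72.25).

```python
import math

def r_from_output(r_sq, slope):
    """Return r: the square root of R-sq, carrying the sign of the slope."""
    magnitude = math.sqrt(r_sq)           # this is |r|
    return math.copysign(magnitude, slope)

# Hypothetical output: R-sq = 0.7225 with a negative slope.
r = r_from_output(0.7225, -2.1)
print(round(r, 2))
```

If the output lists R-sq as a percentage, divide by 100 before taking the square root.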

AP Tip: The most common error is confusing $r$ and $r^2$. Remember: $r$ is the correlation (direction + strength), and $r^2$ is the proportion of variability explained. Always interpret $r^2$ as a percentage in context.
