Statistics and Data Interpretation - Complete Interactive Lesson
Part 1: Mean, Median, Mode
Data Analysis & Statistics
Part 1 of 7 — Mean, Median, Mode, Range
Mean (Average)
Median
The middle value when data is ordered. For even number of values: average the two middle values.
→ median =
→ median =
Mode
Most frequent value. Can have none, one, or multiple modes.
Range
Range max min
SAT Favorites 🎯
"Adding/removing a value": If the mean of 5 numbers is 20, their sum is 100. Add a 6th number = 38: new mean .
"Which measure changes?": Adding an outlier affects the mean much more than the median.
Central Tendency 🎯
Key Takeaways — Part 1
- Mean: sum ÷ count. Use "sum = mean × count" to find missing totals
- Median: middle value (or average of two middles)
- Outliers affect the mean much more than the median
- For "add a value" problems: recalculate sum, then divide by new count
Part 2: Data Displays
Data Analysis & Statistics
Part 2 of 7 — Standard Deviation & Data Spread
Standard Deviation (σ)
Measures how spread out values are from the mean. You won't calculate it on the SAT, but you must understand it.
- Low SD: values are close to the mean (consistent data)
- High SD: values are far from the mean (varied data)
- SD = 0: all values are the same
Comparing Standard Deviations
Data A: → low SD (clustered near 50)
Data B: → high SD (spread out)
Part 3: Interpreting Tables
Data Analysis & Statistics
Part 3 of 7 — Scatterplots & Line of Best Fit
Scatterplots
Each point represents two measurements for one individual/item.
Correlation
- Positive: as increases, increases
- Negative: as increases, decreases
- No correlation: no clear pattern
- Strength: how closely points follow a line (strong vs. weak)
Line of Best Fit (Regression Line)
where:
Part 4: Standard Deviation
Data Analysis & Statistics
Part 4 of 7 — Two-Way Tables
Reading Two-Way Tables
| Cat | Dog | Total | |
|---|---|---|---|
| Male | 30 | 50 | 80 |
| Female | 40 | 30 | 70 |
| Total | 70 | 80 | 150 |
Types of Questions
Marginal frequency: What percent prefer dogs?
Joint frequency: What percent are female AND prefer cats?
Part 5: Statistical Inference
Data Analysis & Statistics
Part 5 of 7 — Probability
Basic Probability
Always between 0 (impossible) and 1 (certain).
Complement
Part 6: Problem-Solving Workshop
Data Analysis & Statistics
Part 6 of 7 — Sampling and Study Design
Types of Studies
| Type | Description | Can show causation? |
|---|---|---|
| Observational | Observe without intervention | No (only association) |
| Experiment | Randomly assign treatments | Yes! |
| Survey | Ask questions | No (only opinion) |
Random Sampling
A sample is representative if every member of the population has an equal chance of being selected.
- Random sample: conclusions can be generalized to the population
- Convenience sample (e.g., only surveying friends): results may be biased
Bias
- Selection bias: sample doesn't represent the population
- Response bias: wording of questions influences answers
- Voluntary response bias: only people with strong opinions respond
SAT Wording to Watch For
❌ "The study proves that X causes Y" — only experiments with random assignment can suggest causation.
✓ "The study suggests an association between X and Y" — appropriate for observational studies.
Part 7: Review & Applications
Data Analysis & Statistics
Part 7 of 7 — Review & SAT Mixed Practice
Data & Statistics Quick Reference
| Concept | Key Point |
|---|---|
| Mean | Sum ÷ count; affected by outliers |
| Median | Middle value; resistant to outliers |
| SD | Measures spread; add → same, multiply → changes |
| Scatterplot slope | Predicted change in per unit |
| Residual | Actual − predicted |
| Two-way tables | Watch the denominator! |
| Probability AND | Multiply (if independent) |
| Probability OR | Add, then subtract overlap |