Data Analysis and Statistics

Mean, median, mode, range, and interpreting data

Data Analysis and Statistics (SAT)

Measures of Central Tendency

Mean (Average)

Formula: $\text{Mean} = \frac{\text{Sum of all values}}{\text{Number of values}}$

Example: Test scores: 80, 85, 90, 95 $\text{Mean} = \frac{80+85+90+95}{4} = \frac{350}{4} = 87.5$

Median (Middle Value)

Steps:

Put numbers in order
Find the middle value
If even number of values, average the two middle ones

Example: 3, 7, 9, 12, 15 Median = 9 (middle value)

Example: 2, 5, 8, 11 Median = $\frac{5+8}{2} = 6.5$ (average of middle two)

Mode (Most Frequent)

The value that appears most often

Example: 2, 3, 3, 5, 7, 3, 9 Mode = 3 (appears three times)

Note: Can have no mode or multiple modes

Range

Maximum value - Minimum value

Example: 10, 15, 22, 18, 30 Range = $30 - 10 = 20$

Standard Deviation

Measures spread/variability

Low standard deviation: Data close together
High standard deviation: Data spread out

You don't need the formula! SAT gives context:

"Which dataset has greater variability?" → higher standard deviation

Reading Data from Graphs

Bar Graphs

Height shows value
Compare categories
Look for trends

Line Graphs

Shows change over time
Slope indicates rate of change
Look for increases/decreases

Scatter Plots

Each point = one observation
Look for patterns/trends
Positive/negative correlation

Box Plots (Box-and-Whisker)

Shows 5-number summary:

Minimum
Q1 (25th percentile)
Median (Q2, 50th percentile)
Q3 (75th percentile)
Maximum

Interquartile Range (IQR): $Q3 - Q1$

Interpreting Data

Correlation vs. Causation

Correlation: Two things are related Causation: One CAUSES the other

SAT Trap: Just because things correlate doesn't mean one causes the other!

Example:

Ice cream sales and drowning both increase in summer
Correlation: YES
Causation: NO (heat causes both, not each other)

Positive vs. Negative Correlation

Positive: Both increase together

Example: Study time and test scores

Negative (Inverse): One increases, other decreases

Example: Speed and travel time

No correlation: No clear pattern

Probability and Data

Basic Probability

$P(\text{event}) = \frac{\text{Number of favorable outcomes}}{\text{Total number of outcomes}}$

Two-Way Tables

| | Group A | Group B | Total | |---|---------|---------|-------| | Yes | 30 | 20 | 50 | | No | 10 | 40 | 50 | | Total | 40 | 60 | 100 |

Questions:

"What percent of Group A said Yes?" $\frac{30}{40} = 75\%$
"What percent of all respondents said Yes?" $\frac{50}{100} = 50\%$

SAT Question Types

Type 1: Calculate Mean/Median

Given data, find the measure

Use formulas/procedures above

Type 2: Effect of Adding/Removing Data

"If we add a value of 100, how does the mean change?"

Calculate new mean with additional value

Type 3: Reading Graphs

"According to the graph, in which year...?"

Direct lookup from visual

Type 4: Interpreting Studies

"The study shows that X is associated with Y. Can we conclude X causes Y?"

Usually NO - correlation ≠ causation

SAT Strategies

Mean Tricks

If all values increase by 5, mean increases by 5 If all values double, mean doubles

Median is Resistant

Outliers don't affect median much Outliers greatly affect mean

Use Calculator

Sum, mean functions save time

Read Carefully

"What percent of Group A" vs "What percent of all"

Common SAT Traps

Trap 1: Mean vs. Median Confusion

Mean = average Median = middle

Trap 2: Percent vs. Number

30% ≠ 30 people

Trap 3: Correlation = Causation

Associated ≠ causes

Trap 4: Wrong Denominator

Percent of subgroup vs. percent of total

SAT Tips

Median: Always order the data first!
Mean: Sum ÷ count
Outliers affect mean more than median
Read graph labels carefully (units, scale)
Check denominators for percent questions
Correlation ≠ Causation on SAT!

📚 Practice Problems

1Problem 1easy

❓ Question:

Find the median of: 12, 8, 15, 10, 9

💡 Show Solution

Solution:

Step 1: Put in order $8, 9, 10, 12, 15$

Step 2: Find middle value 5 numbers, so middle is 3rd value

Answer: 10

SAT Tip: ALWAYS order the data first when finding median!

2Problem 2medium

❓ Question:

A dataset has 5 values with a mean of 20. If a 6th value of 32 is added, what is the new mean?

💡 Show Solution

Solution:

Original sum: $5 \times 20 = 100$

Add new value: $100 + 32 = 132$

New mean: $\frac{132}{6} = 22$

Answer: 22

SAT Tip: Mean × count = sum. Use this to find totals!

3Problem 3hard

❓ Question:

A study found that students who eat breakfast tend to have higher test scores. Which conclusion is valid?

A) Eating breakfast causes higher test scores B) There is an association between eating breakfast and test scores C) Students should be required to eat breakfast D) Skipping breakfast lowers intelligence

💡 Show Solution

Solution:

Key word: "tend to" = correlation/association

Check each:

A) Causes - too strong! Correlation ≠ causation ❌
B) Association - this is what the data shows ✓
C) Should be required - policy decision, not data conclusion ❌
D) Lowers intelligence - causation + extreme claim ❌

Answer: B - There is an association

SAT Tip: Studies show correlation/association. Saying "causes" requires controlled experiments!

🎴

Practice with Flashcards

Review key concepts with our flashcard system

📖

Browse All Topics

Explore other calculus topics