๐ AP Tip: On free-response questions, you MUST address all three features (shape, center, spread) AND mention any outliers. Missing any one costs points.
Shape
Shape
Description
Visual
Symmetric
Left and right halves are mirror images
Bell-shaped, uniform
Right-skewed
Long tail extends to the RIGHT
Most data clumped left
Left-skewed
Long tail extends to the LEFT
Most data clumped right
Unimodal
One peak
Single hump
Bimodal
Two peaks
Two humps
โ ๏ธ Common Mistake: The direction of skewness is the direction of the TAIL, not where most data is concentrated.
Center
Measure
Formula
Best When
Mean (xห)
xห=
Key Relationship:
Right-skewed โ mean > median (mean pulled toward tail)
Left-skewed โ mean < median (mean pulled toward tail)
Symmetric โ mean โ median
Spread
Measure
Formula
Description
Range
maxโmin
Simplest; sensitive to outliers
IQR
Q3โโ
Worked Example
Data: 2, 3, 3, 4, 4, 4, 5, 5, 6
Feature
Analysis
Shape
Roughly symmetric (approximately bell-shaped)
Center
Median = 4 (5th of 9 values); Mean = 936โ=4
Spread
Range = ; IQR =
AP-Style Description: "The distribution of values is roughly symmetric and unimodal with a center (median) of 4. The spread is moderate with an IQR of 2 and a range of 4. There are no apparent outliers."
Shape, Center, Spread Concepts ๐ฏ
Calculations ๐งฎ
Data: 1, 3, 5, 7, 9, 11, 13
1) What is the median?
2) What is the range?
3) What is the mean?
Choosing the Right Measure ๐
Exit Quiz โ Shape, Center, Spread โ
Part 2: Histograms & Dotplots
๐ Histograms & Dotplots
Part 2 of 7 โ Displaying Quantitative Data
Types of Graphs for Quantitative Data
Graph
Description
Best For
Histogram
Bars show frequency of data in intervals (bins)
Large data sets
Dotplot
Dots stacked above a number line
Small data sets; seeing individual values
Stemplot
Stems (leading digits) with leaves (trailing digits)
Moderate data sets; preserving exact values
Histograms
A histogram divides values into equal-width bins and shows the frequency (count) or relative frequency (proportion) of each bin.
Key Features:
Bars touch (no gaps between adjacent bins)
x-axis: quantitative variable (bins)
y-axis: frequency or relative frequency
Bin width affects appearance โ too few bins hide detail, too many create noise
Bin Width
Effect
Too wide (few bins)
Hides patterns in the data
Part 3: Mean vs Median
๐ข Mean vs Median
Part 3 of 7 โ Choosing the Right Measure of Center
Mean (xห)
xห=
Part 4: Standard Deviation
๐ Standard Deviation
Part 4 of 7 โ Measuring Spread from the Mean
Variance and Standard Deviation
The standard deviation (s) measures the typical distance of data values from the mean.
s=
Part 5: Normal Distribution
๐งฎ Normal Distribution & Empirical Rule
Part 5 of 7 โ The Bell Curve
The Normal Distribution
A normal distribution is a symmetric, bell-shaped curve completely described by two parameters:
Parameter
Symbol
Role
Mean
ฮผ
Center of the distribution
Standard Deviation
ฯ
Controls the width/spread
Notation:
Part 6: Problem-Solving Workshop
๐ ๏ธ Problem-Solving Workshop
Part 6 of 7 โ Putting It All Together
Worked Example 1: Describing a Distribution
Problem
A teacher records quiz scores for 20 students:
45, 52, 55, 60, 62, 65, 67, 68, 70, 72, 73, 75, 76, 78, 80, 82, 85, 88, 90, 95
Describe this distribution completely.
Solution
Shape: The distribution is roughly symmetric with a slight left skew (the lower scores stretch a bit further from center).
Center:
Mean: xห=
Part 7: Review & Applications
๐ Review & Applications
Part 7 of 7 โ Comprehensive Review
Complete Summary
Describing Distributions Checklist
Feature
Measures
Notes
Shape
Symmetric, left/right-skewed, unimodal/bimodal
Direction of TAIL = direction of skew
Center
Mean (xห), Median (M)
Symmetric โ mean; Skewed โ median
Spread
n
โxiโ
โ
Distribution is roughly symmetric
Median (M)
Middle value when data is ordered
Distribution is skewed or has outliers
Q1โ
Middle 50% of data; resistant to outliers
Standard Deviation (s)
s=nโ1โ(xiโโxห)2โโ
Average distance from mean
6
โ
2=
4
Q3โโQ1โ=5โ3=2
Outliers
None apparent
Too narrow (many bins)
Creates too much noise
Just right
Reveals the overall shape clearly
Dotplots
Each data value is represented by a dot above a number line. Dots stack when values repeat.
Example:
Data: 1, 2, 2, 3, 3, 3, 4, 4, 5
Value
1
2
3
4
5
Dots
โข
โขโข
โขโขโข
โขโข
โข
Shape: symmetric
Center: 3
Spread: range = 4
No obvious outliers
Stemplots (Stem-and-Leaf Plots)
Component
Role
Stem
Leading digit(s)
Leaf
Trailing digit
Key
Tells how to read the values (e.g., 3
Example: Ages: 21, 23, 25, 31, 34, 38, 42, 45
Stem
Leaves
2
1 3 5
3
1 4 8
4
2 5
Key: 2|1 = 21 years
Back-to-back stemplots compare two groups using the same stems.
Reading Graphs โ What to Look For
Feature
What to Check
Shape
Symmetric? Skewed? Unimodal? Bimodal?
Center
Where is the "middle" of the data?
Spread
How spread out are the values?
Outliers
Any values far from the rest?
Gaps/Clusters
Are there groups of data with gaps between them?
๐ AP Tip: When comparing two distributions, use comparative language: "Distribution A is more spread out than Distribution B" rather than describing each separately.
The mean is the "balance point" of the distribution
Median (M)
Procedure:
Order all data from smallest to largest
If n is odd: median = middle value (position 2n+1โ)
If n is even: median = average of the two middle values
Property
Detail
Uses every data value
No โ only the position matters
Sensitive to outliers
No โ resistant to extreme values
Best for
Skewed distributions or data with outliers
When to Use Each
Situation
Use
Why
Symmetric data
Mean or Median
They're approximately equal
Right-skewed data
Median
Mean is inflated by the right tail
Left-skewed data
Median
Mean is deflated by the left tail
Data with outliers
Median
Mean is pulled by outliers
Need to calculate totals
Mean
Total=xหรn
Worked Example
Data: 2, 4, 6, 8, 100
Measure
Calculation
Result
Mean
52+4+6+8+100โ=5120โ
24
Median
Middle (3rd) value of ordered data
6
The outlier 100 pulls the mean up to 24, but the median (6) better represents the typical value.
๐ AP Tip: When asked "which measure of center is more appropriate," always explain WHY. Connect your answer to the shape of the distribution or the presence of outliers.
Effect of Transformations
Transformation
Effect on Mean
Effect on Median
Add constant c to all values
Add c
Add c
Multiply all values by k
Multiply by k
Multiply by k
Add an outlier
Pulled toward outlier
Minimal change
Remove an outlier
Moves back toward center
Minimal change
Mean vs Median Concepts ๐ฏ
Calculations ๐งฎ
1) Mean of: 8, 12, 16, 20, 24
2) Median of: 3, 7, 9, 15, 21, 25
3) Mean of: 1, 2, 3, 4, 100
Choosing the Right Measure ๐
Exit Quiz โ Mean vs Median โ
nโ1
โi=1nโ(xiโโxห)2
โ
โ
Term
Definition
xiโโxห
Deviation โ how far each value is from the mean
(xiโโxห)2
Squared deviation โ removes negatives
nโ1โ(xiโโxห
s=s2โ
Standard deviation โ back in original units
โ ๏ธ Why nโ1? We divide by nโ1 (not n) because the sample mean xห is estimated from the data. This gives a better estimate of the population SD. The value nโ1 is called the degrees of freedom.
Properties of Standard Deviation
Property
Detail
sโฅ0 always
Standard deviation can never be negative
s=0
Only when ALL values are identical
Units
Same units as the original data
Sensitive to outliers
Yes โ outliers inflate s
Affected by transformations
Adding c: s unchanged. Multiplying by k: s multiplied by $
Worked Example
Data: 2, 4, 6, 8, 10. Find s.
Step 1: Mean xห=52+4+6+8+10โ=530โ=6
Step 2: Deviations and squared deviations:
xiโ
xiโโxห
(xiโโxห)2
2
โ4
16
4
โ2
4
6
0
0
8
2
4
10
4
16
Sum
40
Step 3: Variance: s2=5โ140โ=440โ=10
Step 4: Standard deviation: s=10โโ3.16
Interpretation: The values are typically about 3.16 units from the mean of 6.
Effect of Transformations on s
Transformation
Effect on s
Example
Add constant c
s unchanged
Data + 10: same spread
Multiply by k
$s \times
k
๐ AP Tip: Adding a constant shifts all values equally, so the spread doesn't change. Multiplying stretches or compresses the data, changing the spread.
Standard Deviation Concepts ๐ฏ
SD Calculations ๐งฎ
1) Data: 3, 5, 7. Mean = 5. What is the variance s2? (Hint: sum of squared deviations รท (nโ1))
2) What is the standard deviation s of the data above?
3) Data: 10, 10, 10. Standard deviation s = ?
SD Properties ๐
Exit Quiz โ Standard Deviation โ
X
โผ
N(ฮผ,ฯ)
The Empirical Rule (68-95-99.7 Rule)
For any normal distribution:
Range
Percentage
ฮผยฑ1ฯ
68% of data
ฮผยฑ2ฯ
95% of data
ฮผยฑ3ฯ
99.7% of data
Worked Example 1: Empirical Rule
IQ scores: ฮผ=100, ฯ=15
Range
Interval
Percentage
ฮผยฑ1ฯ
100ยฑ15=[85,115]
68%
ฮผยฑ2ฯ
100ยฑ30=[70,130]
95%
ฮผยฑ3ฯ
100ยฑ45=[55,145]
99.7%
So about 68% of IQ scores fall between 85 and 115.
Z-Scores
A z-score standardizes any value by telling how many SDs it is from the mean:
z=ฯxโฮผโ
z-score
Meaning
z=0
At the mean
z=1
1 SD above the mean
z=โ2
2 SDs below the mean
๐ Key Insight: Z-scores let you compare values from DIFFERENT distributions. A student with z=1.5 on a math test did relatively better than a student with z=0.8 on an English test.
Worked Example 2: Z-Scores
SAT Math: ฮผ=500, ฯ=100. A student scores 680.
z=100680โ500โ=100180โ=1.8
This score is 1.8 standard deviations above the mean.
Using the Normal Distribution
To find the percentage of values below a value x:
Calculate z=ฯxโฮผโ
Use the z-table (or calculator: normalcdf) to find the area
Calculator Command
Purpose
normalcdf(lower, upper, ฮผ, ฯ)
Area between two values
invNorm(area, ฮผ, ฯ)
Value at a given percentile
โ ๏ธ AP Tip: Always show the z-score calculation AND draw a sketch of the normal curve with the area shaded.
Normal Distribution Concepts ๐ฏ
Z-Score Calculations ๐งฎ
Distribution: ฮผ=200, ฯ=25
1) What is the z-score for x=250?
2) What value has z=โ1?
3) The 68% interval is [lower,upper]. What is the lower bound?
Normal Curve Properties ๐
Exit Quiz โ Normal Distribution โ
20
โxiโ
โ
=
201458โ=
72.9
Median: Average of 10th and 11th values = 272+73โ=72.5
Spread:
Range: 95โ45=50
IQR: Q1โ=63.5, Q3โ=81, IQR =81โ63.5=17.5
Outliers: Using the 1.5 ร IQR rule:
Lower fence: 63.5โ1.5(17.5)=37.25 โ no values below this
Upper fence: 81+1.5(17.5)=107.25 โ no values above this
No outliers
AP-Style Response: "The distribution of quiz scores is roughly symmetric and unimodal with a center (median) of 72.5 points. The scores have moderate spread with an IQR of 17.5 points and a range of 50 points. There are no apparent outliers."
Worked Example 2: Normal Distribution Application
Problem
Heights of adult women are normally distributed with ฮผ=64.5 inches and ฯ=2.5 inches.
(a) What percentage of women are taller than 69.5 inches?
(b) What height corresponds to the 16th percentile?
Solution
(a)z=2.569.5โ64.5โ=2.55โ=2.0
69.5 inches is exactly ฮผ+2ฯ. By the Empirical Rule, 95% of data falls within ฮผยฑ2ฯ, so 5% is outside this range, and 2.5% is above ฮผ+2ฯ.
Answer: About 2.5% of women are taller than 69.5 inches.
(b) The 16th percentile means 16% of data is below this value. By the Empirical Rule:
50% is below the mean
34% is between ฮผโฯ and ฮผ (half of 68%)
So 16% is below ฮผโฯ
ฮผโฯ=64.5โ2.5=62.0 inches
Answer: The 16th percentile is approximately 62.0 inches.
๐ AP Tip: On free-response questions, SHOW the fence calculations when identifying outliers. Don't just say "there are outliers" โ prove it mathematically.
Workshop Practice ๐ฏ
Calculations ๐งฎ
1)ฮผ=100, ฯ=15. What percentage of data is between 85 and 115? (answer as whole number)
2)Q1โ=30, Q3โ=50. What is the IQR?
3) Using the values above, what is the lower fence for outliers?
Decision Making ๐
Exit Quiz โ Problem-Solving Workshop โ
Range, IQR, SD (s)
Symmetric โ SD; Skewed โ IQR
Outliers
1.5 ร IQR rule
Always check and mention
Formula Reference
Formula
Expression
Mean
xห=nโxiโโ
Median position
(n+1)/2
Range
max โ min
IQR
Q3โโQ1โ
Variance
s2=nโ1โ(x
Standard Deviation
s=s2โ
Z-score
z=ฯxโฮผโ
Lower fence
Q1โโ1.5รIQR
Upper fence
Q3โ+1.5รIQR
Mean vs Median Decision Guide
If...
Then use...
Because...
Data is symmetric, no outliers
Mean and SD
Mean uses all values; SD pairs with mean
Data is skewed
Median and IQR
Both are resistant to skewness
Data has outliers
Median and IQR
Both are resistant to outliers
You need to find totals
Mean
Total = xหรn
The Empirical Rule (Normal Distributions Only)
Range
Percentage
Above upper
Below lower
ฮผยฑ1ฯ
68%
16%
16%
ฮผยฑ2ฯ
95%
2.5%
2.5%
ฮผยฑ3ฯ
99.7%
0.15%
0.15%
Key Concepts from Every Part
Part
Topic
Essential Takeaway
1
Shape, Center, Spread
Always describe all three + outliers
2
Histograms & Dotplots
Visual displays for quantitative data
3
Mean vs Median
Skewed โ median; Symmetric โ mean
4
Standard Deviation
Measures typical distance from mean; sโฅ0
5
Normal Distribution
68-95-99.7 rule; z-scores standardize
6
Problem-Solving
Combine concepts; show work; use context
Transformations Summary
Operation
Effect on Center
Effect on Spread
Add c
Add c
No change
Multiply by k
Multiply by k
Multiply by $
๐ Final AP Tip: On every free-response question, use CONTEXT. Don't just say "the distribution is right-skewed with a center of 72." Say "the distribution of quiz scores is right-skewed with a median of 72 points."
Comprehensive Review ๐ฏ
Formula Application ๐งฎ
1) Data: 10, 15, 20, 25, 30. What is the mean?
2)ฮผ=80, ฯ=6. What z-score corresponds to x=92?
3)Q1โ=25, Q3โ=45. What is the upper fence for outliers?