Normal Distributions
Properties and calculations with normal curves
Normal Distributions
Introduction
The normal distribution (also called Gaussian distribution or bell curve) is the most important probability distribution in statistics. Many natural phenomena approximately follow a normal distribution, and it forms the foundation for much of statistical inference.
Characteristics of Normal Distributions
Shape Properties
1. Bell-shaped curve:
- Symmetric around the center
- Single peak at the mean
- Tails extend infinitely in both directions (approaching but never touching the x-axis)
2. Symmetric:
- Left side mirrors right side
- Mean = Median = Mode
- If folded at center, both halves match perfectly
3. Unimodal:
- Single peak (at the mean)
- Highest point at center
- Decreases smoothly on both sides
4. Asymptotic:
- Tails get closer and closer to x-axis
- Never actually reach zero
- Theoretically extends to and
The Normal Curve Equation
Probability density function:
Don't memorize this! Just know:
- Defined by two parameters: (mean) and (standard deviation)
- Shape determined entirely by and
Parameters: Mean and Standard Deviation
Mean ()
Controls location:
- Center of distribution
- Peak of curve
- Balance point
Changing :
- Shifts distribution left or right
- Doesn't change shape
- Doesn't change spread
Example:
- Distribution A: , centered at 50
- Distribution B: , centered at 70
- B is shifted 20 units right from A
Standard Deviation ()
Controls spread:
- How spread out distribution is
- Width of bell curve
- Distance from mean to inflection points
Changing :
- Larger → wider, flatter curve
- Smaller → narrower, taller curve
- Doesn't change center
- Total area under curve stays 1.0
Example:
- Distribution A: , narrow and tall
- Distribution B: , wide and flat
- Both centered at same , but B more spread out
The Empirical Rule (68-95-99.7 Rule)
For normal distributions:
68% of data within 1 standard deviation of mean
to
95% of data within 2 standard deviations of mean
to
99.7% of data within 3 standard deviations of mean
to
Example Application
IQ scores: ,
68% of people have IQ between: and
95% of people have IQ between: and
99.7% of people have IQ between: and
Using the Empirical Rule
Quick mental calculations:
Example: Heights of adult males: inches, inches
Q: What percentage between 67 and 73 inches?
A: 67 to 73 is → 68%
Q: What percentage above 76 inches?
A: 76 is , so 95% are between 64 and 76
Above 76 = (100% - 95%) / 2 = 2.5%
Q: What percentage below 64 inches?
A: 64 is
Below 64 = (100% - 95%) / 2 = 2.5%
The Standard Normal Distribution (Z-distribution)
Definition
Standard Normal: Normal distribution with and
Denoted: or Z-distribution
Why it matters:
- Reference distribution
- All normal distributions can be standardized to this
- Tables and calculators use standard normal
Z-Scores
Z-score (standardized score): Number of standard deviations from the mean
Formula:
Where:
- = observed value
- = mean
- = standard deviation
Interpretation:
- : At the mean
- : One SD above mean
- : One SD below mean
- : 2.5 SD above mean
Calculating Z-Scores
Example: Test scores with ,
Score of 83:
Score is 1 SD above mean
Score of 67:
Score is 1 SD below mean
Score of 91:
Score is 2 SD above mean
Using Z-Scores
Purposes:
- Standardize different distributions for comparison
- Find probabilities using standard normal table
- Identify unusual values (typically |z| > 2 or 3)
- Compare across different scales
Example comparison:
Student A: Math score 85 (class , )
Student B: English score 88 (class , )
Student B performed better relative to their class (higher z-score)!
Finding Areas Under the Normal Curve
Methods
1. Empirical Rule (for z = ±1, ±2, ±3)
2. Standard Normal Table (z-table)
- Gives area to LEFT of z-score
- Also called cumulative probability
3. Calculator
- normalcdf function
- More accurate, easier
Using the Table
Area to the left of z:
- Look up z in table directly
- Example: z = 1.23 → area = 0.8907
- Meaning: 89.07% of data below z = 1.23
Area to the right of z:
- Area to right = 1 - area to left
- Example: z = 1.23 → area to left = 0.8907
- Area to right = 1 - 0.8907 = 0.1093 (10.93%)
Area between two z-scores:
- Find area to left of each
- Subtract smaller from larger
- Example: Between z = -1 and z = 1
- Area left of 1: 0.8413
- Area left of -1: 0.1587
- Between: 0.8413 - 0.1587 = 0.6826 (68.26%)
Calculator Method
TI-83/84:
normalcdf(lower, upper, mean, SD)
Examples:
Area between 65 and 75 (, ):
normalcdf(65, 75, 70, 5) → 0.6827
Area above 80:
normalcdf(80, 999999, 70, 5) → 0.0228
Area below 60:
normalcdf(-999999, 60, 70, 5) → 0.0228
Finding Values from Areas (Inverse Normal)
The Inverse Problem
Given: Probability (area)
Find: Corresponding x-value or z-score
Example: Find the score such that 90% of students score below it
Calculator Method
TI-83/84:
invNorm(area to left, mean, SD)
Examples:
90th percentile (, ):
invNorm(0.90, 70, 5) → 76.4
Meaning: 90% score below 76.4
25th percentile (Q1):
invNorm(0.25, 70, 5) → 66.6
75th percentile (Q3):
invNorm(0.75, 70, 5) → 73.4
Assessing Normality
Why It Matters
Many statistical methods assume normality. We need to check if data is approximately normal before applying these methods.
Methods to Assess Normality
1. Histogram/Dotplot:
- Look for bell shape
- Check for symmetry
- Quick visual check
2. Normal Probability Plot (Q-Q Plot):
- Plot observed values vs. expected normal values
- If roughly linear → approximately normal
- If curved or non-linear → not normal
3. Numerical Checks:
- Mean ≈ Median (symmetry)
- Few outliers by 1.5 × IQR rule
- Most data within
What to Look For
Approximately normal:
✓ Bell-shaped histogram
✓ Linear normal probability plot
✓ Mean ≈ Median
✓ About 68% within 1 SD, 95% within 2 SD
Not normal:
❌ Skewed histogram
❌ Curved normal probability plot
❌ Mean ≠ Median
❌ Many outliers
❌ Gaps or multiple peaks
Common Mistakes
❌ Assuming all data is normal
Many distributions are NOT normal!
❌ Confusing z-scores with original values
z-scores are standardized, no units
❌ Using empirical rule for non-normal data
Only valid for normal distributions
❌ Forgetting to standardize before using table
Must convert to z-scores first
❌ Reading wrong side of table
Most tables give area to LEFT
❌ Not checking normality assumption
Methods based on normality won't work if data isn't normal
Quick Reference
Normal Distribution:
- Parameters: (mean), (SD)
- Notation:
- Properties: Symmetric, bell-shaped, unimodal
Empirical Rule (68-95-99.7):
- 68% within
- 95% within
- 99.7% within
Z-Score:
- Formula:
- Interpretation: # of SDs from mean
- Standard normal: ,
Calculator Commands:
- normalcdf(lower, upper, μ, σ) for area/probability
- invNorm(area, μ, σ) for x-value
Assessing Normality:
- Histogram: bell-shaped?
- Normal plot: linear?
- Mean ≈ Median?
Remember: The normal distribution is powerful but not universal. Always check if the normality assumption is reasonable before using methods that require it!
📚 Practice Problems
No example problems available yet.
Practice with Flashcards
Review key concepts with our flashcard system
Browse All Topics
Explore other calculus topics