🎯⭐ INTERACTIVE LESSON

Biostatistics for the MCAT

Learn step-by-step with interactive practice!

← Back to Standard Lesson

Biostatistics for the MCAT - Complete Interactive Lesson

Part 1: Descriptive Statistics & Data Distributions

Biostatistics Fundamentals

Part 1 of 4 — Descriptive Statistics & Data Distributions

Types of Data

Type	Definition	Examples
Continuous	Can take any value in a range	Height, weight, temperature, time
Discrete	Can only take specific values	Number of cells, number of mutations
Nominal	Categorical, no order	Blood type (A, B, AB, O)
Ordinal	Categorical, with order	Stage of cancer (I, II, III, IV)

Measures of Central Tendency

Measure	Definition	When to Use
Mean	Sum of values ÷ count	Normal distribution; sensitive to outliers
Median	Middle value	Skewed data; resistant to outliers
Mode	Most frequent value	Categorical data

Example: A drug trial shows patient recovery times: 5, 6, 7, 8, 100 days.

Mean = 25.2 days (affected by outlier)
Median = 7 days (better representation)

Measures of Spread

Measure	Formula	Interpretation
Range	Max − Min	Spread across all data
Variance	$\sigma^2 = \frac{\sum(x - \bar{x})^2}{n}$

68-95-99.7 Rule (Normal Distribution):

68% of data within 1 SD of mean
95% within 2 SD
99.7% within 3 SD

Descriptive Statistics 🎯

Key Takeaways — Part 1

Central Tendency: Use median for skewed data; mean for symmetric distributions
Spread: SD is most useful on MCAT; interpret via 68-95-99.7 rule
Outliers: Robust stats (median, IQR) better than mean ± SD when outliers present
Log scales: Many biomedical values are log-normally distributed (viral loads, enzyme concentrations—use log-transform)

Worked Examples — Descriptive Statistics

<details> <summary>Example 1: Choose mean vs median with an outlier</summary>

Data: 4, 5, 5, 6, 40

Mean = 60/5 = 12.
Median = 5.
Outlier (40) inflates the mean.

Best central tendency: median.

</details> <details> <summary>Example 2: Use the 68-95-99.7 rule</summary>

Mean = 70, SD = 5. Estimate the range containing about 95% of values.

95% is roughly mean ± 2 SD.
70 ± 10 gives 60 to 80.

Approximate 95% interval: 60 to 80.

</details> <details> <summary>Example 3: Interpret standard deviation practically</summary>

Two test forms have the same mean score (80). Form A has SD 3; Form B has SD 12.

Same mean means same average performance.
Lower SD means scores cluster more tightly.
Higher SD means performance is more variable.

Conclusion: Form A is more consistent across students.

</details>

Part 2: Hypothesis Testing & p-values

Biostatistics Fundamentals

Part 2 of 4 — Hypothesis Testing & p-values

Hypothesis Types

Hypothesis	Definition	Example
Null (H₀)	No effect or difference	The drug has no effect on blood pressure
Alternative (H₁)	There is an effect	The drug lowers blood pressure

One-tailed vs Two-tailed:

One-tailed: Predicts direction (Drug lowers BP) → p-value not split
Two-tailed: No direction (Drug changes BP) → p-value split between tails

Type I & II Errors

Error	What Happens	Probability
Type I	Reject H₀ when it's true (False positive)	$\alpha$ (significance level)

Part 3: Confidence Intervals & Effect Size

Biostatistics Fundamentals

Part 3 of 4 — Confidence Intervals & Effect Size

Confidence Intervals (CI)

A CI gives a range where the true parameter likely lies (unlike a single p-value).

95% CI = Sample mean ± 1.96 × SE
(where SE = SD / √n)

Interpretation: "We are 95% confident the true population mean falls within this range."

CI Width	What it means
Narrow CI	More precise estimate (good sample size)
Wide CI	Less precise estimate (small sample size)
CI doesn't cross 0	Statistically significant difference
CI crosses 0	Not statistically significant

Example: Study finds mean blood pressure reduction of 10 mmHg (95% CI: 5–15 mmHg).

Interpretation: Likely true reduction is between 5–15 mmHg
Since CI doesn't include 0, the effect is significant

Effect Size

Effect size quantifies magnitude of difference (independent of sample size).

Measure	What it shows	Range
Cohen's d

Part 4: Correlation, Causation & Study Design

Biostatistics Fundamentals

Part 4 of 4 — Correlation vs Causation & Study Design Implications

Correlation Coefficient (r)

Measures strength and direction of linear relationship between two variables.

r = -1 → Perfect negative correlation
r = 0 → No correlation
r = +1 → Perfect positive correlation

r Value	Interpretation
±0.0–0.3	Weak correlation
±0.3–0.7	Moderate correlation
±0.7–1.0	Strong correlation

Critical: r close to ±1 does NOT prove causation!

Correlation ≠ Causation

Three mechanisms for correlation:

Causation: X → Y (aspirin → reduced heart attack risk)
Reverse Causation: Y → X (depression ← poor health status)
Confounding Variable: Z → both X and Y (smoking → both yellow teeth AND lung cancer)

Example: Ice cream sales correlate with drowning deaths.

Confounder: Summer heat drives both (neither causes the other)

Study Design & Confounders

Design	Controls Confounders?

Critical: α = 0.05 means 5% chance of Type I error (standard MCAT threshold)

p-value Interpretation

p < 0.05 → Reject H₀ (statistically significant)
p ≥ 0.05 → Fail to reject H₀ (not significant)

Example: A study finds p = 0.03 for a new antibiotic efficacy.

Interpretation: 3% chance these results occurred by random chance if the antibiotic has no real effect
Conclusion: Reject H₀; the antibiotic likely has real efficacy

Power = 1 − β (ability to detect a true effect)
Larger sample size → More power
Higher power = better study (typically aim for 80%+ power)

Hypothesis Testing 🎯

Key Takeaways — Part 2

p-value = Probability of observing data (or more extreme) if H₀ is true
α=0.05 = 5% threshold; p < 0.05 → reject H₀
Type I error (α): False positive; Type II error (β): False negative
Power (1−β): Increases with larger sample size; typical goal is 80%+
MCAT Tip: p < 0.05 = statistically significant; always check the p-value first

Worked Examples — Hypothesis Testing & p-values

<details> <summary>Example 1: Decide significance quickly</summary>

Study result: p = 0.03, α = 0.05.

Compare p to α.
Since 0.03 < 0.05, reject H₀.

Conclusion: statistically significant finding.

</details> <details> <summary>Example 2: Identify a Type I error</summary>

A test concludes a drug works, but in reality it does not.

Rejected H₀ when H₀ was true.
This is a false positive.

Error type: Type I error (α).

</details> <details> <summary>Example 3: Why larger sample size helps</summary>

Small trial p = 0.07; larger trial on same effect p = 0.02.

Larger n reduces standard error.
Smaller uncertainty improves ability to detect true effects.
Power increases, reducing Type II error risk.

Takeaway: bigger sample, higher power.

</details>

Example: Two antacid drugs show:

Drug A: Mean relief = 7 hours (Large sample, p=0.001)
Drug B: Mean relief = 6.9 hours (Huge sample, p=0.02)
p-value suggests B is "significant," but effect size is trivial (~0.01 hours difference)

Confidence Intervals & Effect Size 🎯

Key Takeaways — Part 3

95% CI: Range where true population parameter likely falls
Narrow CI = Better precision (larger N); CI crosses 0 = Not significant
Effect Size: Magnitude of difference (Cohen's d, OR, r); independent of sample size
p-value vs Effect Size: p-value answers "Is there an effect?" (yes/no). Effect size answers "How big?"
MCAT Tip: Always check both—significant p-value ≠ meaningful effect; large CI suggests underpowered study

Worked Examples — Confidence Intervals & Effect Size

<details> <summary>Example 1: Interpret a confidence interval</summary>

Treatment effect = 4 units, 95% CI: 1 to 7.

Interval does not include 0.
Effect is statistically significant.
True effect is plausibly between 1 and 7 units.

Conclusion: significant positive effect with moderate precision.

</details> <details> <summary>Example 2: Compare precision between studies</summary>

Study A CI: 10 to 30. Study B CI: 18 to 22.

Study B has a much narrower CI.
Narrower CI means lower uncertainty in estimate.

More precise estimate: Study B.

</details> <details> <summary>Example 3: p-value vs practical importance</summary>

A huge sample finds p < 0.001 for a score increase of 0.2 points.

p-value says the effect is unlikely due to chance.
Effect size is tiny.
Statistical significance does not guarantee clinical relevance.

Takeaway: evaluate both p-value and effect size.

</details>

RCT Gold Standard: Randomization balances known and unknown confounders across groups.

Observational Study: 
  Does statin use → lower cholesterol?
  (Confounded by: diet, exercise, genetics)

RCT Gold Standard:
  Randomize patients to statin vs placebo
  (Randomization balances confounders)

Correlation, Causation & Study Bias 🎯

Key Takeaways — Part 4

Correlation (r): Measures linear relationship; high |r| ≠ causation
Confounding: Third variable influences both exposure and outcome
Study Hierarchy (for causation inference): Observational < Case-Control < Cohort < RCT
RCT Gold Standard: Randomization balances known and unknown confounders
Temporal Relationship: Exposure must precede outcome for causation (cohort better than cross-sectional)
MCAT Tip: Always ask "Could a confounder explain this correlation?" and "What study design would prove causation?"

Worked Examples — Correlation, Causation & Design

<details> <summary>Example 1: Correlation does not prove causation</summary>

Data show coffee intake correlates with heart disease.

Correlation indicates association only.
Smoking could confound both coffee intake and disease risk.
Need stronger design (e.g., randomized intervention) for causal inference.

Conclusion: association present, causation unproven.

</details> <details> <summary>Example 2: Rank study designs for causal strength</summary>

Given cross-sectional, case-control, cohort, and RCT:

RCT is strongest due to randomization.
Cohort is next because exposure precedes outcome.
Case-control and cross-sectional are more confounded.

Strongest to weakest: RCT > cohort > case-control > cross-sectional.

</details> <details> <summary>Example 3: Identify temporal logic</summary>

Study records current depression and current sleep quality at one time point.

Exposure and outcome measured simultaneously.
Cannot determine which came first.
Reverse causation remains possible.

Takeaway: temporal ambiguity weakens causal claims.

</details>

Observational	Poor (confounding risk)	Weak
Case-Control	Better (matching)	Moderate
Cohort	Good (prospective tracking)	Good
RCT	Excellent (randomization)	Strong

Biostatistics for the MCAT

Biostatistics for the MCAT - Complete Interactive Lesson

Part 1: Descriptive Statistics & Data Distributions

Biostatistics Fundamentals

Types of Data

Measures of Central Tendency

Measures of Spread

Key Takeaways — Part 1

Worked Examples — Descriptive Statistics

Part 2: Hypothesis Testing & p-values

Biostatistics Fundamentals

Hypothesis Types

Type I & II Errors

Part 3: Confidence Intervals & Effect Size

Biostatistics Fundamentals

Confidence Intervals (CI)

Effect Size

Part 4: Correlation, Causation & Study Design

Biostatistics Fundamentals

Correlation Coefficient (r)

Correlation ≠ Causation

Study Design & Confounders

p-value Interpretation

Power & Sample Size

Key Takeaways — Part 2

Worked Examples — Hypothesis Testing & p-values

Key Takeaways — Part 3

Worked Examples — Confidence Intervals & Effect Size

Key Takeaways — Part 4

Worked Examples — Correlation, Causation & Design