Sampling Methods - Complete Interactive Lesson

Part 1: Types of Studies

📊 Collecting Data & Study Design

Part 1 of 7 — Observational Studies vs. Experiments

Two Ways to Gather Data

Study Type	Description	Can Establish Causation?
Observational Study	Researcher observes without intervening	❌ No — only association
Experiment	Researcher actively imposes treatments	✅ Yes — with proper design

🔑 Key Principle: Only a well-designed experiment can establish a cause-and-effect relationship.

Observational Studies

In an observational study, researchers simply observe and record data without manipulating any variables.

Types:

Retrospective — looks at past data (e.g., medical records)
Prospective — follows subjects forward in time (e.g., tracking diet over 10 years)

Example: Studying whether coffee drinkers have lower rates of depression by surveying existing habits.

⚠️ Confounding variables lurk in observational studies. Maybe coffee drinkers also exercise more — that could be the real reason for lower depression.

Experiments

In an experiment, the researcher imposes treatments on subjects and measures the response.

Key Elements:

Element	Definition
Explanatory variable	What the researcher manipulates (treatment)
Response variable	What is measured as the outcome
Experimental units	The individuals being studied
Treatments	Specific conditions applied to units

Concept Check 🎯

Identifying Study Components 🧮

A pharmaceutical company randomly assigns 200 patients to receive either a new drug or a placebo, then measures blood pressure after 8 weeks.

1) What is the explanatory variable? (drug/placebo or blood pressure)

2) How many treatment groups are there?

3) Can this study establish causation? (yes or no)

Study Design Classification 🔽

Part 2: Sampling Methods

🎲 Sampling Methods

Part 2 of 7 — How to Select a Representative Sample

Why Sampling Matters

We rarely have the resources to study an entire population. Instead, we take a sample and use it to make inferences about the population.

🔑 Goal: The sample should be representative of the population — every individual should have a known chance of being selected.

Probability Sampling Methods

Method	How It Works	Advantage
Simple Random Sample (SRS)	Every individual has an equal chance of selection	Gold standard — no systematic bias
Stratified Random Sample	Divide into groups (strata), then SRS within each	Ensures representation of all subgroups
Cluster Sample	Randomly select entire groups (clusters), survey all within	Cost-effective for geographically spread populations
Systematic Sample	Select every $k$ th individual from a list

Part 3: Bias in Sampling

⚠️ Sources of Bias

Part 3 of 7 — What Can Go Wrong

Types of Bias

Bias Type	What Goes Wrong	Example
Selection bias	Some members of the population are systematically excluded	Phone survey excludes people without phones
Nonresponse bias	Selected individuals don't participate	Mail survey — people who respond may differ from those who don't
Response bias	Respondents give inaccurate answers	Wording of questions influences answers
Voluntary response bias	Only people with strong opinions respond	Online polls attract extremists
Undercoverage	Part of the population has no chance of being selected	Using a phone book misses unlisted numbers

🔑 A biased sampling method will produce biased results no matter how large the sample.

Reducing Bias

Use random selection to avoid selection bias
Follow up with to reduce nonresponse bias

Part 4: Experimental Design

🔬 Principles of Experimental Design

Part 4 of 7 — Control, Randomize, Replicate, Block

Four Principles of Good Experiments

Principle	What It Means	Why It Matters
Control	Hold extraneous variables constant or use a control group	Isolates the effect of the treatment
Randomization	Randomly assign subjects to treatment groups	Equalizes confounding variables across groups
Replication	Use enough subjects to detect real effects	Reduces chance variation
Blocking	Group similar subjects together, then randomize within blocks	Controls for known sources of variation

🔑 Random assignment → reduces confounding → supports causal claims

Completely Randomized Design

The simplest experimental design:

Pool all experimental units
Randomly assign each to a treatment group
Compare responses

Example: 60 patients randomly assigned to Drug A (30) vs. Placebo (30)

Randomized Block Design

Part 5: Random Variables

🎰 Random Variables & Expected Value

Part 5 of 7 — Discrete Random Variables

What Is a Random Variable?

A random variable $X$ assigns a numerical value to each outcome of a random process.

Type	Values	Example
Discrete	Countable (finite or countably infinite)	Number of heads in 10 flips
Continuous	Any value in an interval	Height, weight, time

Probability Distribution of a Discrete RV

A table showing all values and their probabilities:

$x$	0	1

Part 6: Problem-Solving Workshop

🛠️ Problem-Solving Workshop

Part 6 of 7 — Applying Study Design Concepts

Strategy for AP Statistics Study Design Questions

Identify the study type — Is a treatment being imposed? If yes → experiment. If no → observational.
Check for bias — Look for selection bias, nonresponse, response bias, voluntary response.
Identify confounding — What other variables could explain the observed relationship?
Evaluate design — Does it use random assignment? Control group? Blinding? Blocking?

Worked Example 1

Scenario: A school wants to test whether a new math curriculum improves test scores. They implement the new curriculum in School A and keep the old one in School B, then compare end-of-year scores.

Analysis:

❌ Not a randomized experiment — schools were not randomly assigned
⚠️ Confounding: Schools may differ in student demographics, teacher quality, funding
🔧 Better design: Randomly assign classrooms within the SAME school to old vs. new curriculum

Worked Example 2

Scenario: Researchers want to know if a new drug lowers cholesterol. They recruit 200 volunteers, randomly assign 100 to the drug and 100 to a placebo, and measure cholesterol after 3 months. Neither patients nor doctors know who gets which pill.

Analysis:

✅ Randomized experiment — can establish causation
✅ Control group (placebo) — accounts for placebo effect
✅ Double-blind — reduces bias from expectations
✅ Replication — 100 per group is adequate

Workshop Problems 🎯

Part 7: Review & Applications

📋 Review & Applications

Part 7 of 7 — Comprehensive Review

Key Concepts Summary

Concept	Key Point
Observational vs. Experiment	Only experiments with random assignment → causation
SRS	Every individual has equal probability of selection
Stratified	Divide into strata, SRS within each
Cluster	Randomly select whole groups
Bias	Systematic error — not fixed by larger $n$
Confounding	Third variable explains apparent relationship
Random assignment	Reduces confounding in experiments
Blocking	Control for known sources of variation

Sampling Methods

Simple Random Sample (SRS)

Stratified Random Sampling

Matched Pairs Design

Expected Value (Mean)

Variance and Standard Deviation