Sampling Methods - Complete Interactive Lesson
Part 1: Types of Studies
๐ Collecting Data & Study Design
Part 1 of 7 โ Observational Studies vs. Experiments
Two Ways to Gather Data
| Study Type | Description | Can Establish Causation? |
|---|---|---|
| Observational Study | Researcher observes without intervening | โ No โ only association |
| Experiment | Researcher actively imposes treatments | โ Yes โ with proper design |
๐ Key Principle: Only a well-designed experiment can establish a cause-and-effect relationship.
Observational Studies
In an observational study, researchers simply observe and record data without manipulating any variables.
Types:
- Retrospective โ looks at past data (e.g., medical records)
- Prospective โ follows subjects forward in time (e.g., tracking diet over 10 years)
Example: Studying whether coffee drinkers have lower rates of depression by surveying existing habits.
โ ๏ธ Confounding variables lurk in observational studies. Maybe coffee drinkers also exercise more โ that could be the real reason for lower depression.
Experiments
In an experiment, the researcher imposes treatments on subjects and measures the response.
Key Elements:
| Element | Definition |
|---|---|
| Explanatory variable | What the researcher manipulates (treatment) |
| Response variable | What is measured as the outcome |
| Experimental units | The individuals being studied |
| Treatments | Specific conditions applied to units |
Concept Check ๐ฏ
Identifying Study Components ๐งฎ
A pharmaceutical company randomly assigns 200 patients to receive either a new drug or a placebo, then measures blood pressure after 8 weeks.
1) What is the explanatory variable? (drug/placebo or blood pressure)
2) How many treatment groups are there?
3) Can this study establish causation? (yes or no)
Study Design Classification ๐ฝ
Part 2: Sampling Methods
๐ฒ Sampling Methods
Part 2 of 7 โ How to Select a Representative Sample
Why Sampling Matters
We rarely have the resources to study an entire population. Instead, we take a sample and use it to make inferences about the population.
๐ Goal: The sample should be representative of the population โ every individual should have a known chance of being selected.
Probability Sampling Methods
| Method | How It Works | Advantage |
|---|---|---|
| Simple Random Sample (SRS) | Every individual has an equal chance of selection | Gold standard โ no systematic bias |
| Stratified Random Sample | Divide into groups (strata), then SRS within each | Ensures representation of all subgroups |
| Cluster Sample | Randomly select entire groups (clusters), survey all within | Cost-effective for geographically spread populations |
| Systematic Sample | Select every th individual from a list |
Part 3: Bias in Sampling
โ ๏ธ Sources of Bias
Part 3 of 7 โ What Can Go Wrong
Types of Bias
| Bias Type | What Goes Wrong | Example |
|---|---|---|
| Selection bias | Some members of the population are systematically excluded | Phone survey excludes people without phones |
| Nonresponse bias | Selected individuals don't participate | Mail survey โ people who respond may differ from those who don't |
| Response bias | Respondents give inaccurate answers | Wording of questions influences answers |
| Voluntary response bias | Only people with strong opinions respond | Online polls attract extremists |
| Undercoverage | Part of the population has no chance of being selected | Using a phone book misses unlisted numbers |
๐ A biased sampling method will produce biased results no matter how large the sample.
Reducing Bias
- Use random selection to avoid selection bias
- Follow up with to reduce nonresponse bias
Part 4: Experimental Design
๐ฌ Principles of Experimental Design
Part 4 of 7 โ Control, Randomize, Replicate, Block
Four Principles of Good Experiments
| Principle | What It Means | Why It Matters |
|---|---|---|
| Control | Hold extraneous variables constant or use a control group | Isolates the effect of the treatment |
| Randomization | Randomly assign subjects to treatment groups | Equalizes confounding variables across groups |
| Replication | Use enough subjects to detect real effects | Reduces chance variation |
| Blocking | Group similar subjects together, then randomize within blocks | Controls for known sources of variation |
๐ Random assignment โ reduces confounding โ supports causal claims
Completely Randomized Design
The simplest experimental design:
- Pool all experimental units
- Randomly assign each to a treatment group
- Compare responses
Example: 60 patients randomly assigned to Drug A (30) vs. Placebo (30)
Randomized Block Design
Part 5: Random Variables
๐ฐ Random Variables & Expected Value
Part 5 of 7 โ Discrete Random Variables
What Is a Random Variable?
A random variable assigns a numerical value to each outcome of a random process.
| Type | Values | Example |
|---|---|---|
| Discrete | Countable (finite or countably infinite) | Number of heads in 10 flips |
| Continuous | Any value in an interval | Height, weight, time |
Probability Distribution of a Discrete RV
A table showing all values and their probabilities:
| 0 | 1 |
|---|
Part 6: Problem-Solving Workshop
๐ ๏ธ Problem-Solving Workshop
Part 6 of 7 โ Applying Study Design Concepts
Strategy for AP Statistics Study Design Questions
- Identify the study type โ Is a treatment being imposed? If yes โ experiment. If no โ observational.
- Check for bias โ Look for selection bias, nonresponse, response bias, voluntary response.
- Identify confounding โ What other variables could explain the observed relationship?
- Evaluate design โ Does it use random assignment? Control group? Blinding? Blocking?
Worked Example 1
Scenario: A school wants to test whether a new math curriculum improves test scores. They implement the new curriculum in School A and keep the old one in School B, then compare end-of-year scores.
Analysis:
- โ Not a randomized experiment โ schools were not randomly assigned
- โ ๏ธ Confounding: Schools may differ in student demographics, teacher quality, funding
- ๐ง Better design: Randomly assign classrooms within the SAME school to old vs. new curriculum
Worked Example 2
Scenario: Researchers want to know if a new drug lowers cholesterol. They recruit 200 volunteers, randomly assign 100 to the drug and 100 to a placebo, and measure cholesterol after 3 months. Neither patients nor doctors know who gets which pill.
Analysis:
- โ Randomized experiment โ can establish causation
- โ Control group (placebo) โ accounts for placebo effect
- โ Double-blind โ reduces bias from expectations
- โ Replication โ 100 per group is adequate
Workshop Problems ๐ฏ
Part 7: Review & Applications
๐ Review & Applications
Part 7 of 7 โ Comprehensive Review
Key Concepts Summary
| Concept | Key Point |
|---|---|
| Observational vs. Experiment | Only experiments with random assignment โ causation |
| SRS | Every individual has equal probability of selection |
| Stratified | Divide into strata, SRS within each |
| Cluster | Randomly select whole groups |
| Bias | Systematic error โ not fixed by larger |
| Confounding | Third variable explains apparent relationship |
| Random assignment | Reduces confounding in experiments |
| Blocking | Control for known sources of variation |