TB sections 4.1
2024-10-30
Illustrate how information from several samples are connected to the population and to the sampling distribution
Understand how the sampling distribution of the sample means relates to a sample and the population distribution
Apply the Central Limit Theorem to approximate the sampling distribution of the sample mean
(Target) Population
Sample
When we want to estimate features of the population
Much easier to measure statistics from our sample (Lesson 1)
However, statistics from our sample are not exactly the same as the population measurements that we’re aiming for
We call the population measurements population parameters
Population parameter
Sample statistic (point estimate)
Understand how the sampling distribution of the sample means relates to a sample and the population distribution
Apply the Central Limit Theorem to approximate the sampling distribution of the sample mean
Variation in population (\(\sigma\)):
\[ \mu = 65 \text{ inches}\] \[ \sigma = 3 \text{ inches}\]
Variation in population (\(\sigma\)):
\[ \mu = 65 \text{ inches}\] \[ \sigma = 3 \text{ inches}\]
Variation within samples (\(s\)):
Variation in population (\(\sigma\)):
\[ \mu = 65 \text{ inches}\] \[ \sigma = 3 \text{ inches}\]
Variation within samples (\(s\)):
Variation between samples (\(SE\)):
\[ \mu_{\overline{X}} = 65.002 \text{ inches}\] \[ SE = 0.421 \text{ inches}\]
The sampling distribution is the distribution of sample means calculated from repeated random samples of the same size from the same population
It is useful to think of a particular sample statistic as being drawn from a sampling distribution
Variation between samples (\(SE\)):
\[ \mu_{\overline{X}} = 65.002 \text{ inches}\] \[ SE = 0.421 \text{ inches}\]
How are the center, shape, and spread similar and/or different?
The sampling distribution of \(\overline{X}\) is a theoretical concept
Illustrate how information from several samples are connected to the population and to the sampling distribution
Understand how the sampling distribution of the sample means relates to a sample and the population distribution
If a sample consists of at least 30 independent observations, then the sampling distribution of the sample mean is approximated by a normal model
Aka, for “large” sample sizes ( \(n\geq 30\) ),
This is regardless of the original sample is from a different distribution
The sampling distribution is the distribution of sample means calculated from repeated random samples of the same size from the same population
It is useful to think of a particular sample statistic as being drawn from a sampling distribution
With CLT and \(\overline{X}\) as the RV for the sampling distribution
Variation between samples (\(SE\)):
\[ \mu_{\overline{X}} = 64.996 \text{ inches}\] \[ SE = 0.291 \text{ inches}\]
CLT tells us that we can model the sampling distribution of mean heights using a normal distribution:
Mean and SD of population: \[ \mu = 65 \text{ inches} \text{, } \sigma = 3 \text{ inches}\]
From the CLT, we can figure out the theoretical mean and standard deviation of our sampling distribution:
\[ \mu = 65 \text{ inches}\] \[ SE = \frac{\sigma}{\sqrt{n}} \text{ inches} = \frac{3}{\sqrt{50}} \text{ inches} = 0.424 \text{ inches}\]
I simulated the data, so I can calculate mean and SE of the sampling distribution:
Example 1
For a random sample of 100 people, what is the probability that their mean height is greater than 65 inches? We happen to know the population mean is 64 inches and population standard deviation is 4 inches.
Example 1
For a random sample of 100 people, what is the probability that their mean height is greater than 65 inches? We happen to know the population mean is 64 inches and population standard deviation is 4 inches.
Lesson 9 Slides