Sampling Distributions
Normal distribution results
Consider
- fundamental to statistical inference
- relating statistics from the sample to the population parameters
- we use arguments based on sampling distribution of the statistic to state how close the estimate is likely to be to the parameter
- sampling distribution of a statistic is the probability distribution of that statistic
- = probability distributions if samples of the same size were to be repeatedly drawn from the population
- but we dont really do it, math can figure it out
- = probability distributions if samples of the same size were to be repeatedly drawn from the population
- sampling distribution of a statistic is the probability distribution of that statistic
- sampling distributions of
- sample distribution of the sample mean(
) - assumed that population is
or very large compared to sample size - otherwise math changes a little
are n independently drawn observations from a population with mean and sd - n = sample size, not amount of sample draws
- thats how expectation works
is an unbiased estimate of - the expectation of a sum is always the sum of the expectations
- we sometimes write
as - mean of the sampling distribution of the sample mean = mean of the population
can take multiple values in different samples, but their mean will be equal to the population mean
- mean of
, the random variable, same as expectation
- mean of the sampling distribution of the sample mean = mean of the population
- if the random variables are independent, the variance of their sum is the sum of their variances
- often denoted as
- sd of the mean of multiple samples taken will have a smaller variance
is normally distributed if we are sampling from a normally distributed population - for the mean of n observations
- tends to
as - tends to the standardized normal distribution
- tends to
= mean of = standard deviation of
- assumed that population is
If