Normal distribution
Also known as the Gaussian distribution, the normal distribution is the bell-shaped probability model used when many small, roughly independent effects add together.
A continuous random variable is normal with mean and variance , written
when its probability density is
Key ideas:
- sets the centre and equals the mean, median, and mode.
- sets the spread; larger means a wider, flatter curve.
- The total area under the density curve is , so probabilities are areas.
- Standardisation converts any normal variable to a standard normal:
The empirical rule is a useful memory aid: about 68% of observations lie within , 95% within , and 99.7% within of .
The normal distribution appears in measurement error, regression residuals, diffusion models, and as an approximation justified by the central limit theorem. It is central to Stats Equations and definitions, Stats Examples, and many ML loss functions, especially least-squares regression.
Standard normal
The standard normal distribution has mean and variance :
Any normal variable can be standardised by
Z=rac{X-\mu}{\sigma}.This converts questions about into questions about the standard normal CDF. In practice this is what z-scores are doing.