Stats Equations and definitions

Probability

For event , and . Complement: Conditional probability: Bayes’ rule:

Summaries

Sample mean: Sample variance: Covariance and correlation:

Distributions

A probability mass function gives for discrete . A density gives continuous probabilities by integration: The Normal distribution has density

Likelihood and loss

For independent data , Many ML losses are negative log-likelihoods, so minimising loss often means maximising statistical plausibility.

Expectation and variance

For a random variable , expectation is the probability-weighted average value:

Variance measures spread around the mean:

\operatorname{Var}(X)=\mathbb{E}\left[(X-\mathbb{E}[X])^2 ight].

Standard deviation is

Likelihood

Given data and parameter , the likelihood is

Statistics often asks: which makes the observed data least surprising?

Standard error

For a sample mean based on independent observations with standard deviation , the standard error is

\operatorname{SE}(ar{X})= rac{\sigma}{\sqrt{n}}.

This is the uncertainty in the estimate of the mean, not the scatter of individual data points. More data reduces standard error like , which is powerful but slow: needing twice the precision costs roughly four times the data.