The Law of Large Numbers

Definition

  • Given
    • random variables X1,X2,,XnX_1, X_2, \dots, X_n that are i.i.d.i.i.d.
    • E[X]=μ\mathbb{E}[X] = \mu
    • V[X]=σ2\mathbb{V}[X] = \sigma^2
  • According to the Law of Large Numbers
Xˉn=1ni=1nXi  P  μ(n)\bar{X}_n = \frac{1}{n} \sum_{i = 1}^{n} X_i \;\xrightarrow{\mathbb{P}}\; \mu \quad (n \to \infty)

Proof

  • Let an event A{0,1}A \in \{0,1\}, where P(A=1)=pP(A = 1) = p and P(A=0)=q=1pP(A = 0) = q = 1 - p.
  • Consider nn independent trials of AA, and let XX be the number of times A=1A = 1 occurs.
  • Then XX follows a binomial distribution:
P(X=x)=(nx)pxqnx,μ=np,σ2=npq.\begin{align*} P(X = x) &= \binom{n}{x} p^{x} q^{\,n - x}, \\ \mu &= np, \\ \sigma^2 &= npq. \end{align*}
  • As nn \to \infty, with pp fixed and xx in a neighborhood of npnp, the binomial distribution admits a normal approximation (de Moivre–Laplace theorem):
XN(μ,σ2).X \approx N(\mu, \sigma^2).
  • Under this approximation, the probability mass function of XX can be approximated by the probability density function:
fX(x)=12πσ2exp ⁣((xμ)22σ2).f_X(x) = \frac{1}{\sqrt{2\pi\sigma^2}} \exp\!\left(-\frac{(x - \mu)^2}{2\sigma^2}\right).
  • Now define a new random variable
Xˉ=Xn,\bar X = \frac{X}{n},

representing the proportion of times A=1A = 1 occurs in nn trials.

  • Using a change of variables under the normal approximation, where x=nxˉx = n\bar x and dxdxˉ=n\frac{dx}{d\bar x} = n, we obtain:
fXˉ(xˉ)=nfX(nxˉ)=n2πnpqexp ⁣((nxˉnp)22npq)=12πpqnexp ⁣((xˉp)22pqn).\begin{align*} f_{\bar X}(\bar x) &= n \cdot f_X(n\bar x) \\ &= \frac{n}{\sqrt{2\pi npq}} \exp\!\left(-\frac{(n\bar x - np)^2}{2npq}\right) \\ &= \frac{1}{\sqrt{2\pi \frac{pq}{n}}} \exp\!\left(-\frac{(\bar x - p)^2}{2\frac{pq}{n}}\right). \end{align*}
  • Therefore, the mean and variance of Xˉ\bar X are:
μXˉ=p,σXˉ2=pqn.\mu_{\bar X} = p, \qquad \sigma^2_{\bar X} = \frac{pq}{n}.
  • As nn \to \infty, the variance converges to zero:
limnσXˉ2=limnpqn=0.\lim_{n \to \infty} \sigma^2_{\bar X} = \lim_{n \to \infty} \frac{pq}{n} = 0.
  • Hence, under the normal approximation, the distribution of Xˉ\bar X concentrates at pp and converges to a degenerate distribution at pp.

  • This implies that the sample proportion converges in probability to pp:

limnXn=p.\lim_{n \to \infty} \frac{X}{n} = p.

Diagram Comparison

n = 30

n = 30

n = 10000

n = 10000