Lecture 13 - 2025/5/20

Fano's Inequality

Theorem. Let $X, Y$ be discrete random variables, $X \in \mathcal H$, $|\mathcal H| < \infty$. We want to estimate $X$ using $Y$; that is, we use $\hat X = g(Y)$ as an estimate of $X$. Our goal is to minimize the error probability $P_e := P(\hat X \ne X)$.
[ $P_e$ is related to $P(X \mid Y)$ and $H(X \mid Y)$ ]

$$P_e \ge \frac{H(X \mid Y) - 1}{\log |\mathcal H|}$$
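
A minimal numerical sanity check (the joint distribution `p` below is a made-up example, not from the lecture): we compute $H(X \mid Y)$, the error probability of the best (MAP) estimator, and the Fano lower bound, all in bits.

```python
import numpy as np

# Toy joint distribution p(x, y): rows index x, columns index y.
# Values are illustrative assumptions; here |H| = 3.
p = np.array([[0.20, 0.05, 0.05],
              [0.05, 0.20, 0.05],
              [0.05, 0.05, 0.30]])

p_y = p.sum(axis=0)                        # marginal of Y
H_cond = -(p * np.log2(p / p_y)).sum()     # H(X | Y) in bits

# Best estimator g(y) = argmax_x p(x | y); its error probability
P_e = 1.0 - p.max(axis=0).sum()

bound = (H_cond - 1.0) / np.log2(3)        # Fano lower bound
print(f"P_e = {P_e:.3f} >= {bound:.3f}")   # 0.300 >= 0.111
```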

Channel Coding Theorem part 2 ($R > C$)

$$H(M \mid Y_1, \cdots, Y_n) = H(M) - I(M; Y_1, \cdots, Y_n)$$

With $M$ uniform over $2^{nR}$ messages, $H(M) = nR$, and the channel gives $I(M; Y_1, \cdots, Y_n) \le nC$, so $H(M \mid Y_1, \cdots, Y_n) \ge n(R - C)$. Applying Fano's inequality with $|\mathcal H| = 2^{nR}$:

$$P_e \ge \frac{n(R - C) - 1}{nR} \approx \frac{R - C}{R} = \varepsilon_0 > 0$$
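
Plugging in illustrative numbers (the values of $R$, $C$, $n$ below are assumptions, not from the lecture) shows the bound stays bounded away from $0$ as $n \to \infty$:

```python
# Rate R above capacity C, blocklength n (all values assumed for illustration)
R, C, n = 0.6, 0.5, 1000
print((n * (R - C) - 1) / (n * R))   # 0.165; tends to (R - C)/R = 1/6 as n grows
```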

Fisher Information and Cramér-Rao Inequality

1. Unbiased Estimation

Sample $X = (X_1, \cdots, X_n)$, typically i.i.d. $X$ has density function $f(X; \theta) = \prod_{i=1}^{n} f(X_i; \theta)$.

Our goal: estimate $\theta$ from $X_1, \cdots, X_n$ using $\hat\theta = \phi(X_1, \cdots, X_n)$, $\phi : \mathcal X \to \mathbb R$.

An estimator is unbiased if $E(\phi(X)) = \theta$; we want to give $Var(\phi(X))$ a lower bound.
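
A quick simulation sketch of the setup (a standard example; the Gaussian family and all parameter values are my illustration): the sample mean of i.i.d. $N(\theta, 1)$ draws is unbiased, and its variance is exactly the quantity we want to bound below.

```python
import numpy as np

rng = np.random.default_rng(0)
theta, n, trials = 2.0, 50, 100_000   # assumed values, for illustration

# X_i ~ N(theta, 1) i.i.d.; phi(X) = sample mean, an unbiased estimator
X = rng.normal(theta, 1.0, size=(trials, n))
phi = X.mean(axis=1)

print(phi.mean())   # ~2.0, consistent with E(phi(X)) = theta
print(phi.var())    # ~0.02 = 1/n, the variance we want to lower-bound
```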

2. Fisher Information

Definition (score function): $S(X; \theta) := \dfrac{\partial}{\partial \theta} \ln f(X; \theta)$

$$\begin{aligned} E(S(X; \theta)) &= \int S(x; \theta) f(x; \theta) \,\mathrm d x \\ &= \int \frac{\partial}{\partial \theta} \ln f(x; \theta) \cdot f(x; \theta) \,\mathrm d x \\ &= \int \frac{\partial}{\partial \theta} f(x; \theta) \,\mathrm d x \\ &= \frac{\partial}{\partial \theta} \int f(x; \theta) \,\mathrm d x = \frac{\partial}{\partial \theta} 1 = 0 \end{aligned}$$

(The last step interchanges differentiation and integration, which requires mild regularity conditions on $f$.)

Since $E(S(X; \theta)) = 0$, we have $E(S^2(X; \theta)) = Var(S(X; \theta))$.

Definition (Fisher information): $I(\theta) := Var(S(X; \theta))$

Proposition: $I(\theta) = -E\left( \dfrac{\partial^2}{\partial \theta^2} \ln f(X; \theta) \right)$
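
A standard worked example (this particular family is my illustration, not from the lecture): let $X_1, \cdots, X_n$ be i.i.d. $N(\theta, \sigma^2)$ with $\sigma^2$ known. The score is

$$S(x; \theta) = \frac{\partial}{\partial \theta} \left( -\sum_{i=1}^{n} \frac{(x_i - \theta)^2}{2\sigma^2} + \text{const} \right) = \frac{1}{\sigma^2} \sum_{i=1}^{n} (x_i - \theta),$$

and both formulas for the Fisher information agree:

$$I(\theta) = Var(S(X; \theta)) = \frac{n \sigma^2}{\sigma^4} = \frac{n}{\sigma^2}, \qquad -E\left( \frac{\partial^2}{\partial \theta^2} \ln f(X; \theta) \right) = -\left( -\frac{n}{\sigma^2} \right) = \frac{n}{\sigma^2}.$$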

3. Cramér-Rao Inequality

Theorem. For any unbiased estimator $\phi : \mathcal X \to \mathbb R$, we have $Var(\phi(X)) \ge \dfrac{1}{I(\theta)}$.
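
A Monte Carlo sanity check (the Bernoulli setup and all numbers below are assumptions chosen for illustration): for i.i.d. $\text{Bernoulli}(\theta)$ samples the sample mean is unbiased with variance $\theta(1-\theta)/n$, which equals $1/I(\theta)$, so the Cramér-Rao bound is tight in this case.

```python
import numpy as np

rng = np.random.default_rng(1)
theta, n, trials = 0.3, 40, 200_000   # assumed Bernoulli setup

# Unbiased estimator phi(X) = sample mean of n i.i.d. Bernoulli(theta) draws
X = rng.binomial(1, theta, size=(trials, n))
var_phi = X.mean(axis=1).var()

I = n / (theta * (1 - theta))   # Fisher information of the whole sample
print(var_phi, ">=", 1 / I)     # ~0.00525 vs 0.00525: the bound is achieved
```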

Generally, for $d$-dimensional $\theta$, the Fisher information is the $d \times d$ matrix $I(\theta)_{ij} = E(S_i S_j)$ with $S_i = \frac{\partial}{\partial \theta_i} \ln f(X; \theta)$, and any unbiased estimator satisfies $Cov(\phi(X)) \succeq I(\theta)^{-1}$ in the positive semidefinite order.