9.4 Asymptotic Properties of the PCs

In practice, PCs are computed from sample data. The following theorem yields results on the asymptotic distribution of the sample PCs.

THEOREM 9.4   Let $ \Sigma > 0 $ have distinct eigenvalues and let $\data{U}\sim m^{-1} W_p(\Sigma ,m)$, with spectral decompositions $\Sigma =\Gamma \Lambda
\Gamma ^{\top}$ and $\data{U}=\data{G}\data{L}\data{G}^{\top}$. Then
(a)
$\sqrt m (\ell -\lambda )\stackrel{\cal L}{\longrightarrow}
N_p(0,2\Lambda ^2)$,
where $\ell =(\ell _1,\ldots ,\ell _p)^{\top}$ and $\lambda =(\lambda _1,\ldots ,\lambda _p)^{\top}$ are the diagonals of $\data{L}$ and $\Lambda$, respectively,
(b)
$\sqrt m(g_j-\gamma _j)\stackrel{\cal L}{\longrightarrow}
N_p(0,\data{V}_j)$,
with $\displaystyle \data{V}_j=\lambda _j\sum\limits_{k\neq j}
\frac{\lambda _k}{(\lambda _k-\lambda _j)^2 }\gamma _k\gamma _k^{\top}$,
(c)
$\mathop{\mathit{Cov}}(g_j,g_k) = \data{V}_{jk}$,
where the $(r,s)$-element of the matrix $\data{V}_{jk} (p\times p)$ is $\displaystyle -\frac{\lambda _j\lambda _k\gamma _{rk}\gamma _{sj} }
{[m(\lambda _j-\lambda _k)^2] }$,
(d)
the elements in $\ell$ are asymptotically independent of the elements in $\data{G}$.
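
The statement in (a) can be checked numerically. The following is a minimal simulation sketch, not part of the text; the toy covariance matrix, the values of $m$ and the number of replications, and the use of scipy.stats.wishart are illustrative assumptions. It draws replications of $\data{U}\sim m^{-1}W_p(\Sigma ,m)$ and compares the empirical variances of $\sqrt m(\ell _j-\lambda _j)$ with the theoretical values $2\lambda _j^2$.

\begin{verbatim}
# Minimal simulation sketch of Theorem 9.4(a); Sigma, m and reps are
# illustrative choices, not values from the text.
import numpy as np
from scipy.stats import wishart

Sigma = np.diag([4.0, 2.0, 1.0])                # toy covariance with distinct eigenvalues
lam = np.sort(np.linalg.eigvalsh(Sigma))[::-1]  # true eigenvalues, descending
m, reps = 400, 5000

W = wishart(df=m, scale=Sigma).rvs(size=reps, random_state=0)  # shape (reps, p, p)
ell = np.sort(np.linalg.eigvalsh(W / m), axis=1)[:, ::-1]      # eigenvalues of U = W/m

dev = np.sqrt(m) * (ell - lam)                  # sqrt(m)(l_j - lambda_j) per replication
print("empirical variances :", dev.var(axis=0))
print("theoretical 2*lam^2 :", 2 * lam**2)
\end{verbatim}

For large $m$ the two printed vectors should be close to each other, here approximately $(32, 8, 2)$.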

EXAMPLE 9.4   Since $n\data{S}\sim W_p(\Sigma ,n-1)$ if $X_1,\ldots,X_n$ are drawn from $N(\mu,\Sigma)$, Theorem 9.4 applies with $m=n-1$, so that
\begin{displaymath}
\sqrt {n-1}(\ell _j-\lambda _j)\stackrel{\cal L}{\longrightarrow}
N(0,2\lambda _j^2),\quad j=1,\ldots ,p.
\end{displaymath} (9.17)

Since the asymptotic variance in (9.17) depends on the true value $\lambda_j$, a log transformation is useful. Consider $f(\ell_j) = \log(\ell_j)$. Then $ \frac{d}{d \ell_j}f\vert _{\ell_j = \lambda_j} = \frac{1}{\lambda_j}$, so by the Transformation Theorem 4.11 the asymptotic variance becomes $(1/\lambda_j)^2\cdot 2\lambda_j^2=2$, and we obtain from (9.17) that
\begin{displaymath}
\sqrt{n-1} (\log \ell_j - \log \lambda_j) \stackrel{\cal L}{\longrightarrow} N(0,2).
\end{displaymath} (9.18)

Hence,

\begin{displaymath}\sqrt {\frac{n-1 }{2 }}\left (\log \ell _j-\log\lambda _j\right )
\stackrel{\cal L}{\longrightarrow} N(0,1)\end{displaymath}

and a two-sided confidence interval for $\log\lambda_j$ at the $1-\alpha =0.95$ confidence level is given by

\begin{displaymath}\log(\ell _j)-1.96\sqrt {\frac{2 }{n-1 }}\le \log\lambda _j\le \log(\ell _j)+1.96
\sqrt {\frac{2 }{n-1 }}.\end{displaymath}

In the bank data example we have that

\begin{displaymath}\ell_1 =2.98.\end{displaymath}

Therefore,

\begin{displaymath}\log(2.98)\pm 1.96 \sqrt {\frac{2 }{199 } }=\log(2.98)\pm 0.1965.\end{displaymath}

It can be concluded for the true eigenvalue that

\begin{displaymath}P\left\{\lambda_1\in(2.448, 3.62)\right\}\approx 0.95.\end{displaymath}
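
This interval can also be reproduced with a few lines of code. The sketch below is not part of the text; the helper name is hypothetical, and only the quantities quoted above ($\ell _1=2.98$, $n=200$) are used.

\begin{verbatim}
# Sketch: log-scale confidence interval for an eigenvalue (hypothetical helper).
import numpy as np
from scipy.stats import norm

def eigenvalue_ci(ell_j, n, alpha=0.05):
    # based on sqrt((n-1)/2)(log l_j - log lambda_j) -> N(0,1)
    half = norm.ppf(1 - alpha / 2) * np.sqrt(2.0 / (n - 1))
    return np.exp(np.log(ell_j) - half), np.exp(np.log(ell_j) + half)

print(eigenvalue_ci(2.98, 200))   # approximately (2.448, 3.627)
\end{verbatim}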

Variance explained by the first $q$ PCs.

The variance explained by the first $q$ PCs is given by

\begin{displaymath}\psi =\frac{\lambda _1+\cdots+\lambda _q }
{\sum\limits^p_{j=1} \lambda _j}\cdotp\end{displaymath}

In practice this is estimated by

\begin{displaymath}\widehat \psi =\frac{\ell _1+\cdots+\ell _q }
{\sum\limits^p_{j=1}\ell _j }\cdotp\end{displaymath}

From Theorem 9.4 we know the distribution of $\sqrt {n-1}(\ell -\lambda )$. Since $\psi $ is a nonlinear function of $\lambda $, we can again apply the Transformation Theorem 4.11 to obtain that

\begin{displaymath}\sqrt {n-1}(\widehat \psi -\psi )\stackrel{\cal L}{\longrightarrow}
N(0,\data{D}^{\top}\data{V}\data{D})\end{displaymath}

where $\data{V} = 2\Lambda ^2$ (from Theorem 9.4) and $\data{D} = (d_1,\ldots ,d_p)^{\top}$ with

\begin{displaymath}d_j=\frac{\partial \psi} {\partial \lambda_j }
=\left\{ \begin{array}{ll}
\displaystyle\frac{1-\psi }{\mathop{\hbox{tr}}(\Sigma)} &\quad \textrm{ for } 1\le j\le q,\\[3mm]
\displaystyle\frac{-\psi }{\mathop{\hbox{tr}}(\Sigma)} &\quad \textrm{ for } q+1 \le j\le p.
\end{array}\right.\end{displaymath}

Given this result, the following theorem can be derived.

THEOREM 9.5  

\begin{displaymath}\sqrt {n-1}(\widehat \psi -\psi )\stackrel{\cal L}{\longrightarrow}
N(0,\omega^2),\end{displaymath}

where

\begin{eqnarray*}
\omega^2 &=& \data{D}^{\top}\data{V}\data{D}
= \frac{2} {\{\mathop{\hbox{tr}}(\Sigma )\}^2 }
\left\{(1-\psi )^2(\lambda ^2_1+\cdots+\lambda ^2_q)
+\psi ^2(\lambda ^2_{q+1}+\cdots+\lambda ^2_p)\right\}\\
&=& \frac{2\mathop{\hbox{tr}}(\Sigma ^2)} {\{\mathop{\hbox{tr}}(\Sigma )\}^2 }
(\psi ^2-2\beta \psi +\beta )
\end{eqnarray*}



and

\begin{displaymath}\beta = {\displaystyle \frac{\lambda ^2_1+\cdots+\lambda ^2_q }
{\lambda ^2_1+\cdots+\lambda ^2_p } }.\end{displaymath}
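
A direct implementation of Theorem 9.5 is sketched below; it is not part of the text, and the function name and interface are assumptions. It estimates $\psi $, $\beta $ and $\omega ^2$ from the sample eigenvalues, replacing $\mathop{\hbox{tr}}(\Sigma )$ by $\sum_j\ell _j$ and $\mathop{\hbox{tr}}(\Sigma ^2)$ by $\sum_j\ell _j^2$, and returns the corresponding asymptotic confidence interval.

\begin{verbatim}
# Sketch of Theorem 9.5 (hypothetical helper): asymptotic CI for the
# proportion of variance explained by the first q PCs.
import numpy as np
from scipy.stats import norm

def variance_explained_ci(ell, q, n, alpha=0.05):
    ell = np.asarray(ell, dtype=float)          # sample eigenvalues, descending
    psi_hat = ell[:q].sum() / ell.sum()
    beta_hat = (ell[:q]**2).sum() / (ell**2).sum()
    omega2 = (2 * (ell**2).sum() / ell.sum()**2
              * (psi_hat**2 - 2 * beta_hat * psi_hat + beta_hat))
    half = norm.ppf(1 - alpha / 2) * np.sqrt(omega2 / (n - 1))
    return psi_hat, omega2, (psi_hat - half, psi_hat + half)
\end{verbatim}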

EXAMPLE 9.5   From Section 9.3 it is known that the first PC for the Swiss bank notes explains 67\% of the variation. It can be tested whether the true proportion is actually 75\%. Computing

\begin{eqnarray*}
\widehat\beta &=& \frac{\ell^2_1}{\ell^2_1+\cdots+\ell ^2_p }=0.902,\\
\widehat\omega^2 &=& \frac{2\mathop{\hbox{tr}}(\data{S}^2)} {\{\mathop{\hbox{tr}}(\data{S})\}^2 }
(\widehat\psi ^2-2\widehat\beta \widehat\psi +\widehat\beta )
= \frac{2\cdot 9.883 } {(4.472)^2}\{(0.668)^2-2(0.902)(0.668)+0.902\}=0.142.
\end{eqnarray*}



Hence, a two-sided confidence interval at the $1-\alpha =$ 0.95 confidence level is given by

\begin{displaymath}0.668\pm1.96\sqrt {\frac{0.142 }{199 }}=(0.615,0.720).\end{displaymath}

Clearly, the hypothesis that $\psi =$ 75\% can be rejected, since 0.75 lies outside this interval.
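
The figures above can be checked directly from the summary quantities quoted in this example ($\widehat\psi =0.668$, $\widehat\beta =0.902$, $\mathop{\hbox{tr}}(\data{S})=4.472$, $\mathop{\hbox{tr}}(\data{S}^2)=9.883$, $n=200$). The short sketch below, which is not part of the text, reproduces $\widehat\omega ^2$ and the confidence interval up to rounding.

\begin{verbatim}
# Numeric check of Example 9.5 from the quoted summary figures.
import numpy as np

psi_hat, beta_hat, trS, trS2, n = 0.668, 0.902, 4.472, 9.883, 200
omega2 = 2 * trS2 / trS**2 * (psi_hat**2 - 2 * beta_hat * psi_hat + beta_hat)
half = 1.96 * np.sqrt(omega2 / (n - 1))
print(omega2, (psi_hat - half, psi_hat + half))   # ~0.141, roughly (0.616, 0.720)
\end{verbatim}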

Summary
$\ast$
The eigenvalues $\ell_{j}$ and eigenvectors $g_{j}$ are asymptotically normally distributed; in particular, $\sqrt{n-1}(\ell-\lambda)\stackrel{\cal L}{\longrightarrow} N_{p}(0,2\Lambda^2)$.
$\ast$
For the eigenvalues it holds that $\sqrt {\frac{n-1 }{2 }}\left (\log \ell _j-\log\lambda _j\right )
\stackrel{\cal L}{\longrightarrow} N(0,1)$.
$\ast$
Given these asymptotic normal distributions, approximate confidence intervals and tests can be constructed for the eigenvalues and for the proportion of variance explained by the first $q$ PCs. The two-sided confidence interval for $\log\lambda_j$ at the $1-\alpha =0.95$ level is given by $\log(\ell _j)-1.96\sqrt {\frac{2 }{n-1 }}\le \log\lambda _j\le
\log(\ell _j)+1.96 \sqrt {\frac{2 }{n-1 }}.$
$\ast$
It holds for $\widehat \psi$, the estimate of $\psi $ (the proportion of the variance explained by the first $q$ PCs), that $\sqrt {n-1}(\widehat \psi -\psi )\stackrel{\cal L}{\longrightarrow}
N(0,\omega^2)$, where $\omega^2$ is given in Theorem 9.5.