The vector of eigenvalues of is
|
|
The proportions of explained variance are given in Table 9.7. It can be concluded that the representation in two dimensions should be sufficient. The correlations leading to Figure 9.14 are given in Table 9.8. The picture is different from the one obtained in Section 9.3 (see Table 9.2). Here, the first factor is mainly a left-right vs. diagonal factor and the second one is a length factor (with negative weight). Take another look at Figure 9.13, where the individual bank notes are displayed. In the upper left graph it can be seen that the genuine bank notes are for the most part in the south-eastern portion of the graph featuring a larger diagonal, smaller height () and also a larger length (). Note also that Figure 9.14 gives an idea of the correlation structure of the original data matrix.
Calculating the corresponding vector of eigenvalues gives
H | ... | Hi Tech and Communication |
E | ... | Energy |
F | ... | Finance |
M | ... | Manufacturing |
R | ... | Retail |
... | all other sectors. |
|
The two outliers in the
right-hand side of the graph are IBM and General Electric (GE), which
differ from the other companies with their
high market values.
As can be seen in the first column of ,
market value has the largest weight
in the first PC, adding to the isolation of these two companies.
If IBM and GE were to be excluded from the data set, a completely
different picture would emerge, as shown in Figure 9.16.
In this case the vector of eigenvalues becomes
|
|
From Figure 9.17 (and Table 9.10) it appears that the first factor is a ``size effect'', it is positively correlated with all the variables describing the size of the activity of the companies. It is also a measure of the economic strength of the firms. The second factor describes the ``shape'' of the companies (``profit-cash flow'' vs. ``assets-sales'' factor), which is more difficult to interpret from an economic point of view.
|
|
|
In Figure 9.19 the variables television and other leisure activities hardly play any role (look at Table 9.12). The variable television appears in (negatively correlated). Table 9.13 shows that this factor contrasts people from Eastern countries and Yugoslavia with men living in the U.S. The variable other leisure activities is the factor . It merely distinguishes between men and women in Eastern countries and in Yugoslavia. These last two factors are orthogonal to the preceeding axes and of course their contribution to the total variation is less important.