next up previous contents
Next: node23.html Up: Relevant Processing Techniques Previous: The Onedimensional Case

Principle Components Analysis

  Principle Components Analysis (PCA), also known as Karhunen-Loève expansion or Eigen-XY analysis, has in recent years found a number of applications in the fields of computer vision and pattern recognition. As mentioned in Section 2.1, PCA is based on representing typical images in terms of a compact set of orthogonal basis images. We will now provide a concise summary of the main ideas behind PCA. For more details, refer to Kirby et al. [KWD93] or Turk and Pentland [TP91].

Assume that we have M column vectors representing M different sampled signals, which we will label tex2html_wrap_inline2708 . Furthermore, let tex2html_wrap_inline2710 represent the mean of the tex2html_wrap_inline2712 's,

equation421

and let tex2html_wrap_inline2714 be defined as

equation423

For illustrative purposes, we will assume that each tex2html_wrap_inline2712 represents an tex2html_wrap_inline2718 image of a face which has been concatenated into an tex2html_wrap_inline2720 column vector by placing each column of the image atop the next.gif

In the huge tex2html_wrap_inline2724 -dimensional space of possible images, it is reasonable to assume that the set of points corresponding to human faces are somewhat clustered to a relatively low-dimensional subspace. The goal of PCA, in the present context, is to determine from the set of tex2html_wrap_inline2712 's a basis for the subspace within which most faces can be represented with a small amount of error.

Mathematically, we can express this goal as follows: we wish to find a set of M orthonormal vectors tex2html_wrap_inline2730 for which the quantity

  equation425

is maximised, subject to the orthonormality constraint

equation427

In other words, our goal is to find a set of tex2html_wrap_inline2730 's which have the largest possible projection onto each of the tex2html_wrap_inline2714 's. Assuming tex2html_wrap_inline2736 , the set of tex2html_wrap_inline2730 's offer us an efficient and compact means of representing faces as linear combinations of precomputed basis vectors. In [TP91], this set of basis vectors, i.e. the tex2html_wrap_inline2730 's, are referred to as ``eigenfaces.''

Using Rayleigh's principle, it can be shown that the tex2html_wrap_inline2730 's and tex2html_wrap_inline2744 's are given by the eigenvectors and eigenvalues of the covariance matrix

equation429

where tex2html_wrap_inline2746 is a matrix comprised of the column vectors tex2html_wrap_inline2714 placed side by side. Since the matrix tex2html_wrap_inline2750 is often enormous ( tex2html_wrap_inline2752 ), it is not practical to solve for the eigenvectors of tex2html_wrap_inline2750 directly. The number of sample points in each vector, tex2html_wrap_inline2724 , is generally much larger than the dimension M of the matrix tex2html_wrap_inline2750 , however, and as a result, all but M-1 of tex2html_wrap_inline2750 's eigenvalues are equal to zero. (The -1 arises from the subtraction of the mean vector tex2html_wrap_inline2710 .)

As described in [TP91], the vectors tex2html_wrap_inline2730 and scalars tex2html_wrap_inline2744 can be obtained indirectly by solving for the eigenvectors and eigenvalues of the tex2html_wrap_inline2774 matrix tex2html_wrap_inline2776 . To summarise, given the eigenvectors tex2html_wrap_inline2778 and eigenvalues tex2html_wrap_inline2780 of tex2html_wrap_inline2776 ,

equation431

we can premultiply both sides by tex2html_wrap_inline2746 ,

equation433

which tells us that the first M-1 eigenvectors and eigenvalues of tex2html_wrap_inline2788 are given by tex2html_wrap_inline2790 and tex2html_wrap_inline2780 , respectively


next up previous contents
Next: node23.html Up: Relevant Processing Techniques Previous: The Onedimensional Case

Markus Weber
Tue Jan 7 15:44:13 PST 1997