next up previous contents
Next: Gaze Estimation Up: Principle Components Analysis Previous: Principle Components Analysis

We can gain some insight about the uses and limitations of PCA by considering the related task of least-squares line fitting. The following discussion is based on the treatment of a related topic in [BGW91]. Given a density function tex2html_wrap_inline2794 where tex2html_wrap_inline2796 , we would like to determine the equation for a line (confined to the plane spanned by tex2html_wrap_inline2798 and tex2html_wrap_inline2800 ) which best approximates tex2html_wrap_inline2794 . We will assume without loss of generality that the centroid of tex2html_wrap_inline2794 is located at the origin. Our goal can be stated as follows: we wish to find a unit-norm vector tex2html_wrap_inline2806 which minimises the following integral:

equation435

The quantity tex2html_wrap_inline2808 represents the squared error between each point in the plane (as weighted by tex2html_wrap_inline2794 ) and the line passing through the vector tex2html_wrap_inline2806 . Since tex2html_wrap_inline2806 cannot increase the norm of tex2html_wrap_inline2816 , minimising tex2html_wrap_inline2818 is equivalent to maximising the integral

equation437

In other words, we wish to find a vector tex2html_wrap_inline2806 which has the largest possible projection onto all points in the plane as weighted by tex2html_wrap_inline2794 . This goal is analogous to that of Equation (25) for the case of a single eigenvector. Whereas the above integrals involve a continuous density function, the summation in Equation (25) incorporates the discrete ``density'' of points in the plane into the enumerated vectors tex2html_wrap_inline2824 . In this regard, PCA can be viewed as a form of generalized line-fitting to higher dimensional distributions.

This comparison to the task of line fitting hints at some of the inherent limitations of PCA-based approaches. In a case where the subspace of variations on a feature is not linear, a linear approximation may not be wise. Moreover, determining membership to a class based only on the distance to a particular eigenspace (e.g. the line passing through tex2html_wrap_inline2806 in the above line fitting example) ignores any localisation the training points might possess along the eigenspace. Thus it would be advantageous in certain cases to make adjustments to classical PCA which incorporate probabilistic characterisation of the clusters which give rise to a particular eigenspace. Such a technique is described in Section 3.6.


next up previous contents
Next: Gaze Estimation Up: Principle Components Analysis Previous: Principle Components Analysis

Markus Weber
Tue Jan 7 15:44:13 PST 1997