The dataset contains images of people collected from the web by typing common given names into Google Image Search. The coordinates of the eyes, the nose and the center of the mouth for each frontal face are provided in a ground truth file. This information can be used to align and crop the human faces or as a ground truth for a face detection algorithm. The dataset has 10,524 human faces of various resolutions and in different settings, e.g. portrait images, groups of people, etc. Profile faces or very low resolution faces are not labeled. The data contains a total of 10,524 faces in 7,092 images. The average image resolution is 304x312 pixels across the data.
Before you download the data, please note: The pictures in the dataset were harvested from the web for the purpose of carrying out not-for-profit scientific experiments and are not Caltech property. Any use of the dataset, other than fair use, must be negotiated with the pictures’ owners. Caltech is not responsible for the content nor the meaning of the images.
The data can be found here, including:
This dataset was collected by Michael Fink while visiting the Caltech Vision Group. Rob Fergus provided a script for harvesting images from Google.