Caltech-UCSD Birds-200-2011

Warning: Images in this dataset overlap with images in ImageNet. Exercise caution when using networks pretrained with ImageNet (or any network pretrained with images from Flickr) as the test set of CUB may overlap with the training set of the original network.

Browse

Click here to browse the dataset.

Details

Caltech-UCSD Birds-200-2011 (CUB-200-2011) is an extended version of the CUB-200 dataset, with roughly double the number of images per class and new part location annotations. For detailed information about the dataset, please see the technical report linked below.

  • Number of categories: 200

  • Number of images: 11,788

  • Annotations per image: 15 Part Locations, 312 Binary Attributes, 1 Bounding Box

Some related datasets are Caltech-256, the Oxford Flower Dataset, and Animals with Attributes. More datasets are available at the Caltech Vision Dataset Archive.

Citation

If you use CUB-200-2011 in your work, please cite the technical report:

  • Wah C., Branson S., Welinder P., Perona P., Belongie S. “The Caltech-UCSD Birds-200-2011 Dataset.” Computation & Neural Systems Technical Report, CNS-TR-2011-001. download pdf

BibTeX
@techreport{WahCUB_200_2011,
	Title = {{The Caltech-UCSD Birds-200-2011 Dataset}},
	Author = {Wah, C. and Branson, S. and Welinder, P. and Perona, P. and Belongie, S.},
	Year = {2011}
	Institution = {California Institute of Technology},
	Number = {CNS-TR-2011-001}
}

Download

You can download the dataset using the links below:

Contact

Contact Catherine Wah for questions about the dataset.

Publications

The following publications use the dataset. Please contact us if you are using the dataset, and we will add your paper to the list.

2014

  • Goering, C., Rodner, E., Freytag, A., Denzler, J., “Nonparametric Part Transfer for Fine-grained Recognition”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2014. pdf

  • Wah, C., Van Horn, G., Branson, S., Maji, S., Perona, P., Belongie, S., “Similarity Comparisons for Interactive Fine-Grained Categorization”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2014.

2013

  • Berg T., Belhumeur P., “POOF: Part-Based One-vs-One Features for Fine-Grained Categorization, Face Verification, and Attribute Estimation”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2013.

  • Chai, Y., Lempitsky, V., Zisserman, A., “Symbiotic Segmentation and Part Localization for Fine-Grained Categorization”, IEEE International Conference on Computer Vision (ICCV), Sydney, Australia, 2013. pdf

  • Gavves E., Fernando B., Snoek C., Smeulders A., Tuytelaars T., “Fine-Grained Categorization by Alignments”, IEEE International Conference on Computer Vision (ICCV), Sydney, Australia, 2013. pdf

  • Wah C., Belongie S., “Attribute-Based Detection of Unfamiliar Classes with Humans in the Loop”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2013. pdf

2012

  • Duan K., Parikh D., Crandall D., Grauman K. “Discovering localized attributes for fine-grained recognition”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2012.

  • Zhang N., Farrell, R., Darrell, T. “Pose pooling kernels for sub-category recognition”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2012.

2011

  • Branson S., Perona P., Belongie S. “Strong Supervision From Weak Annotation: Interactive Training of Deformable Part Models”, IEEE International Conference on Computer Vision (ICCV), Barcelona, 2011. pdf, supplementary material

  • Wah C., Branson S., Perona P., Belongie S., “Multiclass Recognition and Part Localization with Humans in the Loop”, IEEE International Conference on Computer Vision (ICCV), Barcelona, 2011. pdf

2010

  • Branson S., Wah C., Babenko B., Schroff F., Welinder P., Perona P., Belongie S., “Visual Recognition with Humans in the Loop”, European Conference on Computer Vision (ECCV), Heraklion, Crete, Sept., 2010. pdf

  • Welinder, P., Perona, P. “Online crowdsourcing: rating annotators and obtaining cost-effective labels”, Workshop on Advancing Computer Vision with Humans in the Loop at CVPR. 2010. pdf