TFDS now supports the Croissant 🥐 format! Read the documentation to know more.

i_naturalist2017

Description:

This dataset contains a total of 5,089 categories, across 579,184 training images and 95,986 validation images. For the training set, the distribution of images per category follows the observation frequency of that category by the iNaturalist community.

Although the original dataset contains some images with bounding boxes, currently, only image-level annotations are provided (single label/image). In addition, the organizers have not published the test labels, so we only provide the test images (label = -1).

Additional Documentation: Explore on Papers With Code
Homepage: https://github.com/visipedia/inat_comp/tree/master/2017
Source code: tfds.image_classification.INaturalist2017
Versions:
- 0.1.0 (default): No release notes.
Download size: 237.35 GiB
Dataset size: 238.10 GiB
Auto-cached (documentation): No
Splits:

Split	Examples
`'test'`	182,707
`'train'`	579,184
`'validation'`	95,986

Feature structure:

FeaturesDict({
    'id': Text(shape=(), dtype=string),
    'image': Image(shape=(None, None, 3), dtype=uint8),
    'label': ClassLabel(shape=(), dtype=int64, num_classes=5089),
    'supercategory': ClassLabel(shape=(), dtype=int64, num_classes=13),
})

Feature documentation:

Feature	Class	Shape	Dtype
	FeaturesDict
id	Text		string
image	Image	(None, None, 3)	uint8
label	ClassLabel		int64
supercategory	ClassLabel		int64

Supervised keys (See as_supervised doc): ('image', 'label')
Figure (tfds.show_examples):

Visualization

Examples (tfds.as_dataframe):

Citation:

@InProceedings{Horn_2018_CVPR,
author = {
Van Horn, Grant and Mac Aodha, Oisin and Song, Yang and Cui, Yin and Sun, Chen
and Shepard, Alex and Adam, Hartwig and Perona, Pietro and Belongie, Serge},
title = {The INaturalist Species Classification and Detection Dataset},
booktitle = {
The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2018}
}