i_naturalist2017

This dataset covers 5,089 categories, with 579,184 training images and 95,986 validation images. In the training set, the number of images per category follows how often the iNaturalist community observed that category.

Although the original dataset includes bounding boxes for some images, only image-level annotations (a single label per image) are currently provided. The organizers have not published the test labels, so the test split contains images only, each with label = -1.
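Because test examples carry label = -1, code that computes a loss or an accuracy will usually want to skip them. The following is a minimal sketch of one way to do that, assuming examples arrive as dictionaries with a 'label' entry (for instance when iterating over `tfds.as_numpy(tfds.load("i_naturalist2017", split="test"))`). The names `UNLABELED` and `labeled_only` are hypothetical helpers introduced here for illustration; they are not part of TFDS.

```python
# Label value that marks a test image with no published label (see text above).
UNLABELED = -1


def labeled_only(examples):
    """Yield only the examples whose label is not the UNLABELED placeholder.

    `examples` is any iterable of dict-like objects with a 'label' entry.
    Written as a generator so a large split is never held in memory at once.
    """
    for example in examples:
        if int(example["label"]) != UNLABELED:
            yield example


# Tiny stand-in data; real examples would also carry 'id', 'image', etc.
demo = [{"label": 12}, {"label": -1}, {"label": 0}]
kept = list(labeled_only(demo))
```

Applied to the test split as described here, such a filter would be expected to yield nothing, since every test label is -1.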

Features

FeaturesDict({
    'id': Text(shape=(), dtype=tf.string),
    'image': Image(shape=(None, None, 3), dtype=tf.uint8),
    'label': ClassLabel(shape=(), dtype=tf.int64, num_classes=5089),
    'supercategory': ClassLabel(shape=(), dtype=tf.int64, num_classes=13),
})
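As an illustration of how these four features might be read from a single example, here is a small sketch. It assumes the example is a NumPy-style dictionary (such as one produced by `tfds.as_numpy`), in which 'id' is a bytes object and 'image' exposes a `.shape` of (height, width, 3). The helper name `summarize_example` is hypothetical.

```python
def summarize_example(example):
    """Convert one example dict into plain Python values.

    Assumes 'id' is bytes, 'image' has a .shape of (height, width, channels),
    and 'label' / 'supercategory' are integer class indices.
    """
    height, width, channels = example["image"].shape
    return {
        "id": example["id"].decode("utf-8"),
        "height": int(height),
        "width": int(width),
        "channels": int(channels),
        "label": int(example["label"]),
        "supercategory": int(example["supercategory"]),
    }
```

Since the image shape is (None, None, 3), height and width can differ from one example to the next, so images generally have to be resized or padded to a common size before they can be batched.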

Statistics

Split        Examples
ALL           857,877
TRAIN         579,184
TEST          182,707
VALIDATION     95,986
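The split sizes can be checked against the total with a few lines of arithmetic. The sketch below uses only the counts listed in this section; `SPLITS`, `TOTAL`, and `split_fractions` are illustrative names.

```python
# Example counts copied from the statistics in this section.
SPLITS = {"train": 579_184, "test": 182_707, "validation": 95_986}
TOTAL = 857_877


def split_fractions(splits):
    """Return each split's share of the combined example count."""
    total = sum(splits.values())
    return {name: count / total for name, count in splits.items()}


fractions = split_fractions(SPLITS)
for name, share in fractions.items():
    print(f"{name}: {share:.1%}")
```

The three splits add up exactly to the ALL row, and the training split accounts for roughly two thirds of the examples.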

Homepage

Supervised keys (for as_supervised=True)

('image', 'label')
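With `as_supervised=True`, `tfds.load` returns (input, target) pairs built from the supervised keys instead of full feature dictionaries. The sketch below is a hedged illustration: `load_supervised` wraps the real `tfds.load` call and assumes `tensorflow_datasets` is installed and the dataset has been prepared, while `to_supervised_pair` is a hypothetical pure-Python helper that mimics the same selection on a plain dictionary.

```python
SUPERVISED_KEYS = ("image", "label")


def load_supervised(split="train"):
    """Load a split as (image, label) pairs.

    Assumption: tensorflow_datasets is installed and the dataset is available
    locally. The import is kept inside the function so the rest of this
    sketch runs without it.
    """
    import tensorflow_datasets as tfds

    return tfds.load("i_naturalist2017", split=split, as_supervised=True)


def to_supervised_pair(example, keys=SUPERVISED_KEYS):
    """Pick the (input, target) pair out of a full feature dictionary."""
    input_key, target_key = keys
    return example[input_key], example[target_key]


# Stand-in example; a real 'image' would be a uint8 array, not a string.
full = {"id": "x1", "image": "<pixels>", "label": 3, "supercategory": 1}
pair = to_supervised_pair(full)
```

The 'id' and 'supercategory' features are not part of the pair, so code that needs them should load the dataset without `as_supervised=True`.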

Citation

@InProceedings{Horn_2018_CVPR,
  author = {Van Horn, Grant and Mac Aodha, Oisin and Song, Yang and Cui, Yin and Sun, Chen and Shepard, Alex and Adam, Hartwig and Perona, Pietro and Belongie, Serge},
  title = {The {iNaturalist} Species Classification and Detection Dataset},
  booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  month = {June},
  year = {2018}
}