The EMNIST dataset is a set of handwritten character digits derived from the NIST Special Database 19 and converted to a 28x28 pixel image format and dataset structure that directly matches the MNIST dataset.
- URL: https://www.nist.gov/node/1298471/emnist-dataset
DatasetBuilder
:tfds.image.mnist.EMNIST
emnist
is configured with tfds.image.mnist.EMNISTConfig
and has the
following configurations predefined (defaults to the first one):
byclass
(v1.0.1
) (Size: 535.73 MiB
): EMNIST ByClassbymerge
(v1.0.1
) (Size: 535.73 MiB
): EMNIST ByMergebalanced
(v1.0.1
) (Size: 535.73 MiB
): EMNIST Balancedletters
(v1.0.1
) (Size: 535.73 MiB
): EMNIST Lettersdigits
(v1.0.1
) (Size: 535.73 MiB
): EMNIST Digitsmnist
(v1.0.1
) (Size: 535.73 MiB
): EMNIST MNIST
emnist/byclass
EMNIST ByClass
Versions:
1.0.1
(default):3.0.0
: New split API (https://tensorflow.org/datasets/splits)
Statistics
Split | Examples |
---|---|
ALL | 814,255 |
TRAIN | 697,932 |
TEST | 116,323 |
Features
FeaturesDict({
'image': Image(shape=(28, 28, 1), dtype=tf.uint8),
'label': ClassLabel(shape=(), dtype=tf.int64, num_classes=62),
})
Homepage
Supervised keys (for as_supervised=True
)
(u'image', u'label')
emnist/bymerge
EMNIST ByMerge
Versions:
1.0.1
(default):3.0.0
: New split API (https://tensorflow.org/datasets/splits)
Statistics
Split | Examples |
---|---|
ALL | 814,255 |
TRAIN | 697,932 |
TEST | 116,323 |
Features
FeaturesDict({
'image': Image(shape=(28, 28, 1), dtype=tf.uint8),
'label': ClassLabel(shape=(), dtype=tf.int64, num_classes=47),
})
Homepage
Supervised keys (for as_supervised=True
)
(u'image', u'label')
emnist/balanced
EMNIST Balanced
Versions:
1.0.1
(default):3.0.0
: New split API (https://tensorflow.org/datasets/splits)
Statistics
Split | Examples |
---|---|
ALL | 131,600 |
TRAIN | 112,800 |
TEST | 18,800 |
Features
FeaturesDict({
'image': Image(shape=(28, 28, 1), dtype=tf.uint8),
'label': ClassLabel(shape=(), dtype=tf.int64, num_classes=47),
})
Homepage
Supervised keys (for as_supervised=True
)
(u'image', u'label')
emnist/letters
EMNIST Letters
Versions:
1.0.1
(default):3.0.0
: New split API (https://tensorflow.org/datasets/splits)
Statistics
Split | Examples |
---|---|
ALL | 103,600 |
TRAIN | 88,800 |
TEST | 14,800 |
Features
FeaturesDict({
'image': Image(shape=(28, 28, 1), dtype=tf.uint8),
'label': ClassLabel(shape=(), dtype=tf.int64, num_classes=37),
})
Homepage
Supervised keys (for as_supervised=True
)
(u'image', u'label')
emnist/digits
EMNIST Digits
Versions:
1.0.1
(default):3.0.0
: New split API (https://tensorflow.org/datasets/splits)
Statistics
Split | Examples |
---|---|
ALL | 280,000 |
TRAIN | 240,000 |
TEST | 40,000 |
Features
FeaturesDict({
'image': Image(shape=(28, 28, 1), dtype=tf.uint8),
'label': ClassLabel(shape=(), dtype=tf.int64, num_classes=10),
})
Homepage
Supervised keys (for as_supervised=True
)
(u'image', u'label')
emnist/mnist
EMNIST MNIST
Versions:
1.0.1
(default):3.0.0
: New split API (https://tensorflow.org/datasets/splits)
Statistics
Split | Examples |
---|---|
ALL | 70,000 |
TRAIN | 60,000 |
TEST | 10,000 |
Features
FeaturesDict({
'image': Image(shape=(28, 28, 1), dtype=tf.uint8),
'label': ClassLabel(shape=(), dtype=tf.int64, num_classes=10),
})
Homepage
Supervised keys (for as_supervised=True
)
(u'image', u'label')
Citation
@article{cohen_afshar_tapson_schaik_2017,
title={EMNIST: Extending MNIST to handwritten letters},
DOI={10.1109/ijcnn.2017.7966217},
journal={2017 International Joint Conference on Neural Networks (IJCNN)},
author={Cohen, Gregory and Afshar, Saeed and Tapson, Jonathan and Schaik, Andre Van},
year={2017}
}