emnist

The EMNIST dataset is a set of handwritten character digits derived from the NIST Special Database 19 and converted to a 28x28 pixel image format and dataset structure that directly matches the MNIST dataset.

emnist is configured with tfds.image.mnist.EMNISTConfig and has the following configurations predefined (defaults to the first one):

  • byclass (v1.0.1) (Size: 535.73 MiB): EMNIST ByClass

  • bymerge (v1.0.1) (Size: 535.73 MiB): EMNIST ByMerge

  • balanced (v1.0.1) (Size: 535.73 MiB): EMNIST Balanced

  • letters (v1.0.1) (Size: 535.73 MiB): EMNIST Letters

  • digits (v1.0.1) (Size: 535.73 MiB): EMNIST Digits

  • mnist (v1.0.1) (Size: 535.73 MiB): EMNIST MNIST

emnist/byclass

EMNIST ByClass

Versions:

  • 1.0.1 (default):
  • 3.0.0: New split API (https://tensorflow.org/datasets/splits)

Statistics

Split Examples
ALL 814,255
TRAIN 697,932
TEST 116,323

Features

FeaturesDict({
    'image': Image(shape=(28, 28, 1), dtype=tf.uint8),
    'label': ClassLabel(shape=(), dtype=tf.int64, num_classes=62),
})

Homepage

Supervised keys (for as_supervised=True)

(u'image', u'label')

emnist/bymerge

EMNIST ByMerge

Versions:

  • 1.0.1 (default):
  • 3.0.0: New split API (https://tensorflow.org/datasets/splits)

Statistics

Split Examples
ALL 814,255
TRAIN 697,932
TEST 116,323

Features

FeaturesDict({
    'image': Image(shape=(28, 28, 1), dtype=tf.uint8),
    'label': ClassLabel(shape=(), dtype=tf.int64, num_classes=47),
})

Homepage

Supervised keys (for as_supervised=True)

(u'image', u'label')

emnist/balanced

EMNIST Balanced

Versions:

  • 1.0.1 (default):
  • 3.0.0: New split API (https://tensorflow.org/datasets/splits)

Statistics

Split Examples
ALL 131,600
TRAIN 112,800
TEST 18,800

Features

FeaturesDict({
    'image': Image(shape=(28, 28, 1), dtype=tf.uint8),
    'label': ClassLabel(shape=(), dtype=tf.int64, num_classes=47),
})

Homepage

Supervised keys (for as_supervised=True)

(u'image', u'label')

emnist/letters

EMNIST Letters

Versions:

  • 1.0.1 (default):
  • 3.0.0: New split API (https://tensorflow.org/datasets/splits)

Statistics

Split Examples
ALL 103,600
TRAIN 88,800
TEST 14,800

Features

FeaturesDict({
    'image': Image(shape=(28, 28, 1), dtype=tf.uint8),
    'label': ClassLabel(shape=(), dtype=tf.int64, num_classes=37),
})

Homepage

Supervised keys (for as_supervised=True)

(u'image', u'label')

emnist/digits

EMNIST Digits

Versions:

  • 1.0.1 (default):
  • 3.0.0: New split API (https://tensorflow.org/datasets/splits)

Statistics

Split Examples
ALL 280,000
TRAIN 240,000
TEST 40,000

Features

FeaturesDict({
    'image': Image(shape=(28, 28, 1), dtype=tf.uint8),
    'label': ClassLabel(shape=(), dtype=tf.int64, num_classes=10),
})

Homepage

Supervised keys (for as_supervised=True)

(u'image', u'label')

emnist/mnist

EMNIST MNIST

Versions:

  • 1.0.1 (default):
  • 3.0.0: New split API (https://tensorflow.org/datasets/splits)

Statistics

Split Examples
ALL 70,000
TRAIN 60,000
TEST 10,000

Features

FeaturesDict({
    'image': Image(shape=(28, 28, 1), dtype=tf.uint8),
    'label': ClassLabel(shape=(), dtype=tf.int64, num_classes=10),
})

Homepage

Supervised keys (for as_supervised=True)

(u'image', u'label')

Citation

@article{cohen_afshar_tapson_schaik_2017,
    title={EMNIST: Extending MNIST to handwritten letters},
    DOI={10.1109/ijcnn.2017.7966217},
    journal={2017 International Joint Conference on Neural Networks (IJCNN)},
    author={Cohen, Gregory and Afshar, Saeed and Tapson, Jonathan and Schaik, Andre Van},
    year={2017}
}