Missed TensorFlow Dev Summit? Check out the video playlist. Watch recordings

tfds.Split

View source on GitHub

Enum for dataset splits.

@staticmethod
tfds.Split(
    cls, name
)

Datasets are typically split into different subsets to be used at various stages of training and evaluation.

  • TRAIN: the training data.
  • VALIDATION: the validation data. If present, this is typically used as evaluation data while iterating on a model (e.g. changing hyperparameters, model architecture, etc.).
  • TEST: the testing data. This is the data to report metrics on. Typically you do not want to use this during model iteration as you may overfit to it.
  • ALL: Special value, never defined by a dataset, but corresponding to all defined splits of a dataset merged together.

See the guide on splits for more information.

Class Variables

  • ALL
  • TEST
  • TRAIN
  • VALIDATION