- Description:
Dataset describing the survival status of individual passengers on the Titanic. Missing values in the original dataset are represented using ?. Float and int missing values are replaced with -1, string missing values are replaced with 'Unknown'.
Homepage: https://www.openml.org/d/40945
Source code:
tfds.structured.Titanic
Versions:
2.0.0
(default): New split API (https://tensorflow.org/datasets/splits)
Download size:
114.98 KiB
Dataset size:
Unknown size
Auto-cached (documentation): Unknown
Splits:
Split | Examples |
---|---|
'train' |
1,309 |
- Features:
FeaturesDict({
'features': FeaturesDict({
'age': tf.float32,
'boat': tf.string,
'body': tf.int32,
'cabin': tf.string,
'embarked': ClassLabel(shape=(), dtype=tf.int64, num_classes=4),
'fare': tf.float32,
'home.dest': tf.string,
'name': tf.string,
'parch': tf.int32,
'pclass': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
'sex': ClassLabel(shape=(), dtype=tf.int64, num_classes=2),
'sibsp': tf.int32,
'ticket': tf.string,
}),
'survived': ClassLabel(shape=(), dtype=tf.int64, num_classes=2),
})
Supervised keys (See
as_supervised
doc):('features', 'survived')
Citation:
@ONLINE {titanic,
author = "Frank E. Harrell Jr., Thomas Cason",
title = "Titanic dataset",
month = "oct",
year = "2017",
url = "https://www.openml.org/d/40945"
}
Figure (tfds.show_examples): Not supported.
Examples (tfds.as_dataframe):