titanic

  • Description:

This dataset describes the survival status of individual passengers on the Titanic. The original dataset represents missing values with '?'. In this version, missing float and int values are replaced with -1, and missing string values are replaced with 'Unknown'.
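Because the sentinel values have the same type as real data, downstream code may want to turn them back into an explicit missing marker. The following is a minimal sketch in plain Python, not part of the dataset's API. It assumes each example has already been converted to a dict of Python scalars (string features may arrive as bytes), and the helper name `restore_missing` is hypothetical.

```python
# Hypothetical helper: map the sentinel values described above back to None.
NUMERIC_SENTINEL = -1
STRING_SENTINELS = ("Unknown", b"Unknown")

# Only plain Tensor features are handled here. ClassLabel features
# (embarked, pclass, sex, survived) hold integer class ids and are left alone.
NUMERIC_FEATURES = ("age", "body", "fare", "parch", "sibsp")
STRING_FEATURES = ("boat", "cabin", "home.dest", "name", "ticket")


def restore_missing(record):
    """Return a copy of `record` with sentinel values replaced by None."""
    cleaned = dict(record)
    for key in NUMERIC_FEATURES:
        if key in cleaned and cleaned[key] == NUMERIC_SENTINEL:
            cleaned[key] = None
    for key in STRING_FEATURES:
        if key in cleaned and cleaned[key] in STRING_SENTINELS:
            cleaned[key] = None
    return cleaned


# Made-up partial record, for illustration only.
example = {"age": -1.0, "fare": 7.25, "cabin": b"Unknown", "ticket": b"12345"}
print(restore_missing(example))
```

This relies on the assumption that -1 never occurs as a genuine value in the numeric columns, which holds for quantities such as age, fare, and counts of relatives.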

  • Splits:
Split      Examples
'train'    1,309
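Since only a 'train' split is provided, a validation set has to be carved out of it. One way to do this is sketched below with a seeded shuffle over example indices; the function name `holdout_indices`, the 20% fraction, and the seed are illustrative assumptions. TFDS also accepts slice strings such as 'train[:80%]' in the `split` argument of `tfds.load`, which may be simpler when a fixed, unshuffled slice is acceptable.

```python
import random

NUM_EXAMPLES = 1309  # size of the 'train' split listed above


def holdout_indices(num_examples=NUM_EXAMPLES, holdout_fraction=0.2, seed=0):
    """Split range(num_examples) into (train, holdout) index lists."""
    indices = list(range(num_examples))
    # A dedicated Random instance keeps the split reproducible
    # without touching the global random state.
    random.Random(seed).shuffle(indices)
    num_holdout = int(num_examples * holdout_fraction)
    return indices[num_holdout:], indices[:num_holdout]


train_idx, holdout_idx = holdout_indices()
print(len(train_idx), len(holdout_idx))
```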
  • Feature structure:
FeaturesDict({
    'age': float32,
    'boat': string,
    'body': int32,
    'cabin': string,
    'embarked': ClassLabel(shape=(), dtype=int64, num_classes=4),
    'fare': float32,
    'home.dest': string,
    'name': string,
    'parch': int32,
    'pclass': ClassLabel(shape=(), dtype=int64, num_classes=3),
    'sex': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'sibsp': int32,
    'survived': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'ticket': string,
})
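As an illustration of how this structure can be used, the sketch below transcribes it into plain Python and checks a record against it. It is an assumption-laden example rather than TFDS functionality: `validate` is a hypothetical name, and records are assumed to hold Python scalars (NumPy scalars such as float32 would need converting first).

```python
# Schema transcribed from the FeaturesDict above.
TEXT = (str, bytes)
TENSOR_FEATURES = {
    "age": float, "boat": TEXT, "body": int, "cabin": TEXT,
    "fare": float, "home.dest": TEXT, "name": TEXT,
    "parch": int, "sibsp": int, "ticket": TEXT,
}
# For ClassLabel features the value is num_classes; valid ids are 0..num_classes-1.
CLASS_LABELS = {"embarked": 4, "pclass": 3, "sex": 2, "survived": 2}


def validate(record):
    """Return a list of problems; an empty list means the record fits the schema."""
    problems = []
    for name, py_type in TENSOR_FEATURES.items():
        if name not in record:
            problems.append(f"missing feature: {name}")
        elif not isinstance(record[name], py_type):
            problems.append(f"{name}: unexpected type {type(record[name]).__name__}")
    for name, num_classes in CLASS_LABELS.items():
        if name not in record:
            problems.append(f"missing feature: {name}")
        elif not 0 <= record[name] < num_classes:
            problems.append(f"{name}: class id {record[name]} outside [0, {num_classes})")
    return problems
```

A check like this can catch a mismatched column or an out-of-range class id before the data reaches a model.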
  • Feature documentation:
Feature     Class         Shape   Dtype     Description
            FeaturesDict
age         Tensor                float32
boat        Tensor                string
body        Tensor                int32
cabin       Tensor                string
embarked    ClassLabel            int64
fare        Tensor                float32
home.dest   Tensor                string
name        Tensor                string
parch       Tensor                int32
pclass      ClassLabel            int64
sex         ClassLabel            int64
sibsp       Tensor                int32
survived    ClassLabel            int64
ticket      Tensor                string
  • Supervised keys (See as_supervised doc): ({'age': 'age', 'boat': 'boat', 'body': 'body', 'cabin': 'cabin', 'embarked': 'embarked', 'fare': 'fare', 'home.dest': 'home.dest', 'name': 'name', 'parch': 'parch', 'pclass': 'pclass', 'sex': 'sex', 'sibsp': 'sibsp', 'ticket': 'ticket'}, 'survived')
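The supervised keys mean that, with `as_supervised=True`, each example is returned as a `(features, label)` pair in which `features` is a dict of every column except 'survived'. The sketch below mirrors that split in plain Python on an already-loaded record; `to_supervised` is a hypothetical helper, and `load_supervised` shows the corresponding `tfds.load` call under the assumption that the tensorflow-datasets package is installed.

```python
LABEL_KEY = "survived"  # the label named in the supervised keys above


def to_supervised(record):
    """Split a full example dict into (features, label)."""
    features = {key: value for key, value in record.items() if key != LABEL_KEY}
    return features, record[LABEL_KEY]


def load_supervised():
    """Load the 'train' split as (features, label) pairs via TFDS."""
    # Imported lazily so the helper above works without TFDS installed.
    import tensorflow_datasets as tfds

    return tfds.load("titanic", split="train", as_supervised=True)


# Made-up partial record, for illustration only.
features, label = to_supervised({"age": 29.0, "pclass": 0, "survived": 1})
print(sorted(features), label)
```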

  • Figure (tfds.show_examples): Not supported.

  • Citation:
@ONLINE {titanic,
author = "Frank E. Harrell Jr., Thomas Cason",
title  = "Titanic dataset",
month  = "oct",
year   = "2017",
url    = "https://www.openml.org/d/40945"
}