ML Community Day is November 9! Join us for updates from TensorFlow, JAX, and more Learn more


  • Description:

MC-TACO is a dataset of 13k question-answer pairs that require temporal commonsense comprehension. The dataset contains five temporal properties:

  1. duration (how long an event takes)
  2. temporal ordering (typical order of events)
  3. typical time (when an event occurs)
  4. frequency (how often an event occurs)
  5. stationarity (whether a state is maintained for a very long time or indefinitely)

We hope that this dataset can promote the future exploration of this particular class of reasoning problems.

Split Examples
'test' 9,442
'validation' 3,783
  • Features:
    'answer': Text(shape=(), dtype=tf.string),
    'category': ClassLabel(shape=(), dtype=tf.int64, num_classes=5),
    'label': ClassLabel(shape=(), dtype=tf.int64, num_classes=2),
    'question': Text(shape=(), dtype=tf.string),
    'sentence': Text(shape=(), dtype=tf.string),
  • Citation:
    author = {Ben Zhou, Daniel Khashabi, Qiang Ning and Dan Roth},
    title = {"Going on a vacation" takes longer than "Going for a walk": A Study of Temporal Commonsense Understanding },
    booktitle = {EMNLP},
    year = {2019},