  • Description:

Extreme Summarization (XSum) Dataset.

There are two features:

  - document: Input news article.
  - summary: One sentence summary of the article.

This data needs to be manually downloaded and extracted as described in the dataset's download instructions. The folder 'xsum-extracts-from-downloads' needs to be compressed as 'xsum-extracts-from-downloads.tar.gz' and placed in the manual download folder.
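The compression step can be sketched with Python's standard library. This is a minimal sketch, not the official preparation script: the function name `pack_xsum_extracts` and both path arguments are hypothetical, and it assumes the archive should keep 'xsum-extracts-from-downloads' as its top-level folder.

```python
import tarfile
from pathlib import Path

ARCHIVE_NAME = "xsum-extracts-from-downloads.tar.gz"


def pack_xsum_extracts(extracts_dir, manual_dir):
    """Compress the extracted XSum folder into the manual download folder.

    Hypothetical helper: both arguments are paths chosen by the caller.
    """
    extracts_dir = Path(extracts_dir)
    manual_dir = Path(manual_dir)
    if not extracts_dir.is_dir():
        raise FileNotFoundError(f"No such folder: {extracts_dir}")

    # Create the manual download folder if it does not exist yet.
    manual_dir.mkdir(parents=True, exist_ok=True)
    archive_path = manual_dir / ARCHIVE_NAME

    with tarfile.open(archive_path, "w:gz") as tar:
        # Assumption: the archive keeps the folder name as its top-level entry.
        tar.add(extracts_dir, arcname="xsum-extracts-from-downloads")
    return archive_path
```

A call such as `pack_xsum_extracts("xsum-extracts-from-downloads", manual_dir)` writes the archive into `manual_dir`. Which directory counts as the manual download folder depends on the local setup; in a default TensorFlow Datasets installation it is typically `~/tensorflow_datasets/downloads/manual/`, but treat that path as an assumption and check your own configuration.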

  • Splits:

Split          Examples
'test'         11,301
'train'        203,577
'validation'   11,305
  • Features:
    'document': Text(shape=(), dtype=tf.string),
    'summary': Text(shape=(), dtype=tf.string),
  • Citation:
  title={Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization},
  author={Shashi Narayan and Shay B. Cohen and Mirella Lapata},