multi_news

Multi-News consists of news articles and human-written summaries of these articles from the site newser.com. Each summary is professionally written by editors and includes links to the original articles cited.

There are two features:

- document: text of the news articles, separated by the special token "|||||".
- summary: the news summary.
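Because each document value packs several source articles into one string, a common first step is to split it back into individual articles. The following is a minimal sketch of that step; the helper name split_documents and the sample string are hypothetical, and stripping whitespace and dropping empty pieces are assumptions about how the separator is padded in practice.

```python
from typing import List

# Special token that separates source articles inside the 'document' feature.
SEPARATOR = "|||||"


def split_documents(document: str) -> List[str]:
    """Split one 'document' string into its individual source articles.

    Hypothetical helper: surrounding whitespace is stripped and empty
    pieces (for example, after a trailing separator) are dropped.
    """
    parts = (part.strip() for part in document.split(SEPARATOR))
    return [part for part in parts if part]


# Made-up sample string, not taken from the dataset.
example = "First article text. ||||| Second article text. |||||"
articles = split_documents(example)
```

Here articles holds one entry per source article, which is convenient for models that encode each article separately before summarizing.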

Features

FeaturesDict({
    'document': Text(shape=(), dtype=tf.string),
    'summary': Text(shape=(), dtype=tf.string),
})

Statistics

Split Examples
ALL 56,216
TRAIN 44,972
TEST 5,622
VALIDATION 5,622

Homepage

Supervised keys (for as_supervised=True)

(u'document', u'summary')
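With as_supervised=True, each example is returned as a (document, summary) tuple instead of a feature dictionary. The sketch below shows one way this might be used; it assumes the tensorflow_datasets package is installed and that the dataset is registered under the name multi_news. The helper names decode_pair and first_pair are hypothetical, and the first call to tfds.load downloads and prepares the data.

```python
def decode_pair(document: bytes, summary: bytes):
    """Decode the raw byte strings of one supervised pair to Python str."""
    return document.decode("utf-8"), summary.decode("utf-8")


def first_pair(split: str = "validation"):
    """Return the first (document, summary) pair of a split as text.

    Hypothetical helper. The import is kept inside the function so that
    the module can be loaded without tensorflow_datasets installed.
    """
    import tensorflow_datasets as tfds

    # as_supervised=True yields (document, summary) tuples.
    dataset = tfds.load("multi_news", split=split, as_supervised=True)
    for document, summary in tfds.as_numpy(dataset.take(1)):
        return decode_pair(document, summary)
    return None
```

Calling first_pair("validation") would return one decoded input and target pair, which is a quick way to inspect the data before building a training pipeline.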

Citation

@misc{alex2019multinews,
    title={Multi-News: a Large-Scale Multi-Document Summarization Dataset and Abstractive Hierarchical Model},
    author={Alexander R. Fabbri and Irene Li and Tianwei She and Suyi Li and Dragomir R. Radev},
    year={2019},
    eprint={1906.01749},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}