- Description:
A collection of email messages of employees in the Enron Corporation.
There are two features: - email_body: email body text. - subject_line: email subject text.
Homepage: https://github.com/ryanzhumich/AESLC
Source code:
tfds.summarization.Aeslc
Versions:
1.0.0
(default): No release notes.
Download size:
11.10 MiB
Dataset size:
Unknown size
Auto-cached (documentation): Unknown
Splits:
Split | Examples |
---|---|
'test' |
1,906 |
'train' |
14,436 |
'validation' |
1,960 |
- Features:
FeaturesDict({
'email_body': Text(shape=(), dtype=tf.string),
'subject_line': Text(shape=(), dtype=tf.string),
})
Supervised keys (See
as_supervised
doc):('email_body', 'subject_line')
Citation:
@misc{zhang2019email,
title={This Email Could Save Your Life: Introducing the Task of Email Subject Line Generation},
author={Rui Zhang and Joel Tetreault},
year={2019},
eprint={1906.03497},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
Figure (tfds.show_examples): Not supported.
Examples (tfds.as_dataframe):