billsum

  • Description:

BillSum, summarization of US Congressional and California state bills.

There are several features:

  • text: bill text.
  • summary: summary of the bills.
  • title: title of the bills. features for us bills. ca bills does not have.
  • text_len: number of chars in text.
  • sum_len: number of chars in summary.

  • Homepage: https://github.com/FiscalNote/BillSum

  • Source code: tfds.summarization.Billsum

  • Versions:

    • 3.0.0 (default): No release notes.
  • Download size: 64.14 MiB

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Splits:

Split Examples
'ca_test' 1,237
'test' 3,269
'train' 18,949
  • Features:
FeaturesDict({
    'summary': Text(shape=(), dtype=tf.string),
    'text': Text(shape=(), dtype=tf.string),
    'title': Text(shape=(), dtype=tf.string),
})
@misc{kornilova2019billsum,
    title={BillSum: A Corpus for Automatic Summarization of US Legislation},
    author={Anastassia Kornilova and Vlad Eidelman},
    year={2019},
    eprint={1910.00523},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}