

  • Description:

The Opinosis Opinion Dataset consists of sentences extracted from reviews for 51 topics. Topics and opinions are obtained from Tripadvisor, Edmunds.com and Amazon.com.

  • Splits:

Split      Examples
'train'    51
  • Features:
FeaturesDict({
    'review_sents': Text(shape=(), dtype=tf.string),
    'summaries': Sequence(Text(shape=(), dtype=tf.string)),
})
  • Citation:
  title={Opinosis: a graph-based approach to abstractive summarization of highly redundant opinions},
  author={Ganesan, Kavita and Zhai, ChengXiang and Han, Jiawei},
  booktitle={Proceedings of the 23rd International Conference on Computational Linguistics},
  organization={Association for Computational Linguistics}
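
The split and feature structure listed above can be read with a short loader. The following is a minimal sketch, not an official recipe: it assumes the dataset is registered under the name `opinosis` in `tensorflow_datasets`, and the helper names `decode_example` and `load_opinosis` are hypothetical, introduced only for illustration. It converts the `tf.string` fields, which arrive as raw bytes, into Python strings.

```python
def decode_example(example):
    """Turn one raw record into plain Python strings.

    `example` is expected to have the two features listed above:
    'review_sents' (a single string) and 'summaries' (a sequence of strings).
    tf.string values surface as bytes, so they are decoded as UTF-8.
    """
    def to_str(value):
        return value.decode("utf-8") if isinstance(value, bytes) else str(value)

    return {
        "review_sents": to_str(example["review_sents"]),
        "summaries": [to_str(s) for s in example["summaries"]],
    }


def load_opinosis():
    """Load the 'train' split and decode every record.

    Assumption: the dataset name is 'opinosis'. The import is kept local so
    that `decode_example` can be used without TensorFlow installed.
    """
    import tensorflow_datasets as tfds

    ds = tfds.load("opinosis", split="train")
    return [decode_example(ex) for ex in tfds.as_numpy(ds)]
```

If the split table above holds, `load_opinosis()` should return 51 records, one per topic, each pairing the review sentences for that topic with its reference summaries.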