ro_sts_parallel

References:

ro_sts_parallel

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:ro_sts_parallel/ro_sts_parallel')
  • Description:
The RO-STS-Parallel (a Parallel Romanian English dataset - translation of the Semantic Textual Similarity) contains 17256 sentences in Romanian and English. It is a high-quality translation of the English STS benchmark dataset into Romanian.
  • License: CC BY-SA 4.0 License
  • Version: 1.0.0
  • Splits:
Split Examples
'test' 2759
'train' 11499
'validation' 3001
  • Features:
{
    "translation": {
        "languages": [
            "ro",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

rosts-parallel-en-ro

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:ro_sts_parallel/rosts-parallel-en-ro')
  • Description:
The RO-STS-Parallel (a Parallel Romanian English dataset - translation of the Semantic Textual Similarity) contains 17256 sentences in Romanian and English. It is a high-quality translation of the English STS benchmark dataset into Romanian.
  • License: No known license
  • Version: 1.0.0
  • Splits:
Split Examples
'test' 2759
'train' 11499
'validation' 3001
  • Features:
{
    "translation": {
        "languages": [
            "en",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}