autshumato

References:

autshumato-en-tn

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:autshumato/autshumato-en-tn')
  • Description:
Multilingual information access is stipulated in the South African constitution. In practise, this
is hampered by a lack of resources and capacity to perform the large volumes of translation
work required to realise multilingual information access. One of the aims of the Autshumato
project is to develop machine translation systems for three South African languages pairs.
  • License: No known license
  • Version: 1.0.0
  • Splits:
Split Examples
'train' 159000
  • Features:
{
    "translation": {
        "languages": [
            "en",
            "tn"
        ],
        "id": null,
        "_type": "Translation"
    }
}

autshumato-en-zu

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:autshumato/autshumato-en-zu')
  • Description:
Multilingual information access is stipulated in the South African constitution. In practise, this
is hampered by a lack of resources and capacity to perform the large volumes of translation
work required to realise multilingual information access. One of the aims of the Autshumato
project is to develop machine translation systems for three South African languages pairs.
  • License: No known license
  • Version: 1.0.0
  • Splits:
Split Examples
'train' 35489
  • Features:
{
    "translation": {
        "languages": [
            "en",
            "zu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

autshumato-en-ts

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:autshumato/autshumato-en-ts')
  • Description:
Multilingual information access is stipulated in the South African constitution. In practise, this
is hampered by a lack of resources and capacity to perform the large volumes of translation
work required to realise multilingual information access. One of the aims of the Autshumato
project is to develop machine translation systems for three South African languages pairs.
  • License: No known license
  • Version: 1.0.0
  • Splits:
Split Examples
'train' 450000
  • Features:
{
    "translation": {
        "languages": [
            "en",
            "ts"
        ],
        "id": null,
        "_type": "Translation"
    }
}

autshumato-en-ts-manual

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:autshumato/autshumato-en-ts-manual')
  • Description:
Multilingual information access is stipulated in the South African constitution. In practise, this
is hampered by a lack of resources and capacity to perform the large volumes of translation
work required to realise multilingual information access. One of the aims of the Autshumato
project is to develop machine translation systems for three South African languages pairs.
  • License: No known license
  • Version: 1.0.0
  • Splits:
Split Examples
'train' 92396
  • Features:
{
    "translation": {
        "languages": [
            "en",
            "ts"
        ],
        "id": null,
        "_type": "Translation"
    }
}

autshumato-tn

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:autshumato/autshumato-tn')
  • Description:
Multilingual information access is stipulated in the South African constitution. In practise, this
is hampered by a lack of resources and capacity to perform the large volumes of translation
work required to realise multilingual information access. One of the aims of the Autshumato
project is to develop machine translation systems for three South African languages pairs.
  • License: No known license
  • Version: 1.0.0
  • Splits:
Split Examples
'train' 38206
  • Features:
{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

autshumato-ts

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:autshumato/autshumato-ts')
  • Description:
Multilingual information access is stipulated in the South African constitution. In practise, this
is hampered by a lack of resources and capacity to perform the large volumes of translation
work required to realise multilingual information access. One of the aims of the Autshumato
project is to develop machine translation systems for three South African languages pairs.
  • License: No known license
  • Version: 1.0.0
  • Splits:
Split Examples
'train' 58398
  • Features:
{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}