References:
ar-de
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_multi/ar-de')
- Description:
This is a collection of translated documents from the United Nations. This corpus is available in all 6 official languages of the UN, consisting of around 300 million words per language
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' |
165090 |
- Features:
{
"translation": {
"languages": [
"ar",
"de"
],
"id": null,
"_type": "Translation"
}
}
ar-en
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_multi/ar-en')
- Description:
This is a collection of translated documents from the United Nations. This corpus is available in all 6 official languages of the UN, consisting of around 300 million words per language
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' |
9759125 |
- Features:
{
"translation": {
"languages": [
"ar",
"en"
],
"id": null,
"_type": "Translation"
}
}
ar-es
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_multi/ar-es')
- Description:
This is a collection of translated documents from the United Nations. This corpus is available in all 6 official languages of the UN, consisting of around 300 million words per language
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' |
10119379 |
- Features:
{
"translation": {
"languages": [
"ar",
"es"
],
"id": null,
"_type": "Translation"
}
}
ar-fr
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_multi/ar-fr')
- Description:
This is a collection of translated documents from the United Nations. This corpus is available in all 6 official languages of the UN, consisting of around 300 million words per language
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' |
9929567 |
- Features:
{
"translation": {
"languages": [
"ar",
"fr"
],
"id": null,
"_type": "Translation"
}
}
ar-ru
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_multi/ar-ru')
- Description:
This is a collection of translated documents from the United Nations. This corpus is available in all 6 official languages of the UN, consisting of around 300 million words per language
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' |
10206243 |
- Features:
{
"translation": {
"languages": [
"ar",
"ru"
],
"id": null,
"_type": "Translation"
}
}
ar-zh
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_multi/ar-zh')
- Description:
This is a collection of translated documents from the United Nations. This corpus is available in all 6 official languages of the UN, consisting of around 300 million words per language
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' |
9832293 |
- Features:
{
"translation": {
"languages": [
"ar",
"zh"
],
"id": null,
"_type": "Translation"
}
}
de-en
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_multi/de-en')
- Description:
This is a collection of translated documents from the United Nations. This corpus is available in all 6 official languages of the UN, consisting of around 300 million words per language
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' |
162981 |
- Features:
{
"translation": {
"languages": [
"de",
"en"
],
"id": null,
"_type": "Translation"
}
}
de-es
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_multi/de-es')
- Description:
This is a collection of translated documents from the United Nations. This corpus is available in all 6 official languages of the UN, consisting of around 300 million words per language
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' |
162078 |
- Features:
{
"translation": {
"languages": [
"de",
"es"
],
"id": null,
"_type": "Translation"
}
}
de-fr
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_multi/de-fr')
- Description:
This is a collection of translated documents from the United Nations. This corpus is available in all 6 official languages of the UN, consisting of around 300 million words per language
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' |
164025 |
- Features:
{
"translation": {
"languages": [
"de",
"fr"
],
"id": null,
"_type": "Translation"
}
}
de-ru
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_multi/de-ru')
- Description:
This is a collection of translated documents from the United Nations. This corpus is available in all 6 official languages of the UN, consisting of around 300 million words per language
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' |
164792 |
- Features:
{
"translation": {
"languages": [
"de",
"ru"
],
"id": null,
"_type": "Translation"
}
}
de-zh
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_multi/de-zh')
- Description:
This is a collection of translated documents from the United Nations. This corpus is available in all 6 official languages of the UN, consisting of around 300 million words per language
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' |
176933 |
- Features:
{
"translation": {
"languages": [
"de",
"zh"
],
"id": null,
"_type": "Translation"
}
}
en-es
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_multi/en-es')
- Description:
This is a collection of translated documents from the United Nations. This corpus is available in all 6 official languages of the UN, consisting of around 300 million words per language
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' |
11350967 |
- Features:
{
"translation": {
"languages": [
"en",
"es"
],
"id": null,
"_type": "Translation"
}
}
en-fr
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_multi/en-fr')
- Description:
This is a collection of translated documents from the United Nations. This corpus is available in all 6 official languages of the UN, consisting of around 300 million words per language
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' |
13172019 |
- Features:
{
"translation": {
"languages": [
"en",
"fr"
],
"id": null,
"_type": "Translation"
}
}
en-ru
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_multi/en-ru')
- Description:
This is a collection of translated documents from the United Nations. This corpus is available in all 6 official languages of the UN, consisting of around 300 million words per language
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' |
11654416 |
- Features:
{
"translation": {
"languages": [
"en",
"ru"
],
"id": null,
"_type": "Translation"
}
}
en-zh
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_multi/en-zh')
- Description:
This is a collection of translated documents from the United Nations. This corpus is available in all 6 official languages of the UN, consisting of around 300 million words per language
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' |
9564315 |
- Features:
{
"translation": {
"languages": [
"en",
"zh"
],
"id": null,
"_type": "Translation"
}
}
es-fr
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_multi/es-fr')
- Description:
This is a collection of translated documents from the United Nations. This corpus is available in all 6 official languages of the UN, consisting of around 300 million words per language
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' |
11441889 |
- Features:
{
"translation": {
"languages": [
"es",
"fr"
],
"id": null,
"_type": "Translation"
}
}
es-ru
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_multi/es-ru')
- Description:
This is a collection of translated documents from the United Nations. This corpus is available in all 6 official languages of the UN, consisting of around 300 million words per language
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' |
10605056 |
- Features:
{
"translation": {
"languages": [
"es",
"ru"
],
"id": null,
"_type": "Translation"
}
}
es-zh
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_multi/es-zh')
- Description:
This is a collection of translated documents from the United Nations. This corpus is available in all 6 official languages of the UN, consisting of around 300 million words per language
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' |
9847770 |
- Features:
{
"translation": {
"languages": [
"es",
"zh"
],
"id": null,
"_type": "Translation"
}
}
fr-ru
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_multi/fr-ru')
- Description:
This is a collection of translated documents from the United Nations. This corpus is available in all 6 official languages of the UN, consisting of around 300 million words per language
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' |
11761738 |
- Features:
{
"translation": {
"languages": [
"fr",
"ru"
],
"id": null,
"_type": "Translation"
}
}
fr-zh
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_multi/fr-zh')
- Description:
This is a collection of translated documents from the United Nations. This corpus is available in all 6 official languages of the UN, consisting of around 300 million words per language
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' |
9690914 |
- Features:
{
"translation": {
"languages": [
"fr",
"zh"
],
"id": null,
"_type": "Translation"
}
}
ru-zh
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_multi/ru-zh')
- Description:
This is a collection of translated documents from the United Nations. This corpus is available in all 6 official languages of the UN, consisting of around 300 million words per language
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' |
9557007 |
- Features:
{
"translation": {
"languages": [
"ru",
"zh"
],
"id": null,
"_type": "Translation"
}
}