- Description:
This dataset contains machine translations of MNLI into each of the XNLI languages. The translation data is provided by XTREME. Note that this is different from the machine translated data provided by the original XNLI paper.
Source code:
tfds.text.xtreme_xnli.XtremeXnli
Versions:
1.1.0
(default): No release notes.
Download size:
2.31 GiB
Dataset size:
1.59 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
392,570 |
- Feature structure:
FeaturesDict({
'hypothesis': TranslationVariableLanguages({
'language': Text(shape=(), dtype=string),
'translation': Text(shape=(), dtype=string),
}),
'label': ClassLabel(shape=(), dtype=int64, num_classes=3),
'premise': Translation({
'ar': Text(shape=(), dtype=string),
'bg': Text(shape=(), dtype=string),
'de': Text(shape=(), dtype=string),
'el': Text(shape=(), dtype=string),
'en': Text(shape=(), dtype=string),
'es': Text(shape=(), dtype=string),
'fr': Text(shape=(), dtype=string),
'hi': Text(shape=(), dtype=string),
'ru': Text(shape=(), dtype=string),
'sw': Text(shape=(), dtype=string),
'th': Text(shape=(), dtype=string),
'tr': Text(shape=(), dtype=string),
'ur': Text(shape=(), dtype=string),
'vi': Text(shape=(), dtype=string),
'zh': Text(shape=(), dtype=string),
}),
})
- Feature documentation:
Feature | Class | Shape | Dtype | Description |
---|---|---|---|---|
FeaturesDict | ||||
hypothesis | TranslationVariableLanguages | |||
hypothesis/language | Text | string | ||
hypothesis/translation | Text | string | ||
label | ClassLabel | int64 | ||
premise | Translation | |||
premise/ar | Text | string | ||
premise/bg | Text | string | ||
premise/de | Text | string | ||
premise/el | Text | string | ||
premise/en | Text | string | ||
premise/es | Text | string | ||
premise/fr | Text | string | ||
premise/hi | Text | string | ||
premise/ru | Text | string | ||
premise/sw | Text | string | ||
premise/th | Text | string | ||
premise/tr | Text | string | ||
premise/ur | Text | string | ||
premise/vi | Text | string | ||
premise/zh | Text | string |
Supervised keys (See
as_supervised
doc):None
Figure (tfds.show_examples): Not supported.
Examples (tfds.as_dataframe):
- Citation:
@article{hu2020xtreme,
author = {Junjie Hu and Sebastian Ruder and Aditya Siddhant and Graham Neubig and Orhan Firat and Melvin Johnson},
title = {XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization},
journal = {CoRR},
volume = {abs/2003.11080},
year = {2020},
archivePrefix = {arXiv},
eprint = {2003.11080}
}