- Deskripsi :
Dataset ini berisi terjemahan mesin MNLI ke setiap bahasa XNLI. Data terjemahan disediakan oleh XTREME. Perhatikan bahwa ini berbeda dari data terjemahan mesin yang disediakan oleh kertas XNLI asli.
Situs web : https://www.nyu.edu/projects/bowman/xnli/
Kode sumber :
tfds.text.xtreme_xnli.XtremeXnli
Versi :
-
1.1.0
(default): Tidak ada catatan rilis.
-
Ukuran unduhan :
2.31 GiB
Ukuran set data :
1.59 GiB
Cache otomatis ( dokumentasi ): Tidak
Split :
Membagi | Contoh |
---|---|
'train' | 392.570 |
- Fitur :
FeaturesDict({
'hypothesis': TranslationVariableLanguages({
'language': Text(shape=(), dtype=tf.string),
'translation': Text(shape=(), dtype=tf.string),
}),
'label': ClassLabel(shape=(), dtype=tf.int64, num_classes=3),
'premise': Translation({
'ar': Text(shape=(), dtype=tf.string),
'bg': Text(shape=(), dtype=tf.string),
'de': Text(shape=(), dtype=tf.string),
'el': Text(shape=(), dtype=tf.string),
'en': Text(shape=(), dtype=tf.string),
'es': Text(shape=(), dtype=tf.string),
'fr': Text(shape=(), dtype=tf.string),
'hi': Text(shape=(), dtype=tf.string),
'ru': Text(shape=(), dtype=tf.string),
'sw': Text(shape=(), dtype=tf.string),
'th': Text(shape=(), dtype=tf.string),
'tr': Text(shape=(), dtype=tf.string),
'ur': Text(shape=(), dtype=tf.string),
'vi': Text(shape=(), dtype=tf.string),
'zh': Text(shape=(), dtype=tf.string),
}),
})
Kunci yang diawasi (Lihat
as_supervised
doc ):None
Kutipan :
@article{hu2020xtreme,
author = {Junjie Hu and Sebastian Ruder and Aditya Siddhant and Graham Neubig and Orhan Firat and Melvin Johnson},
title = {XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization},
journal = {CoRR},
volume = {abs/2003.11080},
year = {2020},
archivePrefix = {arXiv},
eprint = {2003.11080}
}
Gambar ( tfds.show_examples ): Tidak didukung.
Contoh ( tfds.as_dataframe ):