試験

参考文献:

アライメント

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/alignments')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'full' 10834
  • 特徴
{
    "source_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "target_id_list": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

多言語

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/multilingual')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'test' 13510
'train' 7961
'validation' 2672
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

多言語_with_para

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/multilingual_with_para')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'test' 13510
'train' 7961
'validation' 2672
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

クロスリンガル_テスト

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_test')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'test' 19736
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

crosslingual_with_para_test

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_with_para_test')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'test' 19736
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

クロスリンガル_bg

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_bg')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 2344
'validation' 593
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

Crosslingual_with_para_bg

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_with_para_bg')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 2344
'validation' 593
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

クロスリンガル_hr

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_hr')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 2341
'validation' 538
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

crosslingual_with_para_hr

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_with_para_hr')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 2341
'validation' 538
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

クロスリンガル_hu

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_hu')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 1731年
'validation' 536
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

crosslingual_with_para_hu

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_with_para_hu')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 1731年
'validation' 536
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

クロスリンガル_it

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_it')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 1010
'validation' 246
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

crosslingual_with_para_it

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_with_para_it')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 1010
'validation' 246
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

クロスリンガル_mk

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_mk')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 1665年
'validation' 410
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

crosslingual_with_para_mk

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_with_para_mk')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 1665年
'validation' 410
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

crosslingual_pl

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_pl')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 1577年
'validation' 394
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

crosslingual_with_para_pl

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_with_para_pl')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 1577年
'validation' 394
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

クロスリンガル_pt

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_pt')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 740
'validation' 184
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

crosslingual_with_para_pt

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_with_para_pt')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 740
'validation' 184
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

クロスリンガル_sq

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_sq')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 1194
'validation' 311
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

crosslingual_with_para_sq

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_with_para_sq')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 1194
'validation' 311
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

クロスリンガル_sr

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_sr')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 1323
'validation' 314
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

crosslingual_with_para_sr

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_with_para_sr')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 1323
'validation' 314
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

クロスリンガル_tr

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_tr')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 1571年
'validation' 393
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

crosslingual_with_para_tr

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_with_para_tr')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 1571年
'validation' 393
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

クロスリンガル_vi

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_vi')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 1955年
'validation' 488
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}

crosslingual_with_para_vi

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:exams/crosslingual_with_para_vi')
  • 説明
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
  • ライセンス: CC-BY-SA-4.0
  • バージョン: 1.0.0
  • 分割:
スプリット
'train' 1955年
'validation' 488
  • 特徴
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "stem": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "choices": {
            "feature": {
                "text": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "label": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                },
                "para": {
                    "dtype": "string",
                    "id": null,
                    "_type": "Value"
                }
            },
            "length": -1,
            "id": null,
            "_type": "Sequence"
        }
    },
    "answerKey": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "info": {
        "grade": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "subject": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "language": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        }
    }
}