onestop_qa

참조:

다음 명령을 사용하여 TFDS에서 이 데이터세트를 로드합니다.

ds = tfds.load('huggingface:onestop_qa')
  • 설명 :
OneStopQA is a multiple choice reading comprehension dataset annotated according to the STARC (Structured Annotations for Reading Comprehension) scheme. The reading materials are Guardian articles taken from the [OneStopEnglish corpus](https://github.com/nishkalavallabhi/OneStopEnglishCorpus). Each article comes in three difficulty levels, Elementary, Intermediate and Advanced. Each paragraph is annotated with three multiple choice reading comprehension questions. The reading comprehension questions can be answered based on any of the three paragraph levels.
  • 라이선스 : Creative Commons Attribution-ShareAlike 4.0 국제 라이선스
  • 버전 : 1.1.0
  • 분할 :
나뉘다
'train' 1458
  • 특징 :
{
    "title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "paragraph": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "level": {
        "num_classes": 3,
        "names": [
            "Adv",
            "Int",
            "Ele"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    },
    "question": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "paragraph_index": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "answers": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": 4,
        "id": null,
        "_type": "Sequence"
    },
    "a_span": {
        "feature": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    },
    "d_span": {
        "feature": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}