lc_quad

参照:

次のコマンドを使用して、このデータセットをTFDSにロードします。

ds = tfds.load('huggingface:lc_quad')
  • 説明
LC-QuAD 2.0 is a Large Question Answering dataset with 30,000 pairs of question and its corresponding SPARQL query. The target knowledge base is Wikidata and DBpedia, specifically the 2018 version. Please see our paper for details about the dataset creation process and framework.
  • ライセンス:既知のライセンスはありません
  • バージョン:2.0.0
  • 分割
スプリット
'test' 4781
'train' 19293
  • 特徴
{
    "NNQT_question": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "uid": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "subgraph": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "template_index": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "sparql_wikidata": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "sparql_dbpedia18": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "template": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "paraphrased_question": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}