lc_quad

Riferimenti:

Utilizzare il comando seguente per caricare questo set di dati in TFDS:

ds = tfds.load('huggingface:lc_quad')
  • Descrizione :
LC-QuAD 2.0 is a Large Question Answering dataset with 30,000 pairs of question and its corresponding SPARQL query. The target knowledge base is Wikidata and DBpedia, specifically the 2018 version. Please see our paper for details about the dataset creation process and framework.
  • Licenza : nessuna licenza nota
  • Versione : 2.0.0
  • Spaccature :
Diviso Esempi
'test' 4781
'train' 19293
  • Caratteristiche :
{
    "NNQT_question": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "uid": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "subgraph": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "template_index": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "sparql_wikidata": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "sparql_dbpedia18": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "template": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "paraphrased_question": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}