break_data

Riferimenti:

QDMR-alto livello

Utilizzare il comando seguente per caricare questo set di dati in TFDS:

ds = tfds.load('huggingface:break_data/QDMR-high-level')
  • Descrizione :
Break is a human annotated dataset of natural language questions and their Question Decomposition Meaning Representations
(QDMRs). Break consists of 83,978 examples sampled from 10 question answering datasets over text, images and databases. 
This repository contains the Break dataset along with information on the exact data format.
  • Licenza : nessuna licenza conosciuta
  • Versione : 1.0.0
  • Divide :
Diviso Esempi
'test' 3195
'train' 17503
'validation' 3130
  • Caratteristiche :
{
    "question_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question_text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "decomposition": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "operators": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "split": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

QDMR-lessico di alto livello

Utilizzare il comando seguente per caricare questo set di dati in TFDS:

ds = tfds.load('huggingface:break_data/QDMR-high-level-lexicon')
  • Descrizione :
Break is a human annotated dataset of natural language questions and their Question Decomposition Meaning Representations
(QDMRs). Break consists of 83,978 examples sampled from 10 question answering datasets over text, images and databases. 
This repository contains the Break dataset along with information on the exact data format.
  • Licenza : nessuna licenza conosciuta
  • Versione : 1.0.0
  • Divide :
Diviso Esempi
'test' 3195
'train' 17503
'validation' 3130
  • Caratteristiche :
{
    "source": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "allowed_tokens": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

QDMR

Utilizzare il comando seguente per caricare questo set di dati in TFDS:

ds = tfds.load('huggingface:break_data/QDMR')
  • Descrizione :
Break is a human annotated dataset of natural language questions and their Question Decomposition Meaning Representations
(QDMRs). Break consists of 83,978 examples sampled from 10 question answering datasets over text, images and databases. 
This repository contains the Break dataset along with information on the exact data format.
  • Licenza : nessuna licenza conosciuta
  • Versione : 1.0.0
  • Divide :
Diviso Esempi
'test' 8069
'train' 44321
'validation' 7760
  • Caratteristiche :
{
    "question_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question_text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "decomposition": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "operators": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "split": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Lessico QDMR

Utilizzare il comando seguente per caricare questo set di dati in TFDS:

ds = tfds.load('huggingface:break_data/QDMR-lexicon')
  • Descrizione :
Break is a human annotated dataset of natural language questions and their Question Decomposition Meaning Representations
(QDMRs). Break consists of 83,978 examples sampled from 10 question answering datasets over text, images and databases. 
This repository contains the Break dataset along with information on the exact data format.
  • Licenza : nessuna licenza conosciuta
  • Versione : 1.0.0
  • Divide :
Diviso Esempi
'test' 8069
'train' 44321
'validation' 7760
  • Caratteristiche :
{
    "source": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "allowed_tokens": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

forme logiche

Utilizzare il comando seguente per caricare questo set di dati in TFDS:

ds = tfds.load('huggingface:break_data/logical-forms')
  • Descrizione :
Break is a human annotated dataset of natural language questions and their Question Decomposition Meaning Representations
(QDMRs). Break consists of 83,978 examples sampled from 10 question answering datasets over text, images and databases. 
This repository contains the Break dataset along with information on the exact data format.
  • Licenza : nessuna licenza conosciuta
  • Versione : 1.0.0
  • Divide :
Diviso Esempi
'test' 8006
'train' 44098
'validation' 7719
  • Caratteristiche :
{
    "question_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question_text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "decomposition": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "operators": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "split": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "program": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}