break_data

Referências:

QDMR de alto nível

Use o seguinte comando para carregar esse conjunto de dados no TFDS:

ds = tfds.load('huggingface:break_data/QDMR-high-level')
  • Descrição :
Break is a human annotated dataset of natural language questions and their Question Decomposition Meaning Representations
(QDMRs). Break consists of 83,978 examples sampled from 10 question answering datasets over text, images and databases. 
This repository contains the Break dataset along with information on the exact data format.
  • Licença : Nenhuma licença conhecida
  • Versão : 1.0.0
  • Divisões :
Dividir Exemplos
'test' 3195
'train' 17503
'validation' 3130
  • Características :
{
    "question_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question_text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "decomposition": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "operators": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "split": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

QDMR-léxico de alto nível

Use o seguinte comando para carregar esse conjunto de dados no TFDS:

ds = tfds.load('huggingface:break_data/QDMR-high-level-lexicon')
  • Descrição :
Break is a human annotated dataset of natural language questions and their Question Decomposition Meaning Representations
(QDMRs). Break consists of 83,978 examples sampled from 10 question answering datasets over text, images and databases. 
This repository contains the Break dataset along with information on the exact data format.
  • Licença : Nenhuma licença conhecida
  • Versão : 1.0.0
  • Divisões :
Dividir Exemplos
'test' 3195
'train' 17503
'validation' 3130
  • Características :
{
    "source": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "allowed_tokens": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

QDMR

Use o seguinte comando para carregar esse conjunto de dados no TFDS:

ds = tfds.load('huggingface:break_data/QDMR')
  • Descrição :
Break is a human annotated dataset of natural language questions and their Question Decomposition Meaning Representations
(QDMRs). Break consists of 83,978 examples sampled from 10 question answering datasets over text, images and databases. 
This repository contains the Break dataset along with information on the exact data format.
  • Licença : Nenhuma licença conhecida
  • Versão : 1.0.0
  • Divisões :
Dividir Exemplos
'test' 8069
'train' 44321
'validation' 7760
  • Características :
{
    "question_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question_text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "decomposition": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "operators": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "split": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

QDMR-léxico

Use o seguinte comando para carregar esse conjunto de dados no TFDS:

ds = tfds.load('huggingface:break_data/QDMR-lexicon')
  • Descrição :
Break is a human annotated dataset of natural language questions and their Question Decomposition Meaning Representations
(QDMRs). Break consists of 83,978 examples sampled from 10 question answering datasets over text, images and databases. 
This repository contains the Break dataset along with information on the exact data format.
  • Licença : Nenhuma licença conhecida
  • Versão : 1.0.0
  • Divisões :
Dividir Exemplos
'test' 8069
'train' 44321
'validation' 7760
  • Características :
{
    "source": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "allowed_tokens": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

formas lógicas

Use o seguinte comando para carregar esse conjunto de dados no TFDS:

ds = tfds.load('huggingface:break_data/logical-forms')
  • Descrição :
Break is a human annotated dataset of natural language questions and their Question Decomposition Meaning Representations
(QDMRs). Break consists of 83,978 examples sampled from 10 question answering datasets over text, images and databases. 
This repository contains the Break dataset along with information on the exact data format.
  • Licença : Nenhuma licença conhecida
  • Versão : 1.0.0
  • Divisões :
Dividir Exemplos
'test' 8006
'train' 44098
'validation' 7719
  • Características :
{
    "question_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question_text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "decomposition": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "operators": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "split": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "program": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}