ブレークデータ

参考文献:

QDMR-高レベル

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:break_data/QDMR-high-level')

説明：

Break is a human annotated dataset of natural language questions and their Question Decomposition Meaning Representations
(QDMRs). Break consists of 83,978 examples sampled from 10 question answering datasets over text, images and databases. 
This repository contains the Break dataset along with information on the exact data format.

ライセンス: 不明なライセンス
バージョン: 1.0.0
分割:

スプリット	例
`'test'`	3195
`'train'`	17503
`'validation'`	3130

特徴：

{
    "question_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question_text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "decomposition": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "operators": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "split": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

QDMR-高レベル辞書

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:break_data/QDMR-high-level-lexicon')

説明：

Break is a human annotated dataset of natural language questions and their Question Decomposition Meaning Representations
(QDMRs). Break consists of 83,978 examples sampled from 10 question answering datasets over text, images and databases. 
This repository contains the Break dataset along with information on the exact data format.

ライセンス: 不明なライセンス
バージョン: 1.0.0
分割:

スプリット	例
`'test'`	3195
`'train'`	17503
`'validation'`	3130

特徴：

{
    "source": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "allowed_tokens": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

QDMR

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:break_data/QDMR')

説明：

Break is a human annotated dataset of natural language questions and their Question Decomposition Meaning Representations
(QDMRs). Break consists of 83,978 examples sampled from 10 question answering datasets over text, images and databases. 
This repository contains the Break dataset along with information on the exact data format.

ライセンス: 不明なライセンス
バージョン: 1.0.0
分割:

スプリット	例
`'test'`	8069
`'train'`	44321
`'validation'`	7760

特徴：

{
    "question_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question_text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "decomposition": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "operators": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "split": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

QDMR-辞書

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:break_data/QDMR-lexicon')

説明：

Break is a human annotated dataset of natural language questions and their Question Decomposition Meaning Representations
(QDMRs). Break consists of 83,978 examples sampled from 10 question answering datasets over text, images and databases. 
This repository contains the Break dataset along with information on the exact data format.

ライセンス: 不明なライセンス
バージョン: 1.0.0
分割:

スプリット	例
`'test'`	8069
`'train'`	44321
`'validation'`	7760

特徴：

{
    "source": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "allowed_tokens": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

論理形式

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:break_data/logical-forms')

説明：

Break is a human annotated dataset of natural language questions and their Question Decomposition Meaning Representations
(QDMRs). Break consists of 83,978 examples sampled from 10 question answering datasets over text, images and databases. 
This repository contains the Break dataset along with information on the exact data format.

ライセンス: 不明なライセンス
バージョン: 1.0.0
分割:

スプリット	例
`'test'`	8006
`'train'`	44098
`'validation'`	7719

特徴：

{
    "question_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question_text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "decomposition": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "operators": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "split": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "program": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}