swedish_medical_ner

参考文献:

ウィキ

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:swedish_medical_ner/wiki')
  • 説明
SwedMedNER is a dataset for training and evaluating Named Entity Recognition systems on medical texts in Swedish.
It is derived from medical articles on the Swedish Wikipedia, Läkartidningen, and 1177 Vårdguiden.
  • ライセンス: Creative Commons Attribution-ShareAlike 4.0 International Public License (CC BY-SA 4.0) ライセンスの概要については、 http://creativecommons.org/licenses/by-sa/4.0/を参照してください。

  • バージョン: 1.0.0

  • 分割:

スプリット
'train' 48720
  • 特徴
{
    "sid": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "sentence": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "entities": {
        "feature": {
            "start": {
                "dtype": "int32",
                "id": null,
                "_type": "Value"
            },
            "end": {
                "dtype": "int32",
                "id": null,
                "_type": "Value"
            },
            "text": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            },
            "type": {
                "num_classes": 3,
                "names": [
                    "Disorder and Finding",
                    "Pharmaceutical Drug",
                    "Body Structure"
                ],
                "names_file": null,
                "id": null,
                "_type": "ClassLabel"
            }
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

それ

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:swedish_medical_ner/lt')
  • 説明
SwedMedNER is a dataset for training and evaluating Named Entity Recognition systems on medical texts in Swedish.
It is derived from medical articles on the Swedish Wikipedia, Läkartidningen, and 1177 Vårdguiden.
  • ライセンス: Creative Commons Attribution-ShareAlike 4.0 International Public License (CC BY-SA 4.0) ライセンスの概要については、 http://creativecommons.org/licenses/by-sa/4.0/を参照してください。

  • バージョン: 1.0.0

  • 分割:

スプリット
'train' 745753
  • 特徴
{
    "sid": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "sentence": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "entities": {
        "feature": {
            "start": {
                "dtype": "int32",
                "id": null,
                "_type": "Value"
            },
            "end": {
                "dtype": "int32",
                "id": null,
                "_type": "Value"
            },
            "text": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            },
            "type": {
                "num_classes": 3,
                "names": [
                    "Disorder and Finding",
                    "Pharmaceutical Drug",
                    "Body Structure"
                ],
                "names_file": null,
                "id": null,
                "_type": "ClassLabel"
            }
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

1177

次のコマンドを使用して、このデータセットを TFDS にロードします。

ds = tfds.load('huggingface:swedish_medical_ner/1177')
  • 説明
SwedMedNER is a dataset for training and evaluating Named Entity Recognition systems on medical texts in Swedish.
It is derived from medical articles on the Swedish Wikipedia, Läkartidningen, and 1177 Vårdguiden.
  • ライセンス: Creative Commons Attribution-ShareAlike 4.0 International Public License (CC BY-SA 4.0) ライセンスの概要については、 http://creativecommons.org/licenses/by-sa/4.0/を参照してください。

  • バージョン: 1.0.0

  • 分割:

スプリット
'train' 927
  • 特徴
{
    "sid": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "sentence": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "entities": {
        "feature": {
            "start": {
                "dtype": "int32",
                "id": null,
                "_type": "Value"
            },
            "end": {
                "dtype": "int32",
                "id": null,
                "_type": "Value"
            },
            "text": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            },
            "type": {
                "num_classes": 3,
                "names": [
                    "Disorder and Finding",
                    "Pharmaceutical Drug",
                    "Body Structure"
                ],
                "names_file": null,
                "id": null,
                "_type": "ClassLabel"
            }
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}