xed_en_fi

References:

en_annotated

Use the following command to load this dataset in TFDS:

import tensorflow_datasets as tfds
ds = tfds.load('huggingface:xed_en_fi/en_annotated')

Description:

A multilingual fine-grained emotion dataset. The dataset consists of human annotated Finnish (25k) and English sentences (30k). Plutchik’s
core emotions are used to annotate the dataset with the addition of neutral to create a multilabel multiclass
dataset. The dataset is carefully evaluated using language-specific BERT models and SVMs to
show that XED performs on par with other similar datasets and is therefore a useful tool for
sentiment analysis and emotion detection.

License: Creative Commons Attribution 4.0 International License (CC-BY)
Version: 1.1.0
Splits:

Split	Examples
`'train'`	17528
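
Because only a `'train'` split is listed, a validation set has to be carved out by the user. The following is a minimal sketch using the TFDS split-slicing syntax; the helper names `holdout_splits` and `load_with_holdout` and the 90/10 ratio are illustrative assumptions, and it has not been verified here that slicing behaves identically for every `huggingface:` community dataset.

```python
def holdout_splits(train_percent=90):
    """Build TFDS slicing strings that carve a validation set out of 'train'."""
    if not 0 < train_percent < 100:
        raise ValueError("train_percent must be between 1 and 99")
    return f"train[:{train_percent}%]", f"train[{train_percent}%:]"


def load_with_holdout(name="huggingface:xed_en_fi/en_annotated", train_percent=90):
    """Load both slices; needs tensorflow_datasets and network access."""
    # Imported lazily so holdout_splits() stays dependency-free.
    import tensorflow_datasets as tfds

    train_spec, val_spec = holdout_splits(train_percent)
    # Passing a list of split specs returns one dataset per spec.
    return tfds.load(name, split=[train_spec, val_spec])


print(holdout_splits(90))
```

Percentage slices are deterministic, so the same two subsets are returned on every run.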

Features:

{
    "sentence": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 9,
            "names": [
                "neutral",
                "anger",
                "anticipation",
                "disgust",
                "fear",
                "joy",
                "sadness",
                "surprise",
                "trust"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}
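
The `labels` feature above is a variable-length sequence of class IDs, so one sentence can carry several emotions. A minimal sketch of decoding such IDs back to names and converting them to a fixed-size multi-hot vector follows; the helper names `decode_labels` and `to_multi_hot` and the sample IDs are illustrative assumptions, not part of the dataset or the TFDS API.

```python
# Label names in the order given by the feature spec (index == class ID).
LABEL_NAMES = [
    "neutral", "anger", "anticipation", "disgust", "fear",
    "joy", "sadness", "surprise", "trust",
]


def decode_labels(label_ids):
    """Map a sequence of integer class IDs to their emotion names."""
    return [LABEL_NAMES[i] for i in label_ids]


def to_multi_hot(label_ids, num_classes=len(LABEL_NAMES)):
    """Convert a variable-length list of class IDs to a 0/1 vector."""
    vector = [0] * num_classes
    for i in label_ids:
        vector[i] = 1
    return vector


# Hypothetical example: a sentence annotated with anger (1) and fear (4).
example_ids = [1, 4]
print(decode_labels(example_ids))
print(to_multi_hot(example_ids))
```

A multi-hot vector of length 9 is a common target format for multilabel classifiers, since it gives every example the same shape regardless of how many emotions were annotated.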

en_neutral

Use the following command to load this dataset in TFDS:

import tensorflow_datasets as tfds
ds = tfds.load('huggingface:xed_en_fi/en_neutral')

Description:

A multilingual fine-grained emotion dataset. The dataset consists of human annotated Finnish (25k) and English sentences (30k). Plutchik’s
core emotions are used to annotate the dataset with the addition of neutral to create a multilabel multiclass
dataset. The dataset is carefully evaluated using language-specific BERT models and SVMs to
show that XED performs on par with other similar datasets and is therefore a useful tool for
sentiment analysis and emotion detection.

License: Creative Commons Attribution 4.0 International License (CC-BY)
Version: 1.1.0
Splits:

Split	Examples
`'train'`	9675

Features:

{
    "sentence": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "num_classes": 9,
        "names": [
            "neutral",
            "anger",
            "anticipation",
            "disgust",
            "fear",
            "joy",
            "sadness",
            "surprise",
            "trust"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}
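
In this config `labels` is a single `ClassLabel`, whereas the annotated configs store a sequence of class IDs. Code that handles both kinds of config may want to normalize the two shapes first. The sketch below shows one way to do that; `as_label_list` is a hypothetical helper name, not something provided by TFDS.

```python
def as_label_list(labels):
    """Return labels as a list of ints, given one class ID or a sequence of them."""
    try:
        # Sequence case (annotated configs): iterate over the IDs.
        return [int(i) for i in labels]
    except TypeError:
        # Scalar case (neutral configs): wrap the single ID in a list.
        return [int(labels)]


print(as_label_list(0))       # single ClassLabel, as in the neutral configs
print(as_label_list([1, 4]))  # sequence of labels, as in the annotated configs
```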

fi_annotated

Use the following command to load this dataset in TFDS:

import tensorflow_datasets as tfds
ds = tfds.load('huggingface:xed_en_fi/fi_annotated')

Description:

A multilingual fine-grained emotion dataset. The dataset consists of human annotated Finnish (25k) and English sentences (30k). Plutchik’s
core emotions are used to annotate the dataset with the addition of neutral to create a multilabel multiclass
dataset. The dataset is carefully evaluated using language-specific BERT models and SVMs to
show that XED performs on par with other similar datasets and is therefore a useful tool for
sentiment analysis and emotion detection.

License: Creative Commons Attribution 4.0 International License (CC-BY)
Version: 1.1.0
Splits:

Split	Examples
`'train'`	14449

Features:

{
    "sentence": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 9,
            "names": [
                "neutral",
                "anger",
                "anticipation",
                "disgust",
                "fear",
                "joy",
                "sadness",
                "surprise",
                "trust"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

fi_neutral

Use the following command to load this dataset in TFDS:

import tensorflow_datasets as tfds
ds = tfds.load('huggingface:xed_en_fi/fi_neutral')

Description:

A multilingual fine-grained emotion dataset. The dataset consists of human annotated Finnish (25k) and English sentences (30k). Plutchik’s
core emotions are used to annotate the dataset with the addition of neutral to create a multilabel multiclass
dataset. The dataset is carefully evaluated using language-specific BERT models and SVMs to
show that XED performs on par with other similar datasets and is therefore a useful tool for
sentiment analysis and emotion detection.

License: Creative Commons Attribution 4.0 International License (CC-BY)
Version: 1.1.0
Splits:

Split	Examples
`'train'`	10794

Features:

{
    "sentence": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "num_classes": 9,
        "names": [
            "neutral",
            "anger",
            "anticipation",
            "disgust",
            "fear",
            "joy",
            "sadness",
            "surprise",
            "trust"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}