sem_eval_2018_task_1

Références:

subtask5.anglais

Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :

ds = tfds.load('huggingface:sem_eval_2018_task_1/subtask5.english')
  • Descriptif :
SemEval-2018 Task 1: Affect in Tweets: SubTask 5: Emotion Classification.
 This is a dataset for multilabel emotion classification for tweets.
 'Given a tweet, classify it as 'neutral or no emotion' or as one, or more, of eleven given emotions that best represent the mental state of the tweeter.'
 It contains 22467 tweets in three languages manually annotated by crowdworkers using Best–Worst Scaling.
  • Licence : Aucune licence connue
  • Version : 1.1.0
  • Fractionnements :
Diviser Exemples
'test' 3259
'train' 6838
'validation' 886
  • Caractéristiques :
{
    "ID": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "Tweet": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "anger": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "anticipation": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "disgust": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "fear": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "joy": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "love": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "optimism": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "pessimism": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "sadness": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "surprise": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "trust": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    }
}

sous-tâche5.espagnol

Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :

ds = tfds.load('huggingface:sem_eval_2018_task_1/subtask5.spanish')
  • Descriptif :
SemEval-2018 Task 1: Affect in Tweets: SubTask 5: Emotion Classification.
 This is a dataset for multilabel emotion classification for tweets.
 'Given a tweet, classify it as 'neutral or no emotion' or as one, or more, of eleven given emotions that best represent the mental state of the tweeter.'
 It contains 22467 tweets in three languages manually annotated by crowdworkers using Best–Worst Scaling.
  • Licence : Aucune licence connue
  • Version : 1.1.0
  • Fractionnements :
Diviser Exemples
'test' 2854
'train' 3561
'validation' 679
  • Caractéristiques :
{
    "ID": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "Tweet": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "anger": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "anticipation": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "disgust": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "fear": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "joy": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "love": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "optimism": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "pessimism": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "sadness": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "surprise": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "trust": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    }
}

sous-tâche5.arabe

Utilisez la commande suivante pour charger cet ensemble de données dans TFDS :

ds = tfds.load('huggingface:sem_eval_2018_task_1/subtask5.arabic')
  • Descriptif :
SemEval-2018 Task 1: Affect in Tweets: SubTask 5: Emotion Classification.
 This is a dataset for multilabel emotion classification for tweets.
 'Given a tweet, classify it as 'neutral or no emotion' or as one, or more, of eleven given emotions that best represent the mental state of the tweeter.'
 It contains 22467 tweets in three languages manually annotated by crowdworkers using Best–Worst Scaling.
  • Licence : Aucune licence connue
  • Version : 1.1.0
  • Fractionnements :
Diviser Exemples
'test' 1518
'train' 2278
'validation' 585
  • Caractéristiques :
{
    "ID": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "Tweet": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "anger": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "anticipation": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "disgust": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "fear": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "joy": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "love": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "optimism": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "pessimism": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "sadness": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "surprise": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "trust": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    }
}