sem_eval_2018_task_1

References:

subtask5.english

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:sem_eval_2018_task_1/subtask5.english')
  • Description:
SemEval-2018 Task 1: Affect in Tweets: SubTask 5: Emotion Classification.
 This is a dataset for multilabel emotion classification for tweets.
 "Given a tweet, classify it as 'neutral or no emotion' or as one, or more, of eleven given emotions that best represent the mental state of the tweeter."
 It contains 22467 tweets in three languages manually annotated by crowdworkers using Best–Worst Scaling.
  • License: No known license
  • Version: 1.1.0
  • Splits:
Split         Examples
'test'        3259
'train'       6838
'validation'  886
  • Features:
{
    "ID": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "Tweet": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "anger": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "anticipation": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "disgust": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "fear": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "joy": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "love": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "optimism": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "pessimism": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "sadness": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "surprise": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "trust": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    }
}
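
The feature spec above stores each of the eleven emotions as a separate boolean column, while multilabel classifiers usually expect one fixed-length target vector per example. The following is a minimal sketch of that conversion in plain Python. The helper name `to_multi_hot`, the column ordering, and the sample record are assumptions made for illustration; they are not part of the dataset or of TFDS.

```python
# The eleven emotion columns, in the order they appear in the feature spec.
EMOTIONS = [
    "anger", "anticipation", "disgust", "fear", "joy", "love",
    "optimism", "pessimism", "sadness", "surprise", "trust",
]


def to_multi_hot(example):
    """Return the emotion flags of one example as a list of 0/1 integers."""
    return [int(bool(example[name])) for name in EMOTIONS]


# Hypothetical record: the ID and tweet text are made up, not taken from the dataset.
sample = {
    "ID": "example-0001",
    "Tweet": "Can't wait for the weekend!",
    **{name: False for name in EMOTIONS},  # start with every flag off
    "anticipation": True,                  # then switch on two emotions
    "joy": True,
}

labels = to_multi_hot(sample)
print(labels)
```

The same function should also work on examples yielded by `tfds.as_numpy(ds['train'])`, assuming the boolean columns arrive as NumPy booleans, since `bool()` accepts those as well.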

subtask5.spanish

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:sem_eval_2018_task_1/subtask5.spanish')
  • Description:
SemEval-2018 Task 1: Affect in Tweets: SubTask 5: Emotion Classification.
 This is a dataset for multilabel emotion classification for tweets.
 "Given a tweet, classify it as 'neutral or no emotion' or as one, or more, of eleven given emotions that best represent the mental state of the tweeter."
 It contains 22467 tweets in three languages manually annotated by crowdworkers using Best–Worst Scaling.
  • License: No known license
  • Version: 1.1.0
  • Splits:
Split         Examples
'test'        2854
'train'       3561
'validation'  679
  • Features:
{
    "ID": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "Tweet": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "anger": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "anticipation": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "disgust": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "fear": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "joy": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "love": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "optimism": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "pessimism": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "sadness": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "surprise": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "trust": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    }
}
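
The task description allows a tweet to be "neutral or no emotion", but the feature spec has no dedicated column for that case. One plausible reading, sketched below, is that a neutral tweet is an example whose eleven emotion flags are all false. That encoding is an assumption and is not confirmed by this page; the helper names `active_emotions` and `is_neutral` and both sample records are hypothetical.

```python
# The eleven emotion columns listed in the feature spec.
EMOTIONS = (
    "anger", "anticipation", "disgust", "fear", "joy", "love",
    "optimism", "pessimism", "sadness", "surprise", "trust",
)


def active_emotions(example):
    """Return the names of the emotions flagged True for one example."""
    return [name for name in EMOTIONS if example[name]]


def is_neutral(example):
    """Assumed encoding: an example is neutral when no emotion flag is set."""
    return not active_emotions(example)


# Hypothetical records; IDs and texts are made up, not taken from the dataset.
all_off = {name: False for name in EMOTIONS}
neutral_example = {"ID": "example-0002", "Tweet": "Reunión a las diez.", **all_off}
sad_example = {**neutral_example, "ID": "example-0003", "sadness": True, "pessimism": True}

print(is_neutral(neutral_example), active_emotions(neutral_example))
print(is_neutral(sad_example), active_emotions(sad_example))
```

Counting how many examples in a split satisfy `is_neutral` is a quick way to check whether this reading matches the data before relying on it.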

subtask5.arabic

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:sem_eval_2018_task_1/subtask5.arabic')
  • Description:
SemEval-2018 Task 1: Affect in Tweets: SubTask 5: Emotion Classification.
 This is a dataset for multilabel emotion classification for tweets.
 "Given a tweet, classify it as 'neutral or no emotion' or as one, or more, of eleven given emotions that best represent the mental state of the tweeter."
 It contains 22467 tweets in three languages manually annotated by crowdworkers using Best–Worst Scaling.
  • License: No known license
  • Version: 1.1.0
  • Splits:
Split         Examples
'test'        1518
'train'       2278
'validation'  585
  • Features:
{
    "ID": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "Tweet": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "anger": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "anticipation": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "disgust": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "fear": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "joy": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "love": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "optimism": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "pessimism": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "sadness": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "surprise": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "trust": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    }
}
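
The split tables for the three configurations can be compared more easily as totals and proportions. The sketch below copies the example counts listed on this page into a dictionary and computes, for each configuration, the total number of examples and the share held by each split. The names `SPLITS` and `split_fractions` are illustrative only.

```python
# Example counts per split, copied from the split tables on this page.
SPLITS = {
    "subtask5.english": {"train": 6838, "validation": 886, "test": 3259},
    "subtask5.spanish": {"train": 3561, "validation": 679, "test": 2854},
    "subtask5.arabic": {"train": 2278, "validation": 585, "test": 1518},
}


def split_fractions(sizes):
    """Return each split's share of the configuration's total example count."""
    total = sum(sizes.values())
    return {name: count / total for name, count in sizes.items()}


# Total number of examples per configuration.
totals = {config: sum(sizes.values()) for config, sizes in SPLITS.items()}

for config, sizes in SPLITS.items():
    fractions = split_fractions(sizes)
    summary = ", ".join(f"{name} {share:.1%}" for name, share in fractions.items())
    print(f"{config}: {totals[config]} examples ({summary})")
```

Because the test splits are large relative to the training splits, especially for Spanish and Arabic, it may be worth checking these proportions before deciding how to use the validation data.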