sem_eval_2018_task_1

อ้างอิง:

งานย่อย5.ภาษาอังกฤษ

ใช้คำสั่งต่อไปนี้เพื่อโหลดชุดข้อมูลนี้ใน TFDS:

ds = tfds.load('huggingface:sem_eval_2018_task_1/subtask5.english')
  • คำอธิบาย :
SemEval-2018 Task 1: Affect in Tweets: SubTask 5: Emotion Classification.
 This is a dataset for multilabel emotion classification for tweets.
 'Given a tweet, classify it as 'neutral or no emotion' or as one, or more, of eleven given emotions that best represent the mental state of the tweeter.'
 It contains 22467 tweets in three languages manually annotated by crowdworkers using Best–Worst Scaling.
  • ใบอนุญาต : ไม่มีใบอนุญาตที่รู้จัก
  • เวอร์ชั่น : 1.1.0
  • แยก :
แยก ตัวอย่าง
'test' 3259
'train' 6838
'validation' 886
  • คุณสมบัติ :
{
    "ID": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "Tweet": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "anger": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "anticipation": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "disgust": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "fear": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "joy": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "love": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "optimism": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "pessimism": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "sadness": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "surprise": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "trust": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    }
}

งานย่อย5.ภาษาสเปน

ใช้คำสั่งต่อไปนี้เพื่อโหลดชุดข้อมูลนี้ใน TFDS:

ds = tfds.load('huggingface:sem_eval_2018_task_1/subtask5.spanish')
  • คำอธิบาย :
SemEval-2018 Task 1: Affect in Tweets: SubTask 5: Emotion Classification.
 This is a dataset for multilabel emotion classification for tweets.
 'Given a tweet, classify it as 'neutral or no emotion' or as one, or more, of eleven given emotions that best represent the mental state of the tweeter.'
 It contains 22467 tweets in three languages manually annotated by crowdworkers using Best–Worst Scaling.
  • ใบอนุญาต : ไม่มีใบอนุญาตที่รู้จัก
  • เวอร์ชั่น : 1.1.0
  • แยก :
แยก ตัวอย่าง
'test' 2854
'train' 3561
'validation' 679
  • คุณสมบัติ :
{
    "ID": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "Tweet": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "anger": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "anticipation": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "disgust": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "fear": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "joy": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "love": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "optimism": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "pessimism": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "sadness": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "surprise": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "trust": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    }
}

งานย่อย5.ภาษาอาหรับ

ใช้คำสั่งต่อไปนี้เพื่อโหลดชุดข้อมูลนี้ใน TFDS:

ds = tfds.load('huggingface:sem_eval_2018_task_1/subtask5.arabic')
  • คำอธิบาย :
SemEval-2018 Task 1: Affect in Tweets: SubTask 5: Emotion Classification.
 This is a dataset for multilabel emotion classification for tweets.
 'Given a tweet, classify it as 'neutral or no emotion' or as one, or more, of eleven given emotions that best represent the mental state of the tweeter.'
 It contains 22467 tweets in three languages manually annotated by crowdworkers using Best–Worst Scaling.
  • ใบอนุญาต : ไม่มีใบอนุญาตที่รู้จัก
  • เวอร์ชั่น : 1.1.0
  • แยก :
แยก ตัวอย่าง
'test' 1518
'train' 2278
'validation' 585
  • คุณสมบัติ :
{
    "ID": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "Tweet": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "anger": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "anticipation": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "disgust": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "fear": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "joy": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "love": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "optimism": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "pessimism": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "sadness": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "surprise": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "trust": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    }
}