interpress_news_category_tr

Referensi:

270k

Gunakan perintah berikut untuk memuat kumpulan data ini di TFDS:

ds = tfds.load('huggingface:interpress_news_category_tr/270k')
  • Keterangan :
It is a Turkish news data set consisting of 273601 news in 17 categories, compiled from print media and news websites between 2010 and 2017 by the Interpress (https://www.interpress.com/) media monitoring company.
  • Lisensi : Tidak ada lisensi yang diketahui
  • Versi : 1.0.0
  • Perpecahan :
Membelah Contoh
'test' 54721
'train' 218880
  • Fitur :
{
    "id": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "content": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "category": {
        "num_classes": 17,
        "names": [
            "aktuel",
            "bilisim",
            "egitim",
            "ekonomi",
            "gida",
            "iletisim",
            "kultursanat",
            "magazin",
            "saglik",
            "savunma",
            "seyahat",
            "siyasi",
            "spor",
            "teknoloji",
            "ticaret",
            "turizm",
            "yasam"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    },
    "categorycode": {
        "num_classes": 17,
        "names": [
            "0",
            "1",
            "2",
            "3",
            "4",
            "5",
            "6",
            "7",
            "8",
            "9",
            "10",
            "11",
            "12",
            "13",
            "14",
            "15",
            "16"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    },
    "publishdatetime": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}