interpress_news_category_tr

Referências:

270 mil

Use o seguinte comando para carregar este conjunto de dados no TFDS:

ds = tfds.load('huggingface:interpress_news_category_tr/270k')
  • Descrição :
It is a Turkish news data set consisting of 273601 news in 17 categories, compiled from print media and news websites between 2010 and 2017 by the Interpress (https://www.interpress.com/) media monitoring company.
  • Licença : Nenhuma licença conhecida
  • Versão : 1.0.0
  • Divisões :
Dividir Exemplos
'test' 54721
'train' 218880
  • Características :
{
    "id": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "content": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "category": {
        "num_classes": 17,
        "names": [
            "aktuel",
            "bilisim",
            "egitim",
            "ekonomi",
            "gida",
            "iletisim",
            "kultursanat",
            "magazin",
            "saglik",
            "savunma",
            "seyahat",
            "siyasi",
            "spor",
            "teknoloji",
            "ticaret",
            "turizm",
            "yasam"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    },
    "categorycode": {
        "num_classes": 17,
        "names": [
            "0",
            "1",
            "2",
            "3",
            "4",
            "5",
            "6",
            "7",
            "8",
            "9",
            "10",
            "11",
            "12",
            "13",
            "14",
            "15",
            "16"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    },
    "publishdatetime": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}