Referências:
270 mil
Use o seguinte comando para carregar este conjunto de dados no TFDS:
ds = tfds.load('huggingface:interpress_news_category_tr/270k')
- Descrição :
It is a Turkish news data set consisting of 273601 news in 17 categories, compiled from print media and news websites between 2010 and 2017 by the Interpress (https://www.interpress.com/) media monitoring company.
- Licença : Nenhuma licença conhecida
- Versão : 1.0.0
- Divisões :
Dividir | Exemplos |
---|---|
'test' | 54721 |
'train' | 218880 |
- Características :
{
"id": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"content": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"category": {
"num_classes": 17,
"names": [
"aktuel",
"bilisim",
"egitim",
"ekonomi",
"gida",
"iletisim",
"kultursanat",
"magazin",
"saglik",
"savunma",
"seyahat",
"siyasi",
"spor",
"teknoloji",
"ticaret",
"turizm",
"yasam"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"categorycode": {
"num_classes": 17,
"names": [
"0",
"1",
"2",
"3",
"4",
"5",
"6",
"7",
"8",
"9",
"10",
"11",
"12",
"13",
"14",
"15",
"16"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"publishdatetime": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}